Gene Clim_2439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2439 
Symbol 
ID6355910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2673177 
End bp2674508 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content53% 
IMG OID642670029 
Productprotein of unknown function DUF21 
Protein accessionYP_001944439 
Protein GI189347910 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAT TTTTTCTTCT TTTTCTCATC ATTCTCAACG GCCTGTTCGC CATGTCGGAG 
ATCGCACTGA TAACGGCAAA GCGATCCAGG CTGCAGAGGC TTGCCGCTGA GGGCGATAAA
GCAGCTGATG TTGCACTCAA GCTCGGTCAG GAGCCGACAC GATTTCTTTC GACCATACAG
ATCGGCATTA CTTCGATCGG TATTCTTAAC GGTATCGTCG GTGAGAATGC CCTTGCCGAA
CCGTTTTCAC TCTGGCTGCG CTCTCTTGGA ATGGAGAGTG AGATCAGCAG AATTCTCTCG
ACAGCCCTGA TTGTCGTTTC CATAACCTAT GTGACTATCG TCATTGGTGA GCTGGTACCT
AAAAGACTCG GCCAGTTCAA TCCCGAAGGT ATTGCAAGGC TTGTTTCCCG ACCCATGCTC
GCCCTTGGAA TGCTTACCCG TCCTTTTGTC CGTCTGCTTT CGTTTTCCAC CGATACGATA
CTTCGTCTGA TGGGAAAAAA TCCGCATGCT TCGACGAGTG TAACCGAAGA GGAGATTCAC
GCCATGCTCG AGGAGGGTTC GGAGGCAGGG ATTATCGAAC AGCAGGAGCA TGAAATGGTG
CGCAACGTTT TCAGGCTGGA CGACCGGCAG CTTGGAACCC TTATGGTGCC GAGGGCTGAT
ATCGTTTTTC TTGATGTGGC CCTTCCGCTG GAAGAGAATA TCGATCGGGT GACCGGTTCT
GAACATTCCC GTTTTCCTGT CTGTCAGGGG GGGCTGCAGT CTCTGCTCGG CGTGGTCAAT
GCCAAACAGC TCCTGGCGCA GACGCTTAAA GGGGGGCTTA CGGATTTCGC TGCACAGCTT
CAGCCCTGCG TCTATGTGCC TGAAACCCTG ACGGGAATGG AGCTGCTCGA GCATTTCAGG
CTCTCGGGAA CCCAGATGGT GTTTGTCGTT GACGAGTACG GAGAAATTCA GGGGCTGGTG
ACCATGCAGG ATCTTCTGGA AGCGGTGACC GGCGAGTTTG TTCCCCGTAA TCTCGAAGAT
TCATGGGCAG TGCAGCGAGA AGATGGCTCC TGGCTGCTTG ACGGAATGAT TCCCGTTCCC
GAACTGAAGG ATTCGCTTGA TCTGAAAAGC GTTCCTGAAG AGGATAAAGG GCTTTACCAT
ACGCTGAGCG GACTTCTTAT GTGGCTTCTC GGCAGAATGC CCGTTACCGG GGATGTAACG
GAATGGGAGG GATGGAGACT GGAGGTCATC GATCTCGATG GCAAGCGGAT CGACAAGGTT
CTGGCATCTC CACTCAATGG AGAGTCTGCG TCAGCGGATT CCGGAAATGC AGCCCGCAGT
TCGGAAGGGT AA
 
Protein sequence
MEIFFLLFLI ILNGLFAMSE IALITAKRSR LQRLAAEGDK AADVALKLGQ EPTRFLSTIQ 
IGITSIGILN GIVGENALAE PFSLWLRSLG MESEISRILS TALIVVSITY VTIVIGELVP
KRLGQFNPEG IARLVSRPML ALGMLTRPFV RLLSFSTDTI LRLMGKNPHA STSVTEEEIH
AMLEEGSEAG IIEQQEHEMV RNVFRLDDRQ LGTLMVPRAD IVFLDVALPL EENIDRVTGS
EHSRFPVCQG GLQSLLGVVN AKQLLAQTLK GGLTDFAAQL QPCVYVPETL TGMELLEHFR
LSGTQMVFVV DEYGEIQGLV TMQDLLEAVT GEFVPRNLED SWAVQREDGS WLLDGMIPVP
ELKDSLDLKS VPEEDKGLYH TLSGLLMWLL GRMPVTGDVT EWEGWRLEVI DLDGKRIDKV
LASPLNGESA SADSGNAARS SEG