Gene Clim_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1019 
Symbol 
ID6355468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1116425 
End bp1118143 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content59% 
IMG OID642668642 
ProductTonB-dependent receptor 
Protein accessionYP_001943073 
Protein GI189346544 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.463707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGGTG TCCGGGTTAT GCTGCTGGTT ATCGCGCTGA TCATGGGGGG CGTCATGCAA 
GGACGGCTCC TGGCGGCGGG CAACGACGAT ATCGTTGCCA TAAGCGCGTC CGAACTCGAA
GCCACGGACG CCACGGATGT CGCCGAGCTG CTCAATCGCA TCCCCGGCGT CAAGGCGAGC
GAATCGTCGG TCTCCATACG GGGCTCTTCC AACGTGAAGG TGCTGCTCGA CGGGCGGCCG
ATAAACGATC CGACATCGCA TTCGGGATCG GTGAAATGGT CGATGATCTC CCTCTCCGGC
ATCGAAAAAA TCGTGATCCA CAAAGGGAGG GGGAGCGTCT CGTACGGCGA CAATACGGAG
GGCGGAGTTA TCGTCATCAC CTCGAAAAAG GCCAGCCGCA TAGGAGGAAT GGTCGGCGTC
GGCGCCGGCA ATAACGGGGA AAAGCATGCC GATATCAACC TGCAGGGCCG TTTCGACCGT
TTTGCCGCGA ACCTGACTGC GGGAGCGAAG GGGTACGACG GATTTACGGT CAACGACGAC
AAGCGGGAGT ATCGCGCGGG GTTGCGTCTC GATTACGCTC CGCTTGAAGG CACCTCGCTG
TTCCTTTCCG GCGACTATAG CACGCAGGAG AAGGGGATGC GGGGGTACCC CGGAAGCCGG
ACGCCGAACG CCCGAATGGG GTACGACGAC TGGTCGCTGC TTTTCGGCGT CAGCCGCAAT
ACCCTTGACG GGCGGGCCTG GTTCCGTAAA ACCCTGACGC AGAACAGCGA TTCCGACAGG
GACTTCTTTT CCGGTCTGGA GGTGCTCTCT GCCGGCATGA GTGTAGACGG ACCGGTCAGC
CTGCCCCTTG CCGGATCGCT GAAGGCCGGT TTCGGCTACG AATGGCAGTC GGCAAGCGGC
AGCGGGTTCG GCGCGAAAGA GGAGCGGCGG GGCTGGCTTC ATCTTACCAG GCTTTTTCGG
CAGAAGGATG GCCCGTGGTC GGCCGATGTC GGCATCCGTG AGAACATCTA CTCCGCTTTC
CACAATACCC TGAATCCGGA GGTGAAGGTT GCGTGGAAAA GGAAGCCGTG GAGAGTGGAG
CTGACGGCCG GCGAGACAAA TAACCTGCCG ACCTTCAGGC AGCGCTATAA CGAGACATCG
ACCACCCGTC CCAATCCCGA TCTCGAGATG GAGCAGGCAT TGAACACCGG TTGCTCGGTG
TCGTTCGCTC CCTCGGAAAA GCTCAGCGCC GAACTGTCGT TTTTTCATCG GGACATCACC
GATCGCATCA CCTATGTGCG TGCTTCGGAC AATACCGGAA AGTACGAGAA TTTCGGAGAG
GTCATCTATC AGGGAGTGGA GGCTTCGCTT TCCTGGAAAC CGTCGCCATG GATCGAGTTT
ACGCCTTCGT ACCTGTATCT TCACGCCCGC AACGAAGAGA CCGGCCTCTG GCTGCCCGCC
ACTGCCTTCC ACACCGTTTC GGGAGAGCTT CTCCTGAAAC CGGCAGCCGG GCTCTCCATC
AGGACGGATG TGAAATATAC CGGGAAGGTT TTCGCGAGAA CCGACAACAC CGAGACCATT
GCGGGTTATC TCGTGGCTGC TCTCAGGGTC GATTACCGGA CAGGAGCGGC GCAGTTCTTC
GTCGATATCG ACAATCTGTT CGATATCGAA TATCTCTATG CCGACGGTTA TGATGCCCCG
CCCCGCGAGT GGGAGATCGG CATGAATTAC ACCTTCTGA
 
Protein sequence
MNGVRVMLLV IALIMGGVMQ GRLLAAGNDD IVAISASELE ATDATDVAEL LNRIPGVKAS 
ESSVSIRGSS NVKVLLDGRP INDPTSHSGS VKWSMISLSG IEKIVIHKGR GSVSYGDNTE
GGVIVITSKK ASRIGGMVGV GAGNNGEKHA DINLQGRFDR FAANLTAGAK GYDGFTVNDD
KREYRAGLRL DYAPLEGTSL FLSGDYSTQE KGMRGYPGSR TPNARMGYDD WSLLFGVSRN
TLDGRAWFRK TLTQNSDSDR DFFSGLEVLS AGMSVDGPVS LPLAGSLKAG FGYEWQSASG
SGFGAKEERR GWLHLTRLFR QKDGPWSADV GIRENIYSAF HNTLNPEVKV AWKRKPWRVE
LTAGETNNLP TFRQRYNETS TTRPNPDLEM EQALNTGCSV SFAPSEKLSA ELSFFHRDIT
DRITYVRASD NTGKYENFGE VIYQGVEASL SWKPSPWIEF TPSYLYLHAR NEETGLWLPA
TAFHTVSGEL LLKPAAGLSI RTDVKYTGKV FARTDNTETI AGYLVAALRV DYRTGAAQFF
VDIDNLFDIE YLYADGYDAP PREWEIGMNY TF