Gene Clim_1064 details

Gene Information

Locus tag: Clim_1064
Symbol:
ID: 6353766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1166075
End bp: 1168060
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 49%
IMG OID: 642668681
Product: TonB-dependent receptor
Protein accession: YP_001943112
Protein GI: 189346583
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
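
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1166075
end_bp = 1168060
gene_length_bp = 1986
protein_length_aa = 661


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(start_bp, end_bp))            # 1986
print(expected_protein_length(gene_length_bp))  # 661
```

Both results agree with the Gene Length and Protein Length fields of the record.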


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00188221
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AAGTATGCCT GCTTGTGCTG GCCGGGCTGC TCTGCAGCAG GGGGCTTCTT 
GCCGAAGAGA CGACGAAAAG CTTTACCGGC AGTGAACTGG TCGTTACCTC GAGCCGCGTC
GAGGAAGAAA AGAAAAATGT AACAACGAAC ATTACCGTTA TCAGCAAAGA GGAGATCAAA
CAGTCGTCGG CAAAGGATCT CGGAGACCTG CTTGCGGAAA AAAATCTCGG AACGGTCCAC
AAATACCCTG GCACATTGAC AAGTATTGGA ATCAGGGGTT TCAGGACCGA GTCGCATGGC
AATGATCTCC AGGGAAAAGT GCTCGTGCTT CTCAACGGCC GCAGGGCCGG CACCGGAAAC
CTTGCCAAGA TTGCCGTTGG TGAGATCGAT CGTATCGAAA TTATTCACGG CCCGGCAGCG
GTCCAGTATG GAACGGCAGC TATCGGAGGC GTTATCAATG TGATTACTGC AAGGGGATCC
GGAGAACCCG GACTGTTTTT TGCTCAGGAG CTTGGCAGCA GCGATTATAC CCGGACAACG
CTTGGTACAT CGGGCAAAAT CGGCAATCTT GATTTTTCAG GAAGCGTTTC GCTTTCTGAA
ATGGGCGACT ATAAAACCGG ATCGGGAAAA ACATACTACA ACACGGCATA TGACGATCAG
ACCTCCGGCA GCCTGAATAT CGGCTACGAG TTTACACCTG GGCACAGAGT CGGAGTGAAC
TACACCTATT TTAATGTGGG TGATGGAGGT TCTCCTTACT ATTTGAGCCA GAACGATCTT
GACGACTGGT TCGAGAAGGA GAATTATTCG ACGGATATCG TGTACGAGGG ACGGACTGCT
GACAGCAGGT TATCCTGGAT GGCCAGATAT TTTACCGGTC GCGACTATGA TGTCCAGTAC
GATCCGACCG GAAGCAACCA GGGTTGGGAT GATGATATCC CGTACACGTC CAAAGTCGAT
CACAAGGGAG CCCAGGCACA GTTGACATAT AATTATGACT ATTTTCGTGC TACAGCAGGC
ATAGACTGGC TCAATTACGA AGAGACCACG ACTCCTTATG CTCCGTATAA ATCGGAATAT
GATAATATGG CGGAGTTTCT CCTGCTCAAT GGATTTCTAT TCGACAAACG GCTTGTACTC
TCTGCTGGAT TTCGTTACGA TACGTATGAC TTGACCTCCC AGGGTTATGA GGATTCGTCT
GAATCAGACC GCGATGATGA TAACTTCGTG ATGAATTACG GTGTTGCATG GCATGTAACT
GATGGTATCA AACTGAGGGC ATCCTATGCT GAAGGGTTCA AGATGCCGGC ATCAAAGGAG
CTGGCTGCAG ATTACTATAT TTCTACTACC CATTATGTCG GAAATGCCGA TCTCAAGCCT
GAAGAGAGCA CAACCTGCGA AATCGGCGTT GATGTTGCCG ATAACCGGTT CGCTTCCTCG
TTAACCTGGT TCACTACCGA TTTTAAAAAC AAGATTCAGT CGGTCAGTCT CGGGAGCGGC
GTTTCGTCAT GGGAAAATCT TTACGGGGCA AAGATTTCAG GATTCGAAGG CGAAGTGTCG
TATGCTTTTG AGCCGTTTGG CAACAACTGG CAGTTCAGCC CCTATGCCAG TTTTGTTTAT
CTGACGGAAT TCGAGGACGA CGGAACAGGA GATCGTCTGC TCTATACTCC GGAATGGAAT
GCTACTGTAG GTTTGAGGGT CAACGACCAG AGAGGATTCA GTGGTATGTT CAATCTTGCA
TATACGGGAG AGTCGGATAT ACAGGATTGG GAGACGTCGT GGGCTGGAAC GGTAATTACA
AAGGGCGGTT TTGCGGTTGC AAATCTGACT GCATCAAAAA AATTCATGCT CAGCGAAAAG
AAAAGCGGTC GAGCTCTTAC AATCAAGGGG GAGGTCAACA ATCTCTTCGA TCGCGATTAC
GAGTTCGTAA AGGGTTATCC GATGCCTGGC CGTTCGTTTG CCGTTGGCGT GAGGGTTGAT
ATCTGA
 
Protein sequence
MNKKVCLLVL AGLLCSRGLL AEETTKSFTG SELVVTSSRV EEEKKNVTTN ITVISKEEIK 
QSSAKDLGDL LAEKNLGTVH KYPGTLTSIG IRGFRTESHG NDLQGKVLVL LNGRRAGTGN
LAKIAVGEID RIEIIHGPAA VQYGTAAIGG VINVITARGS GEPGLFFAQE LGSSDYTRTT
LGTSGKIGNL DFSGSVSLSE MGDYKTGSGK TYYNTAYDDQ TSGSLNIGYE FTPGHRVGVN
YTYFNVGDGG SPYYLSQNDL DDWFEKENYS TDIVYEGRTA DSRLSWMARY FTGRDYDVQY
DPTGSNQGWD DDIPYTSKVD HKGAQAQLTY NYDYFRATAG IDWLNYEETT TPYAPYKSEY
DNMAEFLLLN GFLFDKRLVL SAGFRYDTYD LTSQGYEDSS ESDRDDDNFV MNYGVAWHVT
DGIKLRASYA EGFKMPASKE LAADYYISTT HYVGNADLKP EESTTCEIGV DVADNRFASS
LTWFTTDFKN KIQSVSLGSG VSSWENLYGA KISGFEGEVS YAFEPFGNNW QFSPYASFVY
LTEFEDDGTG DRLLYTPEWN ATVGLRVNDQ RGFSGMFNLA YTGESDIQDW ETSWAGTVIT
KGGFAVANLT ASKKFMLSEK KSGRALTIKG EVNNLFDRDY EFVKGYPMPG RSFAVGVRVD
I
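
The sequences above are printed in blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. It assumes the codon assignments of translation table 11, which match the standard genetic code for sense and stop codons, and it does not handle alternative start codons (this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same sense and stop codon assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Join a sequence printed in whitespace-separated blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGAACAAAA AAGTATGCCT GCTTGTGCTG GCCGGGCTGC TCTGCAGCAG GGGGCTTCTT"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MNKKVCLLVLAGLLCSRGLL
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.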