Gene Clim_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1001 
Symbol 
ID6355450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1092108 
End bp1093994 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content53% 
IMG OID642668625 
ProductTonB-dependent receptor 
Protein accessionYP_001943056 
Protein GI189346527 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.11298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA AAACAGTTGC GTTTCTTGCG GCTTGCATGG TATGCCAGAG TGCATTGTAC 
GCAGGGGAAC CCTCCGAACG TTTTTACACT ACCGACGAGG TGGTGGTGAC GAGCAGCCAT
TTTTCCGAGA AGGAAAAGGA GAGCGCCAGG TTTATAACCG TTGCCGACAG CGAGAAACTC
AAAAAAACCG GGGCCACTAA CGCCATCGAA GCGCTTTCCC GAATCGGCGG TCTCGGGTAC
AAGTCCCTCG CTCCCCTTGG CGTCAACAAA CTCGGCATGA ACAGCTCGGT CTACATCAGG
GGGATGGCCG ACGGCGAGCT CATTCTTGTC AACGGCATGC CTGTCCAGCA GGCAGCTTCG
AAAGGGTATG ATCTCAGCGC CATTTCCATA GACCAGATCG AGAGAATCGA GGTACTCAAG
GGAGCCGCGT CCACGCTCTA TGGAGCCGAT GCGATGTCGG GCGTGATCAA TATCGTTACC
AAAAAACCTT CCGGCGAAAC ATCGGCGACC GCCTCGGTTG AATTCGGCAA CGAATCGTGG
ATGAACCACG GCGTCAGCGT CAACCTGCCC GGCGCGACTG TCGGTTTTCG TTACCAGCAC
ATGGACGAAC TCGACAATGT CGGCCGTCAG TTTACCAACA AGTACACCAA TGCCCTTGAC
GAGACCGACC GGTACATGTT CAATCTCAAC CTGTCGCCGT TTGCCGACAC CTATATCGAT
TATCAGTATT CCGGCTACGA AACGGGCTTC ATCGATCGTT ATGATTCGGG GCTGGTCGAA
CGGACGGATC AGCAGAGCGA CTTTCATTTT CTCAATGTTC GTTATGAAAG CGATGTGTTC
AAGGCAAAGC TGTACGGCGT TTACGACAAT CGCGTCCAGA CGGAGTACGT CGATGGGGTA
TTCGAGTCGG AGGTCGAGCG TCTGAATTAC AATTACGGCG CCGAGACCGA TTATCGTTTT
CTTTTTTCCC GCGGTCTCGA GCTGAGCGTG GGAGCGGACT ATGTTCATCG CTATGCCGAA
TATTCCAACA TCTACGGGGA AAAGTCGCGC GATGATTACG GCGTATTTGC GGAGTTGAAA
AAACGGTTCG GCGATGACCT TATCCTGACG TTCGGCGGGA GGGAGCAGTT CATCGACAAC
GAAGCGGAAA CGACCGATTA CAACGTATTT CTGCCAAGCG TCGGTCTTAT GTGGAAAGCT
TCGGATGATC TCAATCTGTT TGCCAATGCG GGCAAGGCTT TTCAGGCTCC GACCTTTACC
CAGCTGTACT ACGACAGCAA AACCATCAAA GGAAATCCCG ATCTCAAACC GGAATCGGGC
TGGAGTTACG AGACAGGGTT CAAGTGGAAC TGCGATTGCG CTTCGGCAAG GGTTTCCGGA
TTCTGGATGA CGTATGACGA CAAGATTCAG ATCGATCGCA AGAAAAAGCC TTACCGCTAT
TTCAATGCCG GCGCGTATGA AACGAAAGGC ATCGAATGGG AGCTCGGGCT GCGCCCGTTT
TATGGCGAAG CAGGGATTCT CGGCAGAGTC TCCCTATCGG CCGCAGGATA CTGGGCGGAT
CCTGTGTCGG AAGATATCTA CGGAGAAAAA TACCAGCCAG GTCCGAAGTT CCAGAATACC
TTCGGCATCA CGTATGCCTC GATACCGTTC GGTCTCGACC TGAGATGCCG GATACTTGCC
GGCCGTCAGG ACAATCTGGA CAACTATACC GCTTTCGATC TTTCCGGAAG GGTGAAAGCG
GGTCCCGGCA ATGTCACGGT CGCTGTTGAA AATCTGTTCG ATACCGAAAT CCAGACCTCC
GGCAATCTGG TTGAAACGGC AAGCAGCCGT TACGTCTATT ACGATCCGGG CCGTCTTCTC
CGGGTCGGAT ACGCGGTTGC ATTGTGA
 
Protein sequence
MTKKTVAFLA ACMVCQSALY AGEPSERFYT TDEVVVTSSH FSEKEKESAR FITVADSEKL 
KKTGATNAIE ALSRIGGLGY KSLAPLGVNK LGMNSSVYIR GMADGELILV NGMPVQQAAS
KGYDLSAISI DQIERIEVLK GAASTLYGAD AMSGVINIVT KKPSGETSAT ASVEFGNESW
MNHGVSVNLP GATVGFRYQH MDELDNVGRQ FTNKYTNALD ETDRYMFNLN LSPFADTYID
YQYSGYETGF IDRYDSGLVE RTDQQSDFHF LNVRYESDVF KAKLYGVYDN RVQTEYVDGV
FESEVERLNY NYGAETDYRF LFSRGLELSV GADYVHRYAE YSNIYGEKSR DDYGVFAELK
KRFGDDLILT FGGREQFIDN EAETTDYNVF LPSVGLMWKA SDDLNLFANA GKAFQAPTFT
QLYYDSKTIK GNPDLKPESG WSYETGFKWN CDCASARVSG FWMTYDDKIQ IDRKKKPYRY
FNAGAYETKG IEWELGLRPF YGEAGILGRV SLSAAGYWAD PVSEDIYGEK YQPGPKFQNT
FGITYASIPF GLDLRCRILA GRQDNLDNYT AFDLSGRVKA GPGNVTVAVE NLFDTEIQTS
GNLVETASSR YVYYDPGRLL RVGYAVAL