Gene Cphamn1_1535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1535 
Symbol 
ID6375213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1657426 
End bp1660344 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content49% 
IMG OID642684028 
ProductTonB-dependent receptor 
Protein accessionYP_001959942 
Protein GI189500472 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.39302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.5632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATTTC ACACTTCCTT AACCGATCCT TTGCACAATC GTCACAGGAG ACTGTTTGCC 
GGAAAATCCA AAAGAGATAT TCTGTTTGCG ACATGGATTT CCCTCTCTTT TCATCGTGGC
TTTCCAACAT TAACGGAAAA TACAATGAGA CATTTGTTTT CGTATTTATT CATCTGTCTG
ACTACGCTTG CATGCCATGG ATTTGCAGCA CCCGGCATTG CTCAAGCTGA AACAACAGGC
TCCTCTTCCG GCACCACGAT AACCGGTCGT GTAGTCGATG AGGTAGACGG TCTTCCGCTC
CCCGGTGCTA ACATCTCCGT CAAAGGCACG TCAAAAGGTT CTATTACCGG GCAGGACGGA
CGTTACCGGC TTGAAAACGT TTCCGGTGCT ACTGCAGTCA TCGAAGCATC CTATATCGGA
TATGTAAAAA GTGATATTCC AATAACCCTT TCACCCGGCA AAGCCGTTAT AAAAAATATA
CGGCTTAAAC CCGGCGTCTT GATCAGCGAG GAAATCACCG TTGTTGGCGA ACTCCTGAAG
GGCCAGGCAA AAGCACTGAA TCAGCAGAAA AACGACGTCA ATGTCACCAA TGTTGTTGCT
TCAGACCAGA TCGGTAAATT TCCCGACTCC AACATCGGCG ACGCCCTTAA AAGAATTCCG
GGGATCAGCG TTTTTAATGA CCAGGGTGAA GCGAGATTCG GCCATGTACG AGGAACAGAG
CCCCGTTTCA ACTCTGTCAC CGTCAACGGA GAACGCATTC CTTCTGCTGA GGCAGAGAAC
AGGACAATCC AGCTCGACCT CGTTCCATCG GACATGATCC AGACCATCGA GGTCACCAAA
GCCTTGACAC CCGACATGGA TGCCGACGCA ATCGGTGGTT CGATCAATCT TGTAACGAAA
ATCCCTGCGG AAGAAAGATT TTCGCTCTCA GCCGGTGGAG GCTTGAACTT CCTCGATGGG
ACCGGTGGCG AAAGATACCA GTTCGGCGGC ACCTACGGCA ACCGGTTCGC TGACGAAAAA
CTCGGCGTAC TTTTCAGCCT CTCTTATGAC AACAACGATT TTGGCTCTGA CAATATCGAA
GGGGAATGGG ACGCTGGTAA TGACGGCATA GAAGGAATCA AGGAGTTCCA GGTAAGAAAG
TATGATGTTC AGCGTATCCG CAGAAGTTTT TCCGGAGCGC TCGACTACCG CTTCAACGAA
AACCATATCC TCAAGTTCAA TGGTATTTAC AACTGGAGAG ACGACTTTGA AAACCGCTAC
AGGATTAAGT ACAAGGATCT CGATGAAGAT TTTGCCACGG TGGAACGTGA GACAAAAGGT
GGCACGGAGA ATGACGCCCG CCTTGAGGAT CAGCGCATGA TGTCATTCAC GCTTGGCGGC
GAGCATGATT TCGGCAAGCT GGATCTCGAC TGGCAAGCTT CATACTCAAA AGCTTCGGAA
GATCGTCCGA ATGAACGATA CATTAATTTC AGAGCGAAAA ACCAGCCGTT TACCGTCGAT
ATCAGCAACC CGGAAAAACC GTTTGTAACG GTCTTGAACC CGGATGTGTC AGGCGGTATC
AGCGACAGCG AAGACTGGAA GCTCAAGGAG CTGACAGAAG AACATCAGTA CACCGAAGAT
ATCGACAAGA ATTTCGGCCT GAACTTTAAC TATGCTGCAA CCGACGCACT TGCATTCAAA
TTCGGCGGTA AAATCAGGGA CAAGAAAAAG AAGCGTGATA ACGATTTTTA CGAATATGAA
CCGGTTGATG AAGATGCATT CCGCGATGAA GTCTTTGCCA ACCTCAAAAA TGAAACCAAG
GATCATTTTC TTCCGGGTAA CAAATACAAG TCAGGTGTCT TTGTTTCCAA CAAGTTTCTG
GGCGGCCTCG ACCTAGACGG CGATGATTTC GACAAGGAAC TTGTAAAGGA AGGGCTTGCC
GGAAACTTCG ACGCAAAGGA ACAGATAAAA GCGGTCTACG CAATGGCAAC CTGGGACATC
AGCGACAACA CAACACTGCT TGGAGGTGCA AGGCTTGAGC ATACCAGAAA TGAATACGAT
GCATTCAAGT ACTTTGCCGA CGAAGACTCA CTTGCTGCAG TAACAGGCAA GCCGTCTGAC
TACACCAACG TTTTGCCTTA TGTACATCTG CGTTATAACG TCAACGATCA GACAAATATC
AAGCTTGCCT ATACGCACAG TCTTGCTAGA CCAAACTATT TCGATCTGGC CCCGTACCAG
GAAATCGTTG CCGAAGACGA GGAAATAAAA TTAGGCAATC CGGCACTTGA ACCGACCCTC
TCGAAAAACG TCGACCTGAT GATCGAGCAT TACCTGAGCG ATATCGGCAT TCTTTCTGCC
GGCGTCTTCT ACAAGTCGAT CAGCGACTTC ATTGTCACCA AAAAAGAAGA CGTCGATTAT
TCGGGAGACA CGTTTGAGCA ATTCCAGCCG GTCAATGCCG GTGACGGAAC ACTTATGGGC
ATAGAAACTG CCGCACAGTT CCAGCTTCCT TTCATCCCGG GCCTTGGCCT TTACCTGAAC
TATACCTACA CGCACTCTGA AATCGACAAC TTCGATATCA AGGGACGTGA AGGCGACGAC
CTGCCGCTGC CAGGCAACCC GGAGCACACA GCAAATGCGT CGATCGCTTA CGAGAACGGT
CCGTTCAATA TCCGTCTCTC AGGAAACTAT CACAGCGACT TCATCGATGC TGAAGAGGGA
TCGATCGGCG AGAACAAGTG GGAGGACCGC TACTACGACA GCTCATTCAC TCTCGACCTC
AATGGCGGCT ACCGCATGAG CGACATAGTC CAGCTATACT TCGAGGTCAG CAACCTGACA
AACCAGCCGC TGCGCTTCTA TCAGGGTGAA AAGCAGTATC TCGCCCAGGA AGAATGGTAT
GACCGAAGGT TCCTGGTTGG CGTCAAGGCT GATTTCTGA
 
Protein sequence
MLFHTSLTDP LHNRHRRLFA GKSKRDILFA TWISLSFHRG FPTLTENTMR HLFSYLFICL 
TTLACHGFAA PGIAQAETTG SSSGTTITGR VVDEVDGLPL PGANISVKGT SKGSITGQDG
RYRLENVSGA TAVIEASYIG YVKSDIPITL SPGKAVIKNI RLKPGVLISE EITVVGELLK
GQAKALNQQK NDVNVTNVVA SDQIGKFPDS NIGDALKRIP GISVFNDQGE ARFGHVRGTE
PRFNSVTVNG ERIPSAEAEN RTIQLDLVPS DMIQTIEVTK ALTPDMDADA IGGSINLVTK
IPAEERFSLS AGGGLNFLDG TGGERYQFGG TYGNRFADEK LGVLFSLSYD NNDFGSDNIE
GEWDAGNDGI EGIKEFQVRK YDVQRIRRSF SGALDYRFNE NHILKFNGIY NWRDDFENRY
RIKYKDLDED FATVERETKG GTENDARLED QRMMSFTLGG EHDFGKLDLD WQASYSKASE
DRPNERYINF RAKNQPFTVD ISNPEKPFVT VLNPDVSGGI SDSEDWKLKE LTEEHQYTED
IDKNFGLNFN YAATDALAFK FGGKIRDKKK KRDNDFYEYE PVDEDAFRDE VFANLKNETK
DHFLPGNKYK SGVFVSNKFL GGLDLDGDDF DKELVKEGLA GNFDAKEQIK AVYAMATWDI
SDNTTLLGGA RLEHTRNEYD AFKYFADEDS LAAVTGKPSD YTNVLPYVHL RYNVNDQTNI
KLAYTHSLAR PNYFDLAPYQ EIVAEDEEIK LGNPALEPTL SKNVDLMIEH YLSDIGILSA
GVFYKSISDF IVTKKEDVDY SGDTFEQFQP VNAGDGTLMG IETAAQFQLP FIPGLGLYLN
YTYTHSEIDN FDIKGREGDD LPLPGNPEHT ANASIAYENG PFNIRLSGNY HSDFIDAEEG
SIGENKWEDR YYDSSFTLDL NGGYRMSDIV QLYFEVSNLT NQPLRFYQGE KQYLAQEEWY
DRRFLVGVKA DF