Gene Cpha266_0440 details


Gene Information

Locus tag: Cpha266_0440
Symbol:
ID: 4569229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 486186
End bp: 488390
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 49%
IMG OID: 639765040
Product: TonB-dependent receptor, putative
Protein accession: YP_910922
Protein GI: 119356278
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
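
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(486186, 488390)
print(length)                  # 2205
print(protein_length(length))  # 734
```

Both values agree with the Gene Length and Protein Length fields listed in the record.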


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CAGCTCTTTT GGTTTTTCTT CTTGCGTTTC CCGCACTTGC GTCGGCAAGT 
GAGATATCCG AGGCATATCC TTCCGCCGAA GAGGTAACCG TTACCGGAAA AAAAGGGGAT
ATTCTGCAAC AGGTTACGGG CAGGGAGTCT GAGCTGCTCA ATCCTTCGCA GATGTCTGTC
TACAAGGCGA TCAACCTCAT GCCTTCGCTC AGCCAGCAGA GTGTTGATCC GTACGGGCTT
GCCGATATTG TCAACTATCA CGAATCGTTT CGTTTCAGGG GCGTTGAGGC AACATCCGGT
GGTGTTCCCG CCACAACGGT GAATGTTGAG GGTCTTCCCG TAACCGGAAG GCCTGGCGGA
GGAGCTACCA TCTACGACCT TGAAAATTTC AGTAATATCA ATATCTATAC CGGAGTTATG
CCTGCCAGCG CAGGGCTTGG TCTTGCCGAT GTCGGGGGCA AGATCAATAT GGAGATCCGT
CGTCCCGAAG AGAGCTTTGG TGTGCTTTTC AAACAGGGCT TCGGCAGCAA TGATTTTCGC
CGCACCTTTT TGCGGGTTGA TACCGGAGCT CTTCCCGGTG ACGTGAAAAG TTTCATCTCC
TTTTCTGATG CCGCAGCCGA TAAATGGAAA GGCGAGGGCG ACAGCCAGAG ACGCAATGTG
ATGGCGGGAG TAACGAAAAC GTTTGGCGAT AATGTCAAAC TTGAAGCCTT TGTGACCTAC
AGCAAAGGGG ATATTCATGC ATACAGGCCC TTCAGCTATG CTCAAATCGG CAACCTCGAA
AGCGCTTATA CAATCGATTA CGGAACGAAT CCCGACAGTT ATGACTATTA CGGATACAAC
CGTAACGAGT TCGAGGACTG GATGGTGATG GCCAATCTCG AAGTAAAAAC CGGCGAGCAC
TCGAAACTTG ACATCAAGCC CTACTACTGG AGCGACAAAG GATACTATCT TGAAACCATT
ACCCTTGCGA ACAATCAGAA CCGGATCAGA CGGTGGGACA TCGATCACGA TCTGAAGGGC
GTACTTGCCG AGTATACGAC AACGATCGGT AGTGTTGATC TTGATTTCGG CTACCTCTAC
CATACGCAGG TACGTCCCGG ACCGCCAACC TCGTGGAAAA ACTACAAGGT TGTTAACGGC
AAGCCTGTTT TTGATCAGTG GAACATTCTT TCAAACAGCT CAAGCCATGA ACTGCACTCT
CCGTTTCTTG AGGCAACATA CCGATTCGGT TCCTGGAAGC TTGAAGGAGG GCTGAAGTAT
ATCAACTATA CGCTCCCCTC AATCATTACC TATAACACGG CGGGTCTCGG CGATCTCAGT
TACGACGAAG CTCTTGCCGG CAATCCGTCG ATCAATGCCA AAGCCAGCGC ACTGGATACA
AAAAGTTTCA GCAGGCTCTT TCCGAATCTG ACTCTGACAC GGTTTATCGG CGACAATATC
TCACTTCACG CTGCATACGG GGAAAACTAC GTGACCCATG TCGATATTTA CCCCTACTTT
ATCTCGCAGA GCGCCAATTT TATCCAGAAA GGTATCAGTT TTCAGCAGCT CTGGGATGCT
CGGGAGATGG AGACCTCGCA AAATTTCGAG CTTGGTATGC GCGTGAAAGG CAGCAACTGG
AGCATTGCGC CAACTATCTA CTACGCGCTG CATAACAATA AACAAGCGGT ATTGTACGAT
CCTGTGCTCG ATGCGACCTA CCCGATGAAC AATGCCGATG CCGAAGGGTA TGGTTTCGAG
CTGGAAGCCG AGTACAAGCC TCTTGAAAAC CTGAGCTGTT ACGGTTCATT CTCATGGAAC
AGGTTTTCCT TCTCACAGGA CATCAACTCC GATGCTCCTG GCGGAGGAAT CATCAAGGTG
AAGGGAGAGC AGGTTCCCGA TGCTCCGGAA TTCCTGGCAA AAGGAATGGT CAGCTACAAG
GCCGGGGACT TTTTGATTAC TCCCATTGTC AGGTACACCT CGGTTCGCTA CGGAGATGTA
CTGCATAATG AAAAGATCGA CAGCACAACG CTTTTTGATC TCGATCTGAC CTGGCAGAAA
AAGATGCTTG GCTTTAAAGA GGTCGAAATC TCACTTTCTC TTCTGAATAT TTTCGACAAG
CAGTATGTCA GCATGATCAG TACCTCTGAT TATAAAACTC TGAAAACATC CTATCAAGCC
GGTATGCCCT TTACCGTTAT GGCAACGCTG GCAGTTCACT ACTGA
 
Protein sequence
MKKTALLVFL LAFPALASAS EISEAYPSAE EVTVTGKKGD ILQQVTGRES ELLNPSQMSV 
YKAINLMPSL SQQSVDPYGL ADIVNYHESF RFRGVEATSG GVPATTVNVE GLPVTGRPGG
GATIYDLENF SNINIYTGVM PASAGLGLAD VGGKINMEIR RPEESFGVLF KQGFGSNDFR
RTFLRVDTGA LPGDVKSFIS FSDAAADKWK GEGDSQRRNV MAGVTKTFGD NVKLEAFVTY
SKGDIHAYRP FSYAQIGNLE SAYTIDYGTN PDSYDYYGYN RNEFEDWMVM ANLEVKTGEH
SKLDIKPYYW SDKGYYLETI TLANNQNRIR RWDIDHDLKG VLAEYTTTIG SVDLDFGYLY
HTQVRPGPPT SWKNYKVVNG KPVFDQWNIL SNSSSHELHS PFLEATYRFG SWKLEGGLKY
INYTLPSIIT YNTAGLGDLS YDEALAGNPS INAKASALDT KSFSRLFPNL TLTRFIGDNI
SLHAAYGENY VTHVDIYPYF ISQSANFIQK GISFQQLWDA REMETSQNFE LGMRVKGSNW
SIAPTIYYAL HNNKQAVLYD PVLDATYPMN NADAEGYGFE LEAEYKPLEN LSCYGSFSWN
RFSFSQDINS DAPGGGIIKV KGEQVPDAPE FLAKGMVSYK AGDFLITPIV RYTSVRYGDV
LHNEKIDSTT LFDLDLTWQK KMLGFKEVEI SLSLLNIFDK QYVSMISTSD YKTLKTSYQA
GMPFTVMATL AVHY
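
The sequences are printed in space-separated blocks of ten, and the record gives translation table 11. The following is a minimal sketch, assuming the sequence text has been copied into a Python string, of how the grouping can be stripped, GC content computed, and the coding sequence translated. The amino-acid assignments used here are those of NCBI table 11, which are identical to the standard code (the two tables differ only in their permitted start codons). All function names are illustrative.

```python
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (bases in the order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, as they appear in the record.
first_blocks = "ATGAAAAAAA CAGCTCTTTT GGTTTTTCTT"
dna = clean_sequence(first_blocks)
print(translate(dna))  # MKKTALLVFL
print(gc_content(dna))
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value that can be compared against the GC content field.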