Gene Cpha266_1820 details


Gene Information

Locus tag: Cpha266_1820
Symbol:
ID: 4571162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2076394
End bp: 2078154
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 46%
IMG OID: 639766402
Product: polysaccharide biosynthesis protein
Protein accession: YP_912260
Protein GI: 119357616
COG category:
COG ID:
TIGRFAM ID:
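
The start, end, gene length and protein length reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2076394
end_bp = 2078154
gene_length_bp = 1761
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is not translated.
coding_codons = gene_length_bp // 3 - 1

print(span_bp)        # 1761
print(coding_codons)  # 586
```

Under these assumptions both derived values agree with the reported gene length and protein length.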


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.120135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAGA AGGAAAACGC GCTTAAAAAA CTTGCAGGTA ATGCTGTTTC GGGTATGGTG 
GCGACGATTA TTTATATGGT CAGCCGCCTT CTTCTTACAC CGTTCATTCT GCAGTATCTC
TCTCTGGAGG AGTTCGGCTT ATGGTCGCTC TGTTTTATCA TTCTCTCTTA TGCCGGAATG
GGAGGGTTCG GAGTAAACAG TACCTATATC CGTTATTCGG CAAGATACCT TGCGGAGGGA
AAAGAGAAAG AGATCAGCAA GCTGCTCTCA ACCGGTGTTG CCTATATGTT TTCATTCTGT
CTCTTTTTCT GTCTGGTTCT TTATCTGATT ATGCCTTTTC TTCTCGAAAG GTTCCATATA
GCGCCTGCGC AGCAGGATCT TGCCTCAACA ATATTTCTTG GTACTGCTGC AGTTTTCAGT
CTTGAACTTA CTCTTGGCGG ATTCCGGTTT GTCATTAACG GAATGCATGA GTTTTTAAAG
GAGAAAATCG TTTCAACCGT TGCCGGACTC ATTGAGATTG GCGCTATCCT TCTGTTTCTT
TACTTCGGAG CGGGGGTTAA AGGGCTCTTG TACGCGTTTG CCTTAAGGCT GGTTCTTGAA
ACTATTGGCT GCTGGGCAAT TGCCCGATCC TTGCTTCCTT CGCTCTCTGT TTCATGGAGA
TTGATCAGCC GTGAAAATTT CAGGCTTTTT CTCGGTTTTG GCGGCAAAGT CCAGGTGCTC
GGCATTCTGG GTATCTTTCT TACGGCGCTT GACAGATTGT TCATTACGGC AATTGCCGGA
CTTGCCGCAG GAGGCATGTT TGAGATAGGT CGAAAGCTCC CTTCAACGGC AGGGGGTATC
TCATCATCCG CATTCGGTCC GTTTTTATCT ACCGCATCTC ATATCGAAGG CCGTTGGGCA
GGTGAAAAAC CGGATGCTTT TCCGGACAGG CTTAAAACCT ATGGTCTTAT TGTTGCAACA
ACCGTTACGC TATCCCTTGT CCCGCTTTTT TTTCTACTGC CCGTGCAAAA ACGGCTGCAG
GGGGCAAGTC CGCTGATCGC TGTATTTGCA GGGGTTTTAA CCGTTGTTCT GTTTTATCTG
CTCAATCGCA GAATGAAAAA TGAAAATTTT CTCGATAACA TTGAATTAAA GCAGCTTTAT
CTCAACGGGA TTCGTTTTAC CAACATGATC AACTCGACAC TGTTTCTTTT TCTTGTCGCC
ATGGCTCATC CACTGATGAA TGCATGGGTT GGCAAGGAGT ATGCGCGTGC TGCCGATGTT
ATGATCTTTT TATCGACAGC CTACTCGATT CAATTGTGTA CAGGTCCGAT AACCATGATA
TTTCGGGGAA TTGATCGTAA CGGAAGAGAG CTTGAGTACA TGCTGGTTCA GGTTATACTG
ATGGTTATCT GGATTCCTGC CGGAACGATT GCATCGGGAT TGATCGGATC AGCAGCAGCT
ATTGCGTGCA GTTCGATAGT CAGCACATGC TTTCTTTTCT GGCGGAGCAA TAACACGTTT
CAGATTCGAT TCCGTAAATT TGTTTCAGTC ACCGTTATCC CTGCGCTTGT TCCTCTCTTG
CCGGCAGTGG CTGTTTTTGC CGTTTCGGAG ATCTATCCCG CAGAAGGGAG ACTTGTGGCT
GTCTTGCAGG TTCTTGTTTG CGGTGTTGTC TATGTGTTGC TTTCTGTCAT GATGTTCTGG
AAATTTATCC TGAACGGCGA GGAAAAATCA AAAGCACTGG AAATGATACC TTTTAACCGG
AAACGGAATC CTCCATGCTG A
 
Protein sequence
MNQKENALKK LAGNAVSGMV ATIIYMVSRL LLTPFILQYL SLEEFGLWSL CFIILSYAGM 
GGFGVNSTYI RYSARYLAEG KEKEISKLLS TGVAYMFSFC LFFCLVLYLI MPFLLERFHI
APAQQDLAST IFLGTAAVFS LELTLGGFRF VINGMHEFLK EKIVSTVAGL IEIGAILLFL
YFGAGVKGLL YAFALRLVLE TIGCWAIARS LLPSLSVSWR LISRENFRLF LGFGGKVQVL
GILGIFLTAL DRLFITAIAG LAAGGMFEIG RKLPSTAGGI SSSAFGPFLS TASHIEGRWA
GEKPDAFPDR LKTYGLIVAT TVTLSLVPLF FLLPVQKRLQ GASPLIAVFA GVLTVVLFYL
LNRRMKNENF LDNIELKQLY LNGIRFTNMI NSTLFLFLVA MAHPLMNAWV GKEYARAADV
MIFLSTAYSI QLCTGPITMI FRGIDRNGRE LEYMLVQVIL MVIWIPAGTI ASGLIGSAAA
IACSSIVSTC FLFWRSNNTF QIRFRKFVSV TVIPALVPLL PAVAVFAVSE IYPAEGRLVA
VLQVLVCGVV YVLLSVMMFW KFILNGEEKS KALEMIPFNR KRNPPC
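
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence above.
first_block = translate("ATGAATCAGA AGGAAAACGC GCTTAAAAAA")
last_block = translate("AAACGGAATC CTCCATGCTG A")
print(first_block)  # MNQKENALKK
print(last_block)   # KRNPPC
```

The two results match the first ten and last six residues of the protein sequence listed above. The final codon, TGA, is a stop codon and yields no residue.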