Gene Cpha266_0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0889 
Symbol 
ID4570503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1012543 
End bp1014303 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content41% 
IMG OID639765484 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_911361 
Protein GI119356717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC AGAGCAATGC ACATAAGAAA CTTGCCGGAA ATGCGTTATC CGGTATGGTT 
GCTATTGTCA TTTATATGGT AAGCCGTATT CTTTTGACTC CCTATATCCT GCACTATCTC
TCGCTGACTG AATTCGGATT GTGGTCTCTC TCTTTTATTA TTCTTTCTTA TGCAGGGATG
GGCGGATTCG GGGTTAACAG TACCTATATC CGTTATTCTG CACGGTATCT CGCTGATGGC
AAGCAGAGTG AAATAAGTAA TCTTCTTTCG ACCGGCATAG CTTATATGTT ATCCTTTAGT
TTGCTTTTTT GCTCGGTACT TTATTTTTTG ATGCCTTTTA TTCTCCGGCA ATTTCATATA
GAACCTTCAC AACAGGAACT TGCTTCCACC ATATTTCTTG GAACAGCACT TGTATTCAGT
CTTGAACTGA CGTTAGGCGG GCTTGCATTT ATCATCAATG GAATGCATGA GTTTGCAAAG
GAAAAAATAA TCTCAACAAT TGCCGGGCTT TTTGAAATTG TATTTATTCT TCTCTTTCTT
GCTCTTGGAG CAGGGGTCAA GGGATTACTT TATGCTTTTG CCTTAAGGAT TGTCATGTCA
ACGATCCTCT GCTGGAAAGT TGCCCGCAGC CTGTTGCCAT CGTTAACGAT ATCATGGAAA
CTGGTAACTC GTGAACACTT TCGCCACTTT ACAGGGTTTG GCGGGAAAGT CCAGGTGCTT
GGTATTATTG GCATCTTTCT TACAGCGATG GACAGAATGT TTATTACCGC TATTTTGGGA
CTTGCGTCCG GAGGTATGTT TGAACTTGGC CGAAAGCTGC CCTCTACTGC CGGAGGTATT
GCCAATTCGG CATTTGCCCC GTTTTTATCT ACAGCAGCAC ATCTTGAAGG CTCATGGGCG
GGTGAAATGA ACAATACGGT GGGAGACAGA ATTAAAACCT ATCTCATCAT TTCGATAATG
GCCATTCTTT TTGCCATTAT TCCCGTTGTT TTTTTGCCGG GTTTTCAGAA ATATCTTCCG
ATATCACCGG TTTTCATAGC TTCAGCAGTT GCCATGGTGT TTTTCTATCT GTTTTTTCAG
CTTCAACACG AACAGAAAAA AAATAATTTC CTTGACAATC AGGAATTAAA AGAACTGTAT
CTCAATGGCA TCCGGTTTAC AAATATCATA AGTTCAATAC TTTTTGTTTT TCTGGTTGTT
ATGGCTTACC CCCTTATCGA TGCATGGGTT GGTTCAAAAT ATTCAGAGGC TGCAACGATC
ATGATTTTTC TTTCTGCAGG ATACGCAGTC CAGCAGTGTA CCGGACCAAT AAACATGATA
TTCAGGGGAA TAAACAAGAC AGGAAAAGAA CTGGAATATA TGCTTGTTCA GGTTTTATTG
ATGCTGATCT GGATTCCGGC AGCAACAATA ACCTACAGTT TATCAGGTGC TGCTGCCGCA
ATAGCATTAA GTTCAATAAC CAGCACCCTG TTTCTTTTTT TGCGAAGCAG TTATATCTTT
CAAGTCAGAA TCTGGGAAAT TATTGTTCGA TCTATCCTTC CTTCGCTGGT GTCGTTTTTT
CCCGCTTGCC TGATCTACAT CATTACGGTA CTGTTTCCTG TTACAGGGAG GATTGCTGTC
ATTGCGCAAA TCCTTGTCTG TGGAGTTCTC TATCTCATCA TGACCATAGC GCTGCTTTGG
GGCATTGTTT TGAACGAAGA TGAAAAAAAA CAGGCAATTG CATTATTGCC ATTTAAAAAG
AAGATGGACT CATCACAGTG A
 
Protein sequence
MKNQSNAHKK LAGNALSGMV AIVIYMVSRI LLTPYILHYL SLTEFGLWSL SFIILSYAGM 
GGFGVNSTYI RYSARYLADG KQSEISNLLS TGIAYMLSFS LLFCSVLYFL MPFILRQFHI
EPSQQELAST IFLGTALVFS LELTLGGLAF IINGMHEFAK EKIISTIAGL FEIVFILLFL
ALGAGVKGLL YAFALRIVMS TILCWKVARS LLPSLTISWK LVTREHFRHF TGFGGKVQVL
GIIGIFLTAM DRMFITAILG LASGGMFELG RKLPSTAGGI ANSAFAPFLS TAAHLEGSWA
GEMNNTVGDR IKTYLIISIM AILFAIIPVV FLPGFQKYLP ISPVFIASAV AMVFFYLFFQ
LQHEQKKNNF LDNQELKELY LNGIRFTNII SSILFVFLVV MAYPLIDAWV GSKYSEAATI
MIFLSAGYAV QQCTGPINMI FRGINKTGKE LEYMLVQVLL MLIWIPAATI TYSLSGAAAA
IALSSITSTL FLFLRSSYIF QVRIWEIIVR SILPSLVSFF PACLIYIITV LFPVTGRIAV
IAQILVCGVL YLIMTIALLW GIVLNEDEKK QAIALLPFKK KMDSSQ