Gene Cpha266_1813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1813 
Symbol 
ID4570364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2068032 
End bp2069120 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content48% 
IMG OID639766395 
Productglycosyl transferase family protein 
Protein accessionYP_912253 
Protein GI119357609 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATGA TCAGCATCAA TGCTGCAGGC AGACACCATC TTCCCGATAT CGATTGCGTC 
CTCATCGGCG TTAACTGCAG TAAAACGCTG GCAAGATGCC TTGATTCGAT ACGATCGTGC
GATTACCCAC AGGAAAAATT GCATAGCTGT TATGTTGACG GCGGGTCAAC TGACAAAAGC
ATTGAGATCG CCGAACGGTA TGAGGATGTT ACGGTTATAG CGCTTGATCC TGCATATCCG
ACGCCTGGAA TGGGAAGAAA TGCCGGCTGG AAACACAATA AGTCGCCGTT TGTTCAGTTT
CTTGATTCTG ATACCATTCT TGACGCACGC TGGCTTCGTA AAGCTGTTGA GGCAATGGCG
GATGAGCGAT TCGGAGCGGT GATTGGCATG CGTCAGGAGA TGTATCCTGA ACGCACGGTC
TATAACTGGA TTGGCAATAT CGAGTGGAAC GGGCCCGCAG GTCTGTCCGA TTGTTTCGGA
GGAGATGTTT TTATCCGGCG CACAGCGCTT GAAAAAACAG GAGGATACGA CGAAACGCTT
GTAGGCGGTG AAGATCCGGA ACTCAGCCGG AGGGTGATCA GGGCAGGCTG GCAGATTGTT
CGGCTTGATG CGCTGATGAC AAGGCATGAT CTGGCCATGA CCACGATGAG TCAGTATTTT
CGGCGGGCAT TTCGTTCCGG CTATGGCTTT GCCGCGGTGA GTTTTCGTGA ATCCCTGGTT
GGGAGTTCTT TCTGGAAGTA CGATGTTTTG AAAATTTTCA TTAAAGCGGG GAGTTTTTTC
GGTTGCATCG TTCTTGCTTT ACTCTTGTTT TTTGTTACAC AAGCAAACAG TGTAAAAATT
ATAGCGGCTT TTCTTCCCTT TGTTGGACTT ATGGTGATGC TCTCTCCCCG GCTGTTTAAA
ACAGGAAAAT TCATGCGTGA AAATAATCTG AACAAGAACG ATGCGAAAAG GTATGCATGG
CACTGTTCGG TAGTGGTTGT TCCCCAGTTT TTCGGGATCA TCCGGTTTCA CCTTGGCCGT
ATTTTCAATA AACCCCTGAA AAACAGGCGC CGAAATCTCA AAACAGGAAT TTCAATTTCC
GGCACATGA
 
Protein sequence
MNMISINAAG RHHLPDIDCV LIGVNCSKTL ARCLDSIRSC DYPQEKLHSC YVDGGSTDKS 
IEIAERYEDV TVIALDPAYP TPGMGRNAGW KHNKSPFVQF LDSDTILDAR WLRKAVEAMA
DERFGAVIGM RQEMYPERTV YNWIGNIEWN GPAGLSDCFG GDVFIRRTAL EKTGGYDETL
VGGEDPELSR RVIRAGWQIV RLDALMTRHD LAMTTMSQYF RRAFRSGYGF AAVSFRESLV
GSSFWKYDVL KIFIKAGSFF GCIVLALLLF FVTQANSVKI IAAFLPFVGL MVMLSPRLFK
TGKFMRENNL NKNDAKRYAW HCSVVVVPQF FGIIRFHLGR IFNKPLKNRR RNLKTGISIS
GT