Gene Cpha266_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1812 
Symbol 
ID4570363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2066815 
End bp2068035 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content51% 
IMG OID639766394 
Productglycosyl transferase, group 1 
Protein accessionYP_912252 
Protein GI119357608 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAA GAGCGATTGC GTATCTTTGC AGCGAGTATC CGGCTATTTC CCATACTTTT 
ATCTACAGGG AGATTGAGTC GCTCCGTCAA GCAGGGGTGA CCGTCTATCC GGCATCGATC
CACAAGCCCT CCAACCTTCA GGTTATGACT TCGGATGAGA GAGCGGAGGC TGCGCGTACC
CTGATGGTGC TTTCACTTCC GATACCCGCC ATGGTTGGAG CGCATCTGCA CTGTCTTCTT
AAAAATCCGG GAGGTTATGT TCGCATGATC TTCGCTGCCG TCAAACTGCT CACAACGGGA
CCAAAAAGCC CCTTGAAGGC CGGCGCATAT TTTGCCGAGG CGGGTATTCT TCTTGAGTGG
ATGCATCGGC ATGGTGTTAC CCATATTCAT GAGCATTTTG CTAATCCTAC TGCTCTTGTT
GCCATGCTCA TGAAGCGTTA TGGCGGAGTG AGCTTCAGTA TCTCCGTGCA TGGGCCGGAT
ATTTTTTATA TCGTCGATAC GGCGATGCTT GCTGAAAAAG TTCGCGAAGC GGCGTTTGTG
CGCTGCATCA GCCACTATTG CCGCAGCCAG ATTATGCGCA TCAGCAAGCC GGAGAGTTGG
AAGAAACTTC ATATTGTTCG TTGCGGGGTC GATCCCGCCC TCTATGTTCC GAGGCCGGAA
CCCGTTAATC CTGTGCCGGA TATGCTCTGT GTTGGCAGGC TGGTTCCGGC AAAAGGACAG
CATATTCTGC TCGAAGCCTG TACTCTCCTG AAAAAAGAGG GAGTTCGTTT TCAGTTGACC
TTTGTTGGCG ACGGCCCTGA TCGGGACTCT CTTGAACAGT TCAGCGCTTT AGCTGGCCTG
AACGGTATGG TAACGTTTAC CGGTGCGCTC GGCCAGGACA AGGTTCGTGA TTATTATGAC
AAGGCCGATC TTTTTGTGCT TGCAAGTTTT GCCGAAGGGG TTCCCGTTGT GCTGATGGAG
GCTATGGCAA AGGAGATACC TGTTATCTCA ACGCGGATTA CCGGTATACC TGAATTGATT
GAGCATGATC GTGACGGATT GCTTGCAACA CCGGGAGACG CCGTGGATCT TGCCCGCCAG
ATCCGGAGAT TGCTTGATGA CTCCGGACTT CGCCGTGAGC TGGGAGTGGC CGGACGGAAA
AAAGTTATTG AACTTTACAA TCAGCATGGC AACAATAGTG CTATGGTTGA TCTTTTTCAC
CTTGAAGGGA TCTCCTCATG A
 
Protein sequence
MKTRAIAYLC SEYPAISHTF IYREIESLRQ AGVTVYPASI HKPSNLQVMT SDERAEAART 
LMVLSLPIPA MVGAHLHCLL KNPGGYVRMI FAAVKLLTTG PKSPLKAGAY FAEAGILLEW
MHRHGVTHIH EHFANPTALV AMLMKRYGGV SFSISVHGPD IFYIVDTAML AEKVREAAFV
RCISHYCRSQ IMRISKPESW KKLHIVRCGV DPALYVPRPE PVNPVPDMLC VGRLVPAKGQ
HILLEACTLL KKEGVRFQLT FVGDGPDRDS LEQFSALAGL NGMVTFTGAL GQDKVRDYYD
KADLFVLASF AEGVPVVLME AMAKEIPVIS TRITGIPELI EHDRDGLLAT PGDAVDLARQ
IRRLLDDSGL RRELGVAGRK KVIELYNQHG NNSAMVDLFH LEGISS