Gene Cpha266_2318 details


Gene Information

Locus tag: Cpha266_2318
Symbol:
ID: 4570956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2661040
End bp: 2662260
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 51%
IMG OID: 639766879
Product: glycosyl transferase, group 1
Protein accession: YP_912733
Protein GI: 119358089
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
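
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers; the names are illustrative, not from any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2661040, 2662260)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both values agree with the Gene Length and Protein Length fields of this record.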


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTT TATTTGTACA TCAGAACTTT CCCGGACAGT TCCGACATGT TGCCAAAGCC 
CTGGCCGAGA TGCCGGAACA CCGGGTAGTC GGTATTGGAG AGAGCGCCAA TCTCAAGGGA
CGTCTGTCGT TGCATCCCCG GATAAACGTA ATGGGGTATC AGCCGAAAAG GGGTGCGAGT
CCAGAGACTC ACCACTATAT CCGTGACTTC GAAGGGGCTG TACGCCGGGG CCAGGAGGTT
GCCCGTGTGG CGTTTGAGCT CCGAAAGAAA GGGTTTCGTC CCGATCTGGT GATCAGCCAT
CCAGCCTGGG GGGAGTCGTT TTTTCTTCCG GATATTTTTC CGGATGCCCG TCATATCGGT
TATTTCGAGT ACTTCTATCG GAGTTCCGGG GGGGATATCG GGTTTGATCC GGAGTTTCCT
TCTTCTTTTG ATGATCGGCT GAAGGTAAGG ATTAAAAACA CGACTCAGCT TCTGAGCCTT
GATTCTGCCG ATGCAGGAAT TTCGCCTACC CTGTGGCAGC AGAGCCGCTA TCCGAAAGAG
TTTCATTCGA AAATCAGGGT AATTCACGAA GGGGTGGATA CCAACGTTGT CGCTCCTGAC
GAAAATGCAT CGATTGATAT TGACGGAGCG CATTTCATAA GAGGCGACAG GGTTATTACC
TATGTAGCCC GTAATCTGGA ACCTTGTCGA GGGGTTCATG TGTTCATTCG CGCTATTCCA
CTGATTCAGG AGCTGTGCCC TGATGCACGG ATTGTTATTA TCGGAGGCGA TGATGTCAGT
TACGGGAGAA GACCTACGGC AGGAACAACC TACCGGTCAC TTTATTGTGA CGAAGTGAAA
GATGTAGCGG ACTGGTCACG GGTTCATTTT ACCGGCAGGC TGCCCTACAA CCGCTACCTG
AAAATTTTAC AGCTCTCTTC AGCTCATGTT TATCTTACCT ATCCTTTTGT GCTCTCCTGG
TCGATGCTTG AGGCAATGGC TGCCGGTTGT GTTGTGATCG GTTCAGCGAC GCCCCCTGTT
CAGGAGGTCA TTACTCATGC AGAGAATGGC CTGCTGGTTG ACTTTTTCGA CAGGGAAGAA
CTTGCTCGTA CGGTTGCCGG AGTAGTCAAT AACCAGTCAC AGCATGAACA GATAAGGCAA
TCTGCCCGAC AGACCATACT TGATCGCTAT GATCTGCATA CAAAATGCCT GCCCGAACTG
CTGCGGTATC TGATCGGGTA G
 
Protein sequence
MNFLFVHQNF PGQFRHVAKA LAEMPEHRVV GIGESANLKG RLSLHPRINV MGYQPKRGAS 
PETHHYIRDF EGAVRRGQEV ARVAFELRKK GFRPDLVISH PAWGESFFLP DIFPDARHIG
YFEYFYRSSG GDIGFDPEFP SSFDDRLKVR IKNTTQLLSL DSADAGISPT LWQQSRYPKE
FHSKIRVIHE GVDTNVVAPD ENASIDIDGA HFIRGDRVIT YVARNLEPCR GVHVFIRAIP
LIQELCPDAR IVIIGGDDVS YGRRPTAGTT YRSLYCDEVK DVADWSRVHF TGRLPYNRYL
KILQLSSAHV YLTYPFVLSW SMLEAMAAGC VVIGSATPPV QEVITHAENG LLVDFFDREE
LARTVAGVVN NQSQHEQIRQ SARQTILDRY DLHTKCLPEL LRYLIG
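
The protein sequence follows from the gene sequence by codon-wise translation. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons, so a standard codon table is enough for a sketch. The code below is a minimal illustration, not the tool that produced this record. The names `CODON_TABLE`, `clean`, `translate` and `gc_percent` are hypothetical, and only the first and last 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGAATTTTT TATTTGTACA TCAGAACTTT"  # start of the gene sequence
last_blocks = "CTGCGGTATC TGATCGGGTA G"            # end of the gene sequence

print(translate(first_blocks))  # MNFLFVHQNF
print(translate(last_blocks))   # LRYLIG
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way can be used to compare against the Protein sequence and GC content fields.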