Gene Cpha266_2356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2356 
Symbol 
ID4569614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2741188 
End bp2742330 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content48% 
IMG OID639766914 
Productglycosyl transferase family protein 
Protein accessionYP_912768 
Protein GI119358124 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATAT TGATCTACCA GATAGTTGTT CTGGTTTCGC TGATGGTTTT TTTCGGGATT 
CTTCTTCGAA ACCTGCGCGA TCTTCCCGCT CTTCCTTTGA CGCCTTATCG GGCAAGGCCT
TTTGTTTCCG TTCTTGTGCC GGCAAGGAAT GAGGAGTTGA ACATTGAAGG ATGCCTCGCA
TCGCTGATTT TACAGACATA CGATAACGTC GAGATTCTTG TGCTTGATGA TGGATCATCG
GACAGAACAT GGGAGATATT ACAGCGTCTT CAGGCAAAGT ATGGTTCTCG TCTGAAGATT
TATCAGGGAG AATCTCTTCC CGAGGGATGG CACGGCAAGG CCTGGGCTTG CCGTCAGCTT
GCCACGAAAG CAGAGGGTCA GCTACTGCTT TTTACCGATG CTGATACCAC TCATAAACCG
GAAGCGTTAT CCAGAGCGGT TGCTGCCATG GAGGAGTCCG GCGCAGACAT GCTGTCGCTG
ACTCCCCTCC AGGAGACGCA AACTTTTTTT GAACGGCTTG TTGTTCCGCT GGTCTATGTA
ATTCTTCTCT GTTATCTACC TCTTCGACTT CTCAGCAGAT CAAAAAAACC TGCGTTCTGT
TTTGCGTACG GTCAGTTTAT TCTGTTTCGT TCTAAATTTT ATGAGAGCAT AGGCGGCCAC
GCGGCAGTAA AAAACGCCCT TGTTGAAGAT GTCTGGTTGT GCAAGGCGGT TAAAAAATCC
GGAGGAAAAG TTGTCGCTTA CAATGGCGTC GATGCGGTAA GCTGCAGGAT GTACCGGAAC
TTCAGGGAGG TCATTGAGGG TTTTTCGAAA AATCTTTTTG CCGGGCTTGG CTACAGTACG
CCGCTTCTTT TTTTACTTGT TGTGCTTACG GCGATTTTTC ATGTTGCCCC CTGGTTTTTT
TTTACACTGG CGCTGGCAAG GGGAGATACG GCCCCGGCAC ATTTACTGCT TCCTCTCACG
CAAATTGCGA TTGCGCTGTC CTGCAGGGTC ATCATAGCTG TAAAGTGCCG GCAGCCATTA
TCAATGGTAT GGTTTCATGC CCTCTCGCAG TTTGTTCTGA TTGCGATTGC GCTGAACTCA
TTTTATCAGG TTAAATCAGG TCGCGGCTCA AGGTGGAAAG GCAGGAACTA CAAGTTTTCC
TGA
 
Protein sequence
MLILIYQIVV LVSLMVFFGI LLRNLRDLPA LPLTPYRARP FVSVLVPARN EELNIEGCLA 
SLILQTYDNV EILVLDDGSS DRTWEILQRL QAKYGSRLKI YQGESLPEGW HGKAWACRQL
ATKAEGQLLL FTDADTTHKP EALSRAVAAM EESGADMLSL TPLQETQTFF ERLVVPLVYV
ILLCYLPLRL LSRSKKPAFC FAYGQFILFR SKFYESIGGH AAVKNALVED VWLCKAVKKS
GGKVVAYNGV DAVSCRMYRN FREVIEGFSK NLFAGLGYST PLLFLLVVLT AIFHVAPWFF
FTLALARGDT APAHLLLPLT QIAIALSCRV IIAVKCRQPL SMVWFHALSQ FVLIAIALNS
FYQVKSGRGS RWKGRNYKFS