Gene Cpha266_0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0358 
Symbol 
ID4569336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp400322 
End bp401470 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content44% 
IMG OID639764956 
Productglycosyl transferase, group 1 
Protein accessionYP_910841 
Protein GI119356197 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.348726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTG GTATCGACTT TACGCACGAT CTGGGGTATA GCGGCATAGG GACATACTGT 
CGTTCTCTTA CAGAAGCGAT GGCACAACGC GAACCGGAAA ACATATACAA TATCCTCACC
CTTCACCATA AAATTCCAGA GGTTCAGCAA CACTTTTCAA ATCTGCGCGC TATTGTCTAT
TCGGCACCGT TTCCAAATCC AATGCTATTA GGAGGGAAAT GCAATAAAAT GATTAGAAAA
TATCATCAGA CAATCTGGAA AAAAAAGGCT GCCAACTATG ACCTTGTTCA TTTTACGCAC
CAGGGATATT TTGTTCCCGG TATTGAGAAT GCCGCAGTAA CGATACACGA CCTGATTAAG
CTGTATAACA AGAGTTATAC GGCCATTGAA ACAACTCACC CCCTTTTTCT GACAACAAAA
AAGATGATCA ATGATGCCGC GACCATTTTC GTGCCGTCAG AATTTGTGCG CAATGAGCTC
AGAAACTACT TTGCCGGCTG CGAAAAGAAG GTAAAGGTCA CCTATGAAGG CATTAAACCT
GTCTATCGAC AAACCCCTCC TGATCCCGCT GTTCTAAAGA AATACGACCT GCTGGATAAC
GGCAGGTTTT TTCTCTATGT TGGCCGATAT GAGTCAAGAA AAAACCTCGA CAGACTGATT
CTTGCCTATG CACAGCTCCC CGATACATTA AAAAAAGATA CCCTGCTGGT GCTGATCTGT
CCAACCGAAA AAAAATCGAC AAAAGAGCTG CAAAAAAAAA TCGCTGGCGC CGGTCTCGAA
AAAAATGTTC TGCATCTGGT ACACGTACCT GATAACGACC TCGTACACCT TTATAATGCT
GCCCTTGCGC TTCTTTTTGT ATCCTTCTCT GAAGGATTCG GTCTGCCGCT CGTTGAAGCC
ATGAATTGTG GATGTCCGGC TATAATTGCC AACAGCTCCT CGCTTCCTGA AATATCGGGG
AGCTCGTCCC TTCTGGTTGA CCCCTATGAC ACAGAATCAA TTCGTCAAGC AATGCTTGCA
ATCAGTGAAG ACTCCCTGCT ACGAAACGAT CTTTCAAAAA AATGTATCGT GCGAGCACAA
CGTTTTTCCT GGCAAACAAC CGCTCAGGAA ACACTGAAAG GATACCATGC AATGCTGAAT
AAACCGTAA
 
Protein sequence
MNIGIDFTHD LGYSGIGTYC RSLTEAMAQR EPENIYNILT LHHKIPEVQQ HFSNLRAIVY 
SAPFPNPMLL GGKCNKMIRK YHQTIWKKKA ANYDLVHFTH QGYFVPGIEN AAVTIHDLIK
LYNKSYTAIE TTHPLFLTTK KMINDAATIF VPSEFVRNEL RNYFAGCEKK VKVTYEGIKP
VYRQTPPDPA VLKKYDLLDN GRFFLYVGRY ESRKNLDRLI LAYAQLPDTL KKDTLLVLIC
PTEKKSTKEL QKKIAGAGLE KNVLHLVHVP DNDLVHLYNA ALALLFVSFS EGFGLPLVEA
MNCGCPAIIA NSSSLPEISG SSSLLVDPYD TESIRQAMLA ISEDSLLRND LSKKCIVRAQ
RFSWQTTAQE TLKGYHAMLN KP