Gene DET0203 details


Gene Information

Locus tag: DET0203
Symbol:
ID: 3230494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dehalococcoides ethenogenes 195
Kingdom: Bacteria
Replicon accession: NC_002936
Strand:
Start bp: 195733
End bp: 196728
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 51%
IMG OID: 637119770
Product: glycosyl transferase, group 2 family protein
Protein accession: YP_180951
Protein GI: 57235024
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
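
The coordinate and length fields above appear to be mutually consistent. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive, and assuming the gene length counts a terminal stop codon that is not translated. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 195733
end_bp = 196728

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")     # 996 bp
print(protein_length_aa, "aa")  # 331 aa
```

Both results match the Gene Length and Protein Length fields reported in the record.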


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.97495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACC CGTTTATTTC GGTAGTGGTT TCAGCATACA GCATGGAGCG TTATACCGAC 
CTGTTATCCC TCATAGACAG CCTTAAATCC CAGGCTTATA CCAATTTTGA AGTAATACTG
GTTATTGAAA AATCCAGCCA GCTTTTTTCC AACCTGAACC GCTGTCTTTC GCAGCACCCT
TACAGCCGTA ATTTCAAGCT CTTTTTCAAT GACGGTCTGC CGGGGCTGGC CAGTGCCCGC
AACCTGGGGG TGGAGAAGTC ACAGGGGGAA ATAATAGCTT TTATAGATGA TGATGCGGCC
GCTTCACCGG AGTGGCTTGG CTGTATCGCC TGCTGTTTTA GGGAGGATAA AAAAATCATA
GGGCTTACCG GCCCGGCCTT GCCCTGGTGG GAACACCCCG GACTCAGCTG GTTTCCGGCG
GAGTTTAACT GGGTCATAAG CACTATGCCC TGCCTTCCCC GCAGCCGCGG CTATATCAGA
AACGCCTGGG GAACCAATAT GGCTTTTCGG CGGCAGGCTT TCAGTAACGG GCAGGGCTTT
GATACCGAAC TGGGCGCTAA GGGCGGGGGT AAAAAGGGCA AGTCAGAACT GGTGGGTGAG
GATACCGAAT TTTGCCTGCG TATCTGCCGG CAAAGCGGCC TGAATATCCT TTACGCACCC
GAAGTAAAAG TTATGCACCG GGTATACCGT TACCGGCTGA AGCCGGGTTT TATTGCCAAG
AGGGCTTATT GGGAAGGGTA TACCAAGGCG GTTTTTAAAT ACAGGCTGGG CAAAGAAACA
AGCGGGGCAG GGGTTTTATC TGAAGAACTG GGTTTGCTTA GGCGGATTTT TCTCCGTCTG
CTTCCCCTCA GCTTGCTAAA TCTGCCTCTC AAACCCCGCC GGTCTGCTTT AACTATTCTG
ACCACCTTAA ATATTCTGGG TTCTACCGGC TGGGGTTATA TACGCGGAAT GGCCTCAGCC
AGATTTATAT CAGTCAAGGA GGCGGTAAAT GCCTGA
 
Protein sequence
MADPFISVVV SAYSMERYTD LLSLIDSLKS QAYTNFEVIL VIEKSSQLFS NLNRCLSQHP 
YSRNFKLFFN DGLPGLASAR NLGVEKSQGE IIAFIDDDAA ASPEWLGCIA CCFREDKKII
GLTGPALPWW EHPGLSWFPA EFNWVISTMP CLPRSRGYIR NAWGTNMAFR RQAFSNGQGF
DTELGAKGGG KKGKSELVGE DTEFCLRICR QSGLNILYAP EVKVMHRVYR YRLKPGFIAK
RAYWEGYTKA VFKYRLGKET SGAGVLSEEL GLLRRIFLRL LPLSLLNLPL KPRRSALTIL
TTLNILGSTG WGYIRGMASA RFISVKEAVN A