Gene HMPREF0424_1286 details


Gene Information

Locus tag: HMPREF0424_1286
Symbol: thiO
ID: 8709585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 1535313
End bp: 1536383
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 49%
IMG OID: 646483374
Product: glycine oxidase ThiO
Protein accession: YP_003374475
Protein GI: 283783721
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR02352] glycine oxidase ThiO
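
The coordinate and length fields in this record agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1535313
end_bp = 1536383
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # 1071 356
```

Under those assumptions, both computed values match the Gene Length and Protein Length fields.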


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAATA TTCGTATTAT TGGTGCAGGT ATTATTGGTT TAGCAACAGC GTGGAGACTT 
ACGGAAGCTG GCGTAAAAGT TAGCATAATT GATCCAGATC CAGTCTCGGG CGCTTCTCAC
CATGCTGCTG GCATGTTAGC TCCTACCGCA GAAATGCAAT ATCAGCAAGA AGCGTTGTAT
CCTCTTATGT TTGAGTCTGC AACTATGTAT AAGGAATTGG TTGATTCGGT TGCAAAGTAC
ACCGATAAGC CAACTGGCTA CTTAGAGTGC GGCTCTTACG TAATTGGCGG TGACGCTGCA
GATAACGAGC ACGTAAACGA GTTGCTAAAC TTGCAGCATC GTTACGGCCG TAGAGCTGAG
CATATTTCTG TGAGTAAGCT TCGTGAGATT GAGCCTGCTC TTTCGCCAAC TATTGCTGGC
GCTGTAAGCG TGCCGGACGA CCATCAAATC AACCCTCGTC TTTTCACTGC AGCAATGATT
GATGCACTCG AAAAGCGCGG TGTTGTTATT GAGCGTCGCG CAAGCACTCC AGACGATTTG
GGTCCAGACA CAGTTTTGGC ATCCGGCTTG GGTGCTAAGG ATATGCTTCC TCAGCTTAAG
CTTCGCGCTG TGTGGGGCGA TATGATTCGT ATGACTATTC CAGAGCCTTT GCGCCCAATT
TTGAGCTCTG TGGTTCGTGG ATTTGTGCAT GATCGTCCAG TTTATTTGGT TCCTCGCGCT
AAGGAAACAG GCGAGCTTGT GCTTGGCGCT ACTTCTCGCG AAGATGACCG CGATATTCCA
AAGGTTGGTG GCGTACTCGA CTTACTTCGC GATGCTGCAA GCGTGCTTCC TGGCATTCAG
GAATGCTCGA TTACTGAGGT TACCGCTGGC GGCCGCCCTG GAACTCCGGA CGATTTGCCA
TTCATTGGTA TTGATCCTAA GACGCATACC ACGATTTCCA CTGGTTACTT CCGTCACGGC
ATTTTGCTCA CCGCTTTGGG TTCTTCTTTG ACTACAAAGC TTGTGCTTGA GCAAGAATTG
ACCGAGAAGG AAAAGGGCTT CTTGGAAAGC TGCAATCCTG CTCGCTTTTA A
 
Protein sequence
MRNIRIIGAG IIGLATAWRL TEAGVKVSII DPDPVSGASH HAAGMLAPTA EMQYQQEALY 
PLMFESATMY KELVDSVAKY TDKPTGYLEC GSYVIGGDAA DNEHVNELLN LQHRYGRRAE
HISVSKLREI EPALSPTIAG AVSVPDDHQI NPRLFTAAMI DALEKRGVVI ERRASTPDDL
GPDTVLASGL GAKDMLPQLK LRAVWGDMIR MTIPEPLRPI LSSVVRGFVH DRPVYLVPRA
KETGELVLGA TSREDDRDIP KVGGVLDLLR DAASVLPGIQ ECSITEVTAG GRPGTPDDLP
FIGIDPKTHT TISTGYFRHG ILLTALGSSL TTKLVLEQEL TEKEKGFLES CNPARF
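
Both sequences are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before counting residues or computing GC content. A minimal sketch follows; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used only for display."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last displayed lines of the gene sequence, copied from the record.
first_line = "ATGCGAAATA TTCGTATTAT TGGTGCAGGT ATTATTGGTT TAGCAACAGC GTGGAGACTT"
last_line = "ACCGAGAAGG AAAAGGGCTT CTTGGAAAGC TGCAATCCTG CTCGCTTTTA A"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(first_line)[:3])   # ATG
print(clean_sequence(last_line)[-3:])   # TAA
```

Passing the full gene sequence to `clean_sequence` and taking its length gives a value to compare against the Gene Length field, and `gc_percent` gives one to compare against the GC content field.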