Gene HMPREF0424_0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0153 
Symbol 
ID8708928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp174289 
End bp175338 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content48% 
IMG OID646482272 
Productthiamine biosynthesis lipoprotein, ApbE family 
Protein accessionYP_003373417 
Protein GI283782663 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.191913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGGA ATGTAATGGC GATTGAGCGC GCGCTTGGCA CGGGGATTAT TATCTCTAGC 
AGCGTGCCGA TTTCGCAGCG CGTGCAGAAT CGGATTCGCG ATTTTATTGA AGAGTATGAA
TCGGTGCTTT CGCGCTTTCG TGCGGATTCG CTTGTTTCGC GCATGGCTTG TGCGGAGCAT
GGCGGCGAGT TTGAGTTTCC AACGTGGGCT CAGCCGCTTT TTGCGATCTA TAGCGAGTTT
TACGATGCCA CGCACGGTGC TTTTGACGCT TGTATTGGTG CGGATCTGCT TGCGCTTGGC
TACAACAATT CTGTGCAATT CGTTCCGGAG TCGGCAGCTA GCGCAGGCAA GAACGATAAT
AGCAGCAGCT ACAGTTGCTC TAACTATCGC CGCGCTCTGC CAGTTAAGTG GGCAGATATT
TCGCGAGATG ACGGCGGCGC AACGCTTCAC ACAAATAAGC CAGTGCAGCT TGATTTTGGT
GCAGCCGGTA AGGGCTATTT TGTAGATCTT GTAATGCAGA TTATTAAAGA GGAGTTTAGT
GACGATTCGA CTGCGAATAA TTATTTTCCT TCGGATTTTG ATTTTTTGGT AAACGCAGGC
GGAGATATGC GCGCTTGCTT TAGCAAAAAG AATAGTCAAA TAAAAGTTGC GCTAGAAAAT
CCTTTTGACA CAACGCAAGC GGTAGGTGTG GCATCAATCG CAAGCGGAGC GTTGTGTGCT
TCGTCTGCTG CAAGAAGGCG CTGGAAAGTA AAAGACACAA ATTGCCTTGC AGCTGATGCT
TTTGAATCTA ATGTAGTTGC AACTCACCTT ATCAACGCTT TAGATGGCGT ACCTTCGCAA
AAACTTTCTG CAAGCTGGAC TTACGTTCCT GCTAAAACAT GTGCTTTTCC GACTGCTTAC
GCCGATGCGC TCGCAACTGC GCTTTTTATT TCGCAAGAAA GCGATTTGCA AAAAATCGCG
CAAACTACCG GCGCTGAGTT TGCTGTAATG CAGCCAAATC ATGCGCTTCG CAAAACGTGT
GCTTTCCCAG CGCGCTTTTT TGCTGAATAA
 
Protein sequence
MFGNVMAIER ALGTGIIISS SVPISQRVQN RIRDFIEEYE SVLSRFRADS LVSRMACAEH 
GGEFEFPTWA QPLFAIYSEF YDATHGAFDA CIGADLLALG YNNSVQFVPE SAASAGKNDN
SSSYSCSNYR RALPVKWADI SRDDGGATLH TNKPVQLDFG AAGKGYFVDL VMQIIKEEFS
DDSTANNYFP SDFDFLVNAG GDMRACFSKK NSQIKVALEN PFDTTQAVGV ASIASGALCA
SSAARRRWKV KDTNCLAADA FESNVVATHL INALDGVPSQ KLSASWTYVP AKTCAFPTAY
ADALATALFI SQESDLQKIA QTTGAEFAVM QPNHALRKTC AFPARFFAE