Gene HMPREF0424_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0653 
Symbol 
ID8709977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp732491 
End bp735541 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content39% 
IMG OID646482759 
Producthypothetical protein 
Protein accessionYP_003373883 
Protein GI283783129 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.523938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTA ATCTGAATTT TGCAAAATTT ACTTATACTG CAAAAAAGTG GAGGTGGATT 
CTTGCGATAG TAGTGTTAAT TGTCGCGTTT GCTGTAGCTG CAATATTTGC TCTCGCAAGG
TTCATAACTG ATTGCATGTG GTATTCGCAG CTTGGATTTG AAAGTGTTAT TTGGATTCAG
CTTGCAGCAA AAGTGGGCGT TTGGGTGCTT TATGCAATAT TAATGGCTGC TTTTGGATAC
TTGTCTGCAG CTTTGGCAAT AAAAGCGCGT CCAAGCAATG CAGATGGAAC TTATATTCGA
ATTAAAGGCA ATGTTATTGA TTTAAAGCAG GGAATTAGTT CAAAGCTTGC TTTGCATGTT
GCAGGAATTT TTTCATTGAT TGTAGGCGCA GTATTTGGAT TTAAATTCTA CAATCATTGG
GCGCAAATTT TGTTAATGTT TAACGCCCAA CCTTTTGGCA TTAAAGATCC GCAATTTGGA
ATTGACAATG GCTTCTATGT ATTTGTGCTT CCAGGATTAA GGTTATTTAT AAGCGCATTT
GTGCTTTTAT TGGTAATATC TCTTGTTTTC TCTATTATTA CGCACATTCT TATGGGAGCA
ATTCGTTTTA GTATGCCTAT CGATGGTAAA GGCATATTAA GTATTACAAA ACGAGCACGT
AGACAAATTT CTGTATGGTT TATGCTTGTG ATTTTGACAT GGGCACTTCA ACAGATTTTA
GACATGTTCG CTTTAGTAAC TCTTGATGGA TCTCGAATTA CAGGCGGCTC ATACACTGAC
ATGAATGCTG GTGTTCCTGG AAGCATAGCA ATGGCAGTGA TAACAGTGTT AGTTGGTTCT
GTTATTGCAT TTTGGTTTAT GAAATCTCAT GCTTTATCTG TAGAAGCACC AATTTCAGTG
CGTTTTAAAA CAGCAATTAA GGCATGGAAA ACACCAATTA TTGCTTTAGC TTTTATGATT
GTTTGCTCAA TTGTTATATC TTTCGCTTGG CCGGCACTTG TGCAGCGATT TAAAGTTGCT
CCTAATGCTC AGGAATTGGA AGCTACTTAT ATTCAGCGTA ATATCGATGC AACAAAGTTT
GCTTACGGGC TAAATAACGT AAAGAAAGAA TCGTATAACG CTACTTCTAA AGGTAAATCA
GGTGCATTAT CAAAAGAAGC AGAGTCAACT GCGCAAATTC GTTTGTTAGA TCCACAAGTT
ACGTCTCCAA CGTTCCGTCA ATTGCAGCAA TCGAAGCAAT ATTACACTTT CGCAGATACG
CTTTCAGTTG ATAAATACGA TATTGACGGT GTAAGTCAAG ATACGGTTAT TGCTGCGCGC
GAGCTTGATT TAGCTGGCAA CGATAATCGA AACTGGGTTA ATGATCATAC AGTTTATACG
CATGGCTATG GAATTGTTGC AGCTTACGGA AATAAAGTAA CAACAGATGG ACAGCCACAA
TTTATGGAGT ATGGTATTCC TGCTCAAGGC AAACTTACTA AGTCACAAAA GTATGAACCG
CGTATTTACT TCTCACCAAA TGCTCCTGAG TATTCTATCG TTGGTTCTCC AAAAGGGACA
TCTCCATGGG AGTTTGATTA TCCGACTGGC TCTCAAGGTG CTTTAACTAC TTTCAAAGGC
AATGGTGGTC CAAGTTTAGG AAACTTCTTC TCTAAACTTT TGCATGCAAT TAGATTTGAG
TCTGATCAGA TTCTTTTTTC TGACCGTGTG ACTTCTGAAT CTCAAATTTT GTATGATCGT
GATCCTAAGA CACGCGTGTC TAAGGTTGCT CCTTACTTAA CTTTGGATGG TCGAGTTTAT
CCTGCAGTAG TAGATGGTCG CGTTAAGTGG ATTGTTGATG GCTATACTAC TTCTGATTCT
TACCCTTATT CTCAAATGAC TGATTTTGGT AAGGTAACTC AAGATTCCAC AACAACAACT
TCTCATTCTA TTAAAGGGTT GACAAATCAG CGCGCAAACT ACATTCGTAA TTCTGTGAAA
GCTACAGTTG ATGCTTATGA TGGATCTGTT GATTTGTATG TGTGGGATAC GAAAGACCCT
GTGATTAAAG CATGGCGTTC AATTTTCCCA GGTCATTATC ATGACATTTC TAAGATTTCT
GGCGATTTGA TGAAGCATTT GCGTTACCCA GAAAGCTTGT TTAAAGTTCA ACGTCACTTA
TTAGCAAAGT ATCACGTCGA TACAGCAAGC CAATTCTTCT CTGGAGAAGA CTTCTGGCAG
ATTCCTGTTG ATCCAACTGA ATCTCAAAAA GCACAAAAAG AAGATATTTT GCAACCTCCA
TACTATTTGA CTTTGCAAAC TGGAGACGCA AAGAAACCAG TATTTTCGCT TGTTTCTACT
TATATTCCGG CAGGTAAAAG TACTCGTGAA ATATTAACTG GTTTCTTGTC TGTTGATTCA
GATGCGGGGG ATACTCCTGG AAAGATTGGT CCAAATTACG GTAAGATTCG TTTGCAAGAA
TTACCTAAGA TATCTAATGT TCCAGGACCT GGTCAAGCTC AGAATAATTT CAACGCTAAT
GCAAATGTTT CTAAAGAATT GAATTTGCTT GAATCCGGTT CTACTAAAGT TAAGCGCGGC
AATTTGATTA CATTGCCTCT TGGCGGTGGG CTGGTTTATG TAGAGCCTGT TTATGTGCAA
TCTAGCGGTT CTACTAGCTA TCCTTTGCTT AAGAAGGTGT TAGTTGCGTT TGGAGATCAA
GTTGGATTTG CAGATACGTT AGATGAAGCA CTTAATCAAG TATTTGGCGG AAACTCGGGC
GCTTTGGCTG GAGATGCTTC TAATAATTCT TCTTCAAACA ATGCTGCTGA AAATAATGAA
TCTAAGGATG CTGATTCTAA AGAAGGTACT TCTAAGGAAA ATGAGTCTAC GAATTCAGAA
AAATCTCATT CTATGAGCCA AAAAGCTAAA GAAGCACTTA AACGTGCGGC TCAAGCATTA
AAAGATTCTG ATTCTGCTAT GCGATCCGGA AACTGGGAAG CTTACGGAAA GGCACAAAAA
GAGTTGAGCG ATGCTATTAA TGAAGCAATG AAAGAAGAAT CAGGTAAGTA A
 
Protein sequence
MKVNLNFAKF TYTAKKWRWI LAIVVLIVAF AVAAIFALAR FITDCMWYSQ LGFESVIWIQ 
LAAKVGVWVL YAILMAAFGY LSAALAIKAR PSNADGTYIR IKGNVIDLKQ GISSKLALHV
AGIFSLIVGA VFGFKFYNHW AQILLMFNAQ PFGIKDPQFG IDNGFYVFVL PGLRLFISAF
VLLLVISLVF SIITHILMGA IRFSMPIDGK GILSITKRAR RQISVWFMLV ILTWALQQIL
DMFALVTLDG SRITGGSYTD MNAGVPGSIA MAVITVLVGS VIAFWFMKSH ALSVEAPISV
RFKTAIKAWK TPIIALAFMI VCSIVISFAW PALVQRFKVA PNAQELEATY IQRNIDATKF
AYGLNNVKKE SYNATSKGKS GALSKEAEST AQIRLLDPQV TSPTFRQLQQ SKQYYTFADT
LSVDKYDIDG VSQDTVIAAR ELDLAGNDNR NWVNDHTVYT HGYGIVAAYG NKVTTDGQPQ
FMEYGIPAQG KLTKSQKYEP RIYFSPNAPE YSIVGSPKGT SPWEFDYPTG SQGALTTFKG
NGGPSLGNFF SKLLHAIRFE SDQILFSDRV TSESQILYDR DPKTRVSKVA PYLTLDGRVY
PAVVDGRVKW IVDGYTTSDS YPYSQMTDFG KVTQDSTTTT SHSIKGLTNQ RANYIRNSVK
ATVDAYDGSV DLYVWDTKDP VIKAWRSIFP GHYHDISKIS GDLMKHLRYP ESLFKVQRHL
LAKYHVDTAS QFFSGEDFWQ IPVDPTESQK AQKEDILQPP YYLTLQTGDA KKPVFSLVST
YIPAGKSTRE ILTGFLSVDS DAGDTPGKIG PNYGKIRLQE LPKISNVPGP GQAQNNFNAN
ANVSKELNLL ESGSTKVKRG NLITLPLGGG LVYVEPVYVQ SSGSTSYPLL KKVLVAFGDQ
VGFADTLDEA LNQVFGGNSG ALAGDASNNS SSNNAAENNE SKDADSKEGT SKENESTNSE
KSHSMSQKAK EALKRAAQAL KDSDSAMRSG NWEAYGKAQK ELSDAINEAM KEESGK