Gene HMPREF0424_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0791 
Symbol 
ID8709136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp896414 
End bp898333 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content43% 
IMG OID646482892 
Producthypothetical protein 
Protein accessionYP_003374009 
Protein GI283783255 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.654637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGAAG GAAGCAACGG AAGGCGACGT TTTTCCGACA AATGGGGCAA ACACGAGTTA 
GATGTACTCG CTGTATTGTC TTCCGCATTC CCGCAATGGC TGACTTCGCG TCAAATTGCA
CAGCGCGTAA AGGCATATGC AGATTCATAC GGTGAGCTTG CAGATCAAGC AGCTAAAGCA
GCATTCGCAA AACAGTTCCA GCGAGATCGC GCAAAGCTTG CTGCTATGGG TATTGCAATA
GAGTCTAGGC AGCCAGAATA TTCTTCGAAG TCCGAAGGTC AAGACTTTGC GTCTTATCGT
TTGCAATTAG GCGATGAGCC TCGCGTACGT TTACGTTTCG ACCAATCTGA ATTACCAGTT
TTAGCTGCAG CAAATTATCT TGCTCGTTCA ATGTCTATTT CTTCTCTTGA TTCAGGTCAA
CACGAGCAAC AGCGCACTTC TCGCACTGCT CCTAGAGTTC CACAAACTCC TATTCCTGGC
TTAGGATTAG ATTCAATCGC ACCTGGTCTT GGAACACAAA TGCTACCTGA TTCATTGGTA
AAAGTTATTG ATTCTCGCAG ATTTGCCGCA ACTGTGGATG TCGATGGTGA ACATTTAAAC
GTTGCTTATA CTGATGCTGA CGATTTAGCG ATGTTTGTAC TAGAACATCC TGGATCTAGC
GTAGTAAGCC CTCAAGAAGC AGTTGATGCT TTCCATCGCC GTTTGCACGC AGCAGTTAAT
TTTGCTCAAT ATGACGAAAA TGAGCAAAAA GATGAAGAAG GTGCTATACA AGAAGCGCAA
AATATTGAAG AACAAAATAG CAACACTGAT AAATCGCATA CCAAAAAAGG CTCTTCATTC
CAAACTGGAA GTGAAGTAGA TCGTAGGCTT CGATTGATGC TTTTCTTGTC TGCTCATCTT
GGAGAGGAAT TCCCATTAGA CGAGCTTGCT GAGCGTTTTA TTGGTAGACC AAAAAGCGAT
GATGAACTTC GTCGATTTGT TACAATTCTT CACAAGGATA TAAATACTCT TACTACTGTT
TCTGACGATG GGGAAATGGC CGGAAGCCAA TTCTTTGATA TAGATTGGAC TCTTCTTGAG
AACGAGGGAA TTGTTTCTGC TACTAACTCT TTAGGATTGG AAAGACTTGC TGGAATTTCA
CAGCAATATT TGAGCATGCT TACTGCGTCT GCTAATTATT TGGCGCATTC CTTGCTTTTG
CCTATTGAAC AGCGCACACA AGCTGAATCC TTGTATAAGC GTCTTCGCCG TCATGTTCTT
CCTGGTCAAA CTCCATGGCT GAGTTTAACA GGTTACGAAA TTGAGCCTCG AAATTGTTCT
ATTGTTCGTA GTGCTATTAA CTCTGGATCT TTACTAGATA TGGAATACAC TGACGGGGCA
GGACGTATTC GTCGCAAGAT TGTTGCACCG TCGAAGATTT ACGTTGATGA AGGCGTTTAC
TATGTTGCAG TATGGACTGA TGTAGAGCAG CAAACGCCAG AAGATAAACG TGAATTTGTT
GCTAAAGATA CGACAATTAA TAAAGCGAAT GGATTGCCTC GCATTTGGCA AGTTCTTCGC
GTAGCACGTA TTGAACGTGC AGAAGTTGTA GAGCCTGTAT CTCAAGTTAA AATTCCAGAT
GTGCCAGTTA GTGAACTTCG CAAGTGGAGC TTTGATAACG GAACTGAAAC TTGCTTCATT
ACCGATGAAA CAGAATTGAA CTTTTTGAAG AATCTTTCTG GAGCCACTAT GGAAACTTGT
GGCAGCGGAG TAAAAGTCCA TCTTACTGTT TCTTCTGATT CATGGTTCGT TGCTTTTTGC
ATTGCTCACG CACGTCATAT TACAGCTGTT GCTCCTGAAA CATTGCGCAG TATGATTATT
GCTCGCGCTC AACGTGAATT AAGCGTTAAT CACGCTGATA ATGCGAATAC AGAGGAATAA
 
Protein sequence
MVEGSNGRRR FSDKWGKHEL DVLAVLSSAF PQWLTSRQIA QRVKAYADSY GELADQAAKA 
AFAKQFQRDR AKLAAMGIAI ESRQPEYSSK SEGQDFASYR LQLGDEPRVR LRFDQSELPV
LAAANYLARS MSISSLDSGQ HEQQRTSRTA PRVPQTPIPG LGLDSIAPGL GTQMLPDSLV
KVIDSRRFAA TVDVDGEHLN VAYTDADDLA MFVLEHPGSS VVSPQEAVDA FHRRLHAAVN
FAQYDENEQK DEEGAIQEAQ NIEEQNSNTD KSHTKKGSSF QTGSEVDRRL RLMLFLSAHL
GEEFPLDELA ERFIGRPKSD DELRRFVTIL HKDINTLTTV SDDGEMAGSQ FFDIDWTLLE
NEGIVSATNS LGLERLAGIS QQYLSMLTAS ANYLAHSLLL PIEQRTQAES LYKRLRRHVL
PGQTPWLSLT GYEIEPRNCS IVRSAINSGS LLDMEYTDGA GRIRRKIVAP SKIYVDEGVY
YVAVWTDVEQ QTPEDKREFV AKDTTINKAN GLPRIWQVLR VARIERAEVV EPVSQVKIPD
VPVSELRKWS FDNGTETCFI TDETELNFLK NLSGATMETC GSGVKVHLTV SSDSWFVAFC
IAHARHITAV APETLRSMII ARAQRELSVN HADNANTEE