Gene HMPREF0424_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1070 
Symbol 
ID8709103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1221510 
End bp1222916 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content43% 
IMG OID646483162 
ProductCHAP domain protein 
Protein accessionYP_003374273 
Protein GI283783519 
COG category[R] General function prediction only 
COG ID[COG3942] Surface antigen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.518455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG ATCGACATAG CATGTATGCA ATTCGACGCA CTCACACTGT AGTGAGTTTA 
ATAGTTGCTA TTTCAATGAT GTTTGCTGGT GTTGCTGGTG CATTATTTAA TGCGTCTACA
GCAAGTGCTG TCAACATGAA GGAATATAGG CGCAAAATAC AACGAAACGA GAATTTGAAG
CACCAACTTG CTGGTGTAAG TAAACAACTT GCAAATAAGA TCATTGAATT GAATGATTTA
AATGAGCATA AAATTCCAAA TCAAATACGA GCTGCTCAAC AAGCAGAAGA GCAAGCTTCT
CAGGCGCGCA GTCTTGTAGA ATCAACGCAG CAGCGTCTAG AGTCTGCACA AAAAGATAAG
CGTGATTTGG AAGAGAAAAT TAAAAAAACT GGTGAAGATT TTGATGATGC TAAGGCTGCT
GTAGCACAGC TTGCTCGTAA GAGTTTCCAT GGTTCTCAGG CATCTAATGT TATGGATATT
GTGACTAAAT CAACTACTAC AGAACAGTTT GTTGGAAAGT TACAATCCGA GGCTGCTGTT
GCTAGAAGTG AAGCAAATGC TGCTAATGAT GCTGCTATTA AGTTGAATAC TTCTATGAAT
CGTCGTCAGC GTTTGGCTGC TATTGAAGCA GAAATAACCA AGTTAAAGCA GCAGGCGGAT
GTTCAAGCGC AAAGTGCTCA AATCGCTGCG AAAGCCGCCA ATGATCGCAA AAAGTCCTTG
CAAAGTTTGC GAGATCATGG TGAAGAGATT CGTAAGCAGC TGGAAGGTCA AAAGAGTAGT
TTGACTTCTC AGTCTGCGCG TGAGGCTGCA GAAATTATAG CAATGAAATC GGCCATCGAT
GCTCAAGCGC GTGCTCTTGC TAATAAGTCA TTCGCTAAAG CTGATCCGAA TGCTGGTAAT
CGCCAGCAAT TGAATGGAAG CTCTGGAAAG AATGTTTACC TTCCTCCGAT GCAACTTTCT
AATGGTGCTG CTAATGGTAT GAATTATACT GTTCCTGGAA ATTGCGCAGA AGGTTCTAAG
TTCTGCTACG GGCATAATAC TGGAAACACT GTTGGTGGTT CTGCTTATCC TGCTCGTCAA
TGCACGTTAT GGGCTTATTT GCGTCGTTCG CAACTTGGTC TGCCAGTTGG CTCCTACATG
GGTAATGGTG CTGAGTGGGC GAATACTGGT CGTAGGTTTG GATATTTAGT TAATAGAATT
CCACATGTTG GTGCTGTTAT GGTGTTTGCT CGCGGTCAAC GCGTCGGCAA CTGGAATGCT
GATTGGCAAT ATGGACATGT TGCGGTTGTG GAACGTGTGA ATGCTGATGG ATCAGTTCTT
ATTTCTGAAG GTGGAACTGG TTTTTCGACA TTCCCTGCTT ATGAAACTAT TTATAATCCA
GGCGATTACG AGTATGTGCA TTATTAA
 
Protein sequence
MSEDRHSMYA IRRTHTVVSL IVAISMMFAG VAGALFNAST ASAVNMKEYR RKIQRNENLK 
HQLAGVSKQL ANKIIELNDL NEHKIPNQIR AAQQAEEQAS QARSLVESTQ QRLESAQKDK
RDLEEKIKKT GEDFDDAKAA VAQLARKSFH GSQASNVMDI VTKSTTTEQF VGKLQSEAAV
ARSEANAAND AAIKLNTSMN RRQRLAAIEA EITKLKQQAD VQAQSAQIAA KAANDRKKSL
QSLRDHGEEI RKQLEGQKSS LTSQSAREAA EIIAMKSAID AQARALANKS FAKADPNAGN
RQQLNGSSGK NVYLPPMQLS NGAANGMNYT VPGNCAEGSK FCYGHNTGNT VGGSAYPARQ
CTLWAYLRRS QLGLPVGSYM GNGAEWANTG RRFGYLVNRI PHVGAVMVFA RGQRVGNWNA
DWQYGHVAVV ERVNADGSVL ISEGGTGFST FPAYETIYNP GDYEYVHY