Gene HMPREF0424_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0398 
Symbol 
ID8709924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp427966 
End bp429111 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content46% 
IMG OID646482514 
ProductCHAP domain protein 
Protein accessionYP_003373647 
Protein GI283782893 
COG category[R] General function prediction only 
COG ID[COG3942] Surface antigen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATG CTGCACATGG TGCCACAAAA GCAAAGAGGC GTGGTGATTC GCCGCTTTCG 
CTGTTTGTAG CTCAGCATGG TGCGCATGGT GCGCATAGTG TGCGCGGTAT GCGTTCTGTT
CAGACTAATC GTGTTAGTGA GCGTTTGGCT GCGGTTGCTA CTAGCAATGG TGCTGTTGTA
GAGCTTCCAG AAGCTGTTTC TGAGCGTTTA CAGGAGCTTG TTCCTCAAAG CAGGCGCGCG
CTTAGGTTGT CTAAGCTTGC TAATGAACGT CGTCGCAATG TAATACTTTC GGCTTCTCTT
GTTGCAATGG TTGGTACTGT TGCAGCTACT ATGGCTTTAA CTAAGGTTAA TAGTTTTGCT
TCTCGCGCTT CCGAATTTGA GCCGTCTATG ACTGCAGAGG GCGACGCATC TTCAACAGTT
TCGCGTTCTA GCCAGCGTGA GTCTCTTGAT CCTATGGCGA CTATTAATAA CACTGTTGAC
TCTATTGAGA ATGCTGCTAA GAAATTGCAT AATGCGGATT CTGCTGATGC TGCAAATAGC
GCGAATGAAA ATAGTTCTGC CCGCTCGGAT TCGAAGCATA AAAACTTAGC CAGTAAGGTT
GCTAATTCAG TACAACCTGG CACTTTTTCT ACTATATCTT CGAAGTCTGA TTCGTGGAAT
TTAGGTTCTG ATTCAGGCTT TAATATTGCG GAAATGTCGC GTTCTGCTGC GAATAATCCT
AAGGTTGCTT TGTTGATAGA TAAAGACTTT GACGTTTTGC CGAAGGGCTT TAATCCAAAT
CACGCAACTG GGGATACCGG TAACGCGTAT GAGTTCAGCC AGTGCACGTG GTGGGTGTAT
ATTCGCCGCC ATCAGTTAGG TTTGCCTGTT GGCTCGCATT TTGGCAACGG AAATATGTGG
GCTGATTCTG CGCGCAGTCT TGGCTATTGG GTAGATAATT CCGCTCGTCA TGTTGGCGAT
ATCATGGTGT TCCGCTCGGG TCAGGCTGGT TCTGATCCAT TCTATGGGCA CGTTGCAGTT
GTTGAAAAAA TTAATTCAGA CGGTTCTATT GAAACTTCTG AATGTGGCGC TTCTTACCAA
GGACGAACTT TCTCGCGCAA GTTTGACGCT AAGGAAGTTT CTCAATATCA GTTTATTCAT
TACTAA
 
Protein sequence
MRHAAHGATK AKRRGDSPLS LFVAQHGAHG AHSVRGMRSV QTNRVSERLA AVATSNGAVV 
ELPEAVSERL QELVPQSRRA LRLSKLANER RRNVILSASL VAMVGTVAAT MALTKVNSFA
SRASEFEPSM TAEGDASSTV SRSSQRESLD PMATINNTVD SIENAAKKLH NADSADAANS
ANENSSARSD SKHKNLASKV ANSVQPGTFS TISSKSDSWN LGSDSGFNIA EMSRSAANNP
KVALLIDKDF DVLPKGFNPN HATGDTGNAY EFSQCTWWVY IRRHQLGLPV GSHFGNGNMW
ADSARSLGYW VDNSARHVGD IMVFRSGQAG SDPFYGHVAV VEKINSDGSI ETSECGASYQ
GRTFSRKFDA KEVSQYQFIH Y