Gene HMPREF0424_1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1274 
Symbol 
ID8709329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1522515 
End bp1523822 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content40% 
IMG OID646483362 
ProductPAP2 family protein 
Protein accessionYP_003374464 
Protein GI283783710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.594886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAG AATTTGCAGG AATTCAATCC AATAGCATTG ACAATCAGCA TGATGATTCT 
GCTGAGAAAA ATGATGCGCT GCCGGTAGGT GGAACTTCGC AAGATGTTAA TCGCAACAGT
GAAGGTGGTG ACTATTCGCA TCCTTTGGAT TCGATTAATA CTATTACGCC ATTTTCTCCA
ATCTCATCTG AAGATAATGA CGAAAAAATA TCGCCTTTAT TTCCAGAAAA TAATGAAGAA
AAATCTTCTA AACAGTCTGT AAAACAGTTT GTGAAACAAA CTAAGCGTCA ATCTGATTTG
TCTGGCTCGT CCGATTTGTC GTATTCGACT AATCAGTTAT CTTCTAAAGA TTCTCTATCA
GATGCAGATG ATATTGATCA GCTTGGTCAG CTTGTAAAGC GTCCTAGAAT ATCTACCATA
TTGTGGTGTG TTGCTATTGC AATTGTTTTC TTGGCTTCTG CTGCGTTCAT ATGGTTTATT
AGTGTGCAAA CAGTACTTGG TCAAAGCTAT GAAGAAATGG TAATTGATGG ATTTGGTTCT
CATGGTACGC CATCTTGGCT TGCATTTTGT TTGCGACCTA TAAGCGTTTC TATGGTTGTT
ATAGTTACTA GCATAATAAT TGCTCTTGTT TCACTCATAG TAGTTTGTAT TCGTAAACGT
TGGTGGCTTC TTGGTCAATG TGCTGGAATT ATTATTCTTT CGGCTGCTGC TGAACCATTG
AAAAAAATTC TTCCGCGCCC AATGCTTATA AGTATTGAAT ATTTGTCTGC AAATTCAGCA
CCTTCAGGTC ATACCTTACT TATTACTGCA TCATGTGCGT TGCTTATTTG CGCAGTTTCG
CGTGTTTGGC GCGCATGGGC AGCGGTAGCT GCAGCTTTTA TAAGCGTTCT TGTGGAACTT
TCGTTGGTTG CAGCGCATTG GCATAGAGTT TCTGACGTTC TTATGTCATT ACTTATCGTT
GGTGCTGTAA CGCTTATTAT ACTCGCATGC ACTCGCAGTA GTGGAATGGA TATGCCAGCT
TATCGACGTT CCTCAATTAG CGTTCAAATT GTAGGAAGCT CTATGATTAC GATTGGTGTG
TTGGCGTGTT TGTATGCACT TTATTTAATA TGGCAAATTC TTCCTGGTGT TGATATTTTT
GCTCAATGGG CTGCAAGCGT ATCTTATGTA GCAACTTATT GGCTGATTAT AGGTGTGTCT
TTGCTTGTTT TTGGCGTAAT TATGGTAATG CGTCACTCTA CAGCTTCGCC GTTGAGTAGG
CTTGGATTAG TAGGCGCTCC GCCGACGCCT CCATCTGCTT CTAAGTAA
 
Protein sequence
MSEEFAGIQS NSIDNQHDDS AEKNDALPVG GTSQDVNRNS EGGDYSHPLD SINTITPFSP 
ISSEDNDEKI SPLFPENNEE KSSKQSVKQF VKQTKRQSDL SGSSDLSYST NQLSSKDSLS
DADDIDQLGQ LVKRPRISTI LWCVAIAIVF LASAAFIWFI SVQTVLGQSY EEMVIDGFGS
HGTPSWLAFC LRPISVSMVV IVTSIIIALV SLIVVCIRKR WWLLGQCAGI IILSAAAEPL
KKILPRPMLI SIEYLSANSA PSGHTLLITA SCALLICAVS RVWRAWAAVA AAFISVLVEL
SLVAAHWHRV SDVLMSLLIV GAVTLIILAC TRSSGMDMPA YRRSSISVQI VGSSMITIGV
LACLYALYLI WQILPGVDIF AQWAASVSYV ATYWLIIGVS LLVFGVIMVM RHSTASPLSR
LGLVGAPPTP PSASK