Gene HMPREF0424_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1035 
Symbol 
ID8709339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1174779 
End bp1176848 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content37% 
IMG OID646483128 
Productarylsulfatase 
Protein accessionYP_003374240 
Protein GI283783486 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCAT TTTTGCGCTT GAAAGTTCCG TATTGGCTGT ATGCATTTCT TTTCTTTGTT 
ATTGACGGCA TGTCGGTAAT TATTGTGCAA ATGGGTGTGC AGCGCGCTGG AAGAGTTGAG
ATGACTTCTG CAATGTCTGG TGGACGTTGG GGATTAATAA CTAAAACCTG GAAAGAACTT
AATTTCGTTA TTGTTCTTAG TGCGCTCGTT GTTGCGATGA TATACGGTTT AATTCTTTTG
ATTAGTAACC GATTTTGGAT ATCAAGTGCA ATAATTCTAA GTGTTTCATT GCTTATTGCT
GTTATTGAAT ATATGAAAGT AAATGTTCGT TATGAAACTA TTCTTCCTGC GGACTTAAAC
TTTTTAAAAA GTAACACTGG TAATGTCGCT TCATTTTTGC CAGATAATGC TCCTATGGTC
ATTGGTGTTG CTTTAGGCGT TTTTATTTTG CTTTTAGTTA CAACTGTTTG CTTACACATA
TTTGATACAA ATCATGGAAA GATTGTGCGA TTTAAAGATT GGCGTTATTC TGCTGGTATT
CGTGTTGCAA TTGCGTTGAT TTTAATAGGG AATTTAAGCT GGTATATTGG CGGCGTTGGC
ACAGTTGACT CTTCGGCTAA TGTTTTTTCT AAAATGCTTG GTGACAGTCC TGCTATGTGG
GATTCTGTTT ATGACGCTCA ACGCAATGGT GCTATTGTGG CATTTTTGCG TAATGTAAAT
CCTAAAATTA TGGATAAACC GGCTGATTAC AGTGAAGAAA CTATGAAGGC TGTATACAAG
CGTTATAACG ATGAAGCTAA ACGTATAAAT CGTTCTCGTA CAACAAATAT GAACGATAAT
ACTTTTATTC TTATCTTGTC TGAATCGTTC TCTGATCCTA CTCGTGTTCC TGGTTTGAAG
TTGAACAAGA ATCCAATACC ATTTATTAGC AATTTAAAGA AGCATACGGA TAGTGGTTTG
ATGCTTTCTT CTGGTTATGG TGGCGGTACT GCAAATCTAG AGTACATGTC TTTAACTGGT
TTGAGCATGG CTAATTTTGA TCCTTCAATG ACTAGCCCGT ATCAGCAGTT GGTTCCTAAT
GCTCAATGGT CTCCAACTAT TAATCAATAT TGGGATGATT CTCGTAATTC TATTAAATCT
ATAGCTTTCC ACCCGTATGA GCCGAGCATG TATTTGCGCG CTACTAATTA CAAAAAGTTT
GGATTTAGTA AGTTCTACGC ATTGCAAGGA CCTGATGTTA TTGCTCACCG TGATGTTATA
GATAAATCAC CGTATGTTTC TGACGCATCA GCATATAAGA GTGCGTTAGA AAAGATTAAA
GAGCATAAGC AACCTAGGTT CGTACAAATT GTTACTATGC AAAACCATAT GCCTTATAGG
GATTGGTATG CGAATAATGA ATTCGAAGCG TCTTCTAAAG ATGGTGCCGC TGATCTTGGT
GATGATGAAA AAACTTCTAT TGAAACGTAT GCTAAGGGTG TGCAGCACAC TGATGAGACA
ACACAAGCGT TTTTGAAGAG TTTAGATAAA CTAAATAAGC CTATTACTGT TATGTTTTAT
GGTGACCATT TGCCAGGAAT TTACGGCACT GCAAGTTCAG ATGAAAAGAA TTCATTGTCT
TTGCATTTGA CTGATTATTT TATTTGGTCA AATAAAGTTG CTTTGGAGCG TAAAGAGCGT
AGTGAGAAAA AATCTTCAAA CAAAAATGTA GAGCATTCTA ATACAAGTAA AGATGTAGAT
CGTACTAATA AGTATTCTTC GCCTAACTTC TTTATTTCTC AGGCTGCTTT ACATATGAAC
GCTAAAGTTT CCCCGTATCT TGCTTTCTTA ACTCGTTTAC ATGAGCATGT GAGTGCTATG
GAGCCTCCTG TTGTAAATAC TATTCAAGGC TGGGATAGAA TTCCTGAAGG ACAATCAATT
TATTTGGATA ACGATGGTAA TCCGATGATT TTGTCTAAGA TGGATAAGAA GTCACGTCAA
CTTCTTCACG ACTATCGCTT AATACAATAC GATATTACTG CTGGAAAGCA TTATTTGCGT
AATACTGATT TTATGAAATT ACCGCGCTAA
 
Protein sequence
MKSFLRLKVP YWLYAFLFFV IDGMSVIIVQ MGVQRAGRVE MTSAMSGGRW GLITKTWKEL 
NFVIVLSALV VAMIYGLILL ISNRFWISSA IILSVSLLIA VIEYMKVNVR YETILPADLN
FLKSNTGNVA SFLPDNAPMV IGVALGVFIL LLVTTVCLHI FDTNHGKIVR FKDWRYSAGI
RVAIALILIG NLSWYIGGVG TVDSSANVFS KMLGDSPAMW DSVYDAQRNG AIVAFLRNVN
PKIMDKPADY SEETMKAVYK RYNDEAKRIN RSRTTNMNDN TFILILSESF SDPTRVPGLK
LNKNPIPFIS NLKKHTDSGL MLSSGYGGGT ANLEYMSLTG LSMANFDPSM TSPYQQLVPN
AQWSPTINQY WDDSRNSIKS IAFHPYEPSM YLRATNYKKF GFSKFYALQG PDVIAHRDVI
DKSPYVSDAS AYKSALEKIK EHKQPRFVQI VTMQNHMPYR DWYANNEFEA SSKDGAADLG
DDEKTSIETY AKGVQHTDET TQAFLKSLDK LNKPITVMFY GDHLPGIYGT ASSDEKNSLS
LHLTDYFIWS NKVALERKER SEKKSSNKNV EHSNTSKDVD RTNKYSSPNF FISQAALHMN
AKVSPYLAFL TRLHEHVSAM EPPVVNTIQG WDRIPEGQSI YLDNDGNPMI LSKMDKKSRQ
LLHDYRLIQY DITAGKHYLR NTDFMKLPR