Gene Information |
Locus tag | HMPREF0424_1035 |
Symbol | |
ID | 8709339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1174779 |
End bp | 1176848 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 646483128 |
Product | arylsulfatase |
Protein accession | YP_003374240 |
Protein GI | 283783486 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
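The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1174779, 1176848)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (2070 bp) and Protein Length (689 aa) fields.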
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCAT TTTTGCGCTT GAAAGTTCCG TATTGGCTGT ATGCATTTCT TTTCTTTGTT ATTGACGGCA TGTCGGTAAT TATTGTGCAA ATGGGTGTGC AGCGCGCTGG AAGAGTTGAG ATGACTTCTG CAATGTCTGG TGGACGTTGG GGATTAATAA CTAAAACCTG GAAAGAACTT AATTTCGTTA TTGTTCTTAG TGCGCTCGTT GTTGCGATGA TATACGGTTT AATTCTTTTG ATTAGTAACC GATTTTGGAT ATCAAGTGCA ATAATTCTAA GTGTTTCATT GCTTATTGCT GTTATTGAAT ATATGAAAGT AAATGTTCGT TATGAAACTA TTCTTCCTGC GGACTTAAAC TTTTTAAAAA GTAACACTGG TAATGTCGCT TCATTTTTGC CAGATAATGC TCCTATGGTC ATTGGTGTTG CTTTAGGCGT TTTTATTTTG CTTTTAGTTA CAACTGTTTG CTTACACATA TTTGATACAA ATCATGGAAA GATTGTGCGA TTTAAAGATT GGCGTTATTC TGCTGGTATT CGTGTTGCAA TTGCGTTGAT TTTAATAGGG AATTTAAGCT GGTATATTGG CGGCGTTGGC ACAGTTGACT CTTCGGCTAA TGTTTTTTCT AAAATGCTTG GTGACAGTCC TGCTATGTGG GATTCTGTTT ATGACGCTCA ACGCAATGGT GCTATTGTGG CATTTTTGCG TAATGTAAAT CCTAAAATTA TGGATAAACC GGCTGATTAC AGTGAAGAAA CTATGAAGGC TGTATACAAG CGTTATAACG ATGAAGCTAA ACGTATAAAT CGTTCTCGTA CAACAAATAT GAACGATAAT ACTTTTATTC TTATCTTGTC TGAATCGTTC TCTGATCCTA CTCGTGTTCC TGGTTTGAAG TTGAACAAGA ATCCAATACC ATTTATTAGC AATTTAAAGA AGCATACGGA TAGTGGTTTG ATGCTTTCTT CTGGTTATGG TGGCGGTACT GCAAATCTAG AGTACATGTC TTTAACTGGT TTGAGCATGG CTAATTTTGA TCCTTCAATG ACTAGCCCGT ATCAGCAGTT GGTTCCTAAT GCTCAATGGT CTCCAACTAT TAATCAATAT TGGGATGATT CTCGTAATTC TATTAAATCT ATAGCTTTCC ACCCGTATGA GCCGAGCATG TATTTGCGCG CTACTAATTA CAAAAAGTTT GGATTTAGTA AGTTCTACGC ATTGCAAGGA CCTGATGTTA TTGCTCACCG TGATGTTATA GATAAATCAC CGTATGTTTC TGACGCATCA GCATATAAGA GTGCGTTAGA AAAGATTAAA GAGCATAAGC AACCTAGGTT CGTACAAATT GTTACTATGC AAAACCATAT GCCTTATAGG GATTGGTATG CGAATAATGA ATTCGAAGCG TCTTCTAAAG ATGGTGCCGC TGATCTTGGT GATGATGAAA AAACTTCTAT TGAAACGTAT GCTAAGGGTG TGCAGCACAC TGATGAGACA ACACAAGCGT TTTTGAAGAG TTTAGATAAA CTAAATAAGC CTATTACTGT TATGTTTTAT GGTGACCATT TGCCAGGAAT TTACGGCACT GCAAGTTCAG ATGAAAAGAA TTCATTGTCT TTGCATTTGA CTGATTATTT TATTTGGTCA AATAAAGTTG CTTTGGAGCG TAAAGAGCGT AGTGAGAAAA AATCTTCAAA CAAAAATGTA GAGCATTCTA ATACAAGTAA AGATGTAGAT CGTACTAATA AGTATTCTTC GCCTAACTTC TTTATTTCTC AGGCTGCTTT ACATATGAAC 
GCTAAAGTTT CCCCGTATCT TGCTTTCTTA ACTCGTTTAC ATGAGCATGT GAGTGCTATG GAGCCTCCTG TTGTAAATAC TATTCAAGGC TGGGATAGAA TTCCTGAAGG ACAATCAATT TATTTGGATA ACGATGGTAA TCCGATGATT TTGTCTAAGA TGGATAAGAA GTCACGTCAA CTTCTTCACG ACTATCGCTT AATACAATAC GATATTACTG CTGGAAAGCA TTATTTGCGT AATACTGATT TTATGAAATT ACCGCGCTAA
|
Protein sequence | MKSFLRLKVP YWLYAFLFFV IDGMSVIIVQ MGVQRAGRVE MTSAMSGGRW GLITKTWKEL NFVIVLSALV VAMIYGLILL ISNRFWISSA IILSVSLLIA VIEYMKVNVR YETILPADLN FLKSNTGNVA SFLPDNAPMV IGVALGVFIL LLVTTVCLHI FDTNHGKIVR FKDWRYSAGI RVAIALILIG NLSWYIGGVG TVDSSANVFS KMLGDSPAMW DSVYDAQRNG AIVAFLRNVN PKIMDKPADY SEETMKAVYK RYNDEAKRIN RSRTTNMNDN TFILILSESF SDPTRVPGLK LNKNPIPFIS NLKKHTDSGL MLSSGYGGGT ANLEYMSLTG LSMANFDPSM TSPYQQLVPN AQWSPTINQY WDDSRNSIKS IAFHPYEPSM YLRATNYKKF GFSKFYALQG PDVIAHRDVI DKSPYVSDAS AYKSALEKIK EHKQPRFVQI VTMQNHMPYR DWYANNEFEA SSKDGAADLG DDEKTSIETY AKGVQHTDET TQAFLKSLDK LNKPITVMFY GDHLPGIYGT ASSDEKNSLS LHLTDYFIWS NKVALERKER SEKKSSNKNV EHSNTSKDVD RTNKYSSPNF FISQAALHMN AKVSPYLAFL TRLHEHVSAM EPPVVNTIQG WDRIPEGQSI YLDNDGNPMI LSKMDKKSRQ LLHDYRLIQY DITAGKHYLR NTDFMKLPR
|
| |
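As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a DNA string codon by codon. It assumes the sequence is given in coding orientation (it starts with ATG even though the gene is on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. It does not handle alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate("ATGAAGTCAT TTTTGCGCTT"))
```

The first six codons give MKSFLR, matching the start of the protein sequence listed in the record.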