Gene Information |
Locus tag | HMPREF0424_1241 |
Symbol | |
ID | 8708781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1476112 |
End bp | 1478427 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 646483329 |
Product | surface-anchored protein domain protein |
Protein accession | YP_003374434 |
Protein GI | 283783680 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
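The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1476112, 1478427)
print(length)                  # 2316
print(protein_length(length))  # 771
```

Both values match the Gene Length and Protein Length fields in the record.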
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000212575 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTCAGG TCATTAGGGA TATTGTGAAA CGTGCTTTTG CAATGGCAAG TGTTGCAATA CTTGCCATTG CCGTGCTCGC ATCAGGATTT AGTGCAGCTT TACCGGCAGT TGCATTAGAA AATAGTAATC CTGGCGAAAA ATACGATCCT ACAAAAAAGG AAGTGCTAAG TCACGGCCAT ACGGACGTAT TTTACCCAAT TCAATACAAC GGTAAATTCA TTATGGCCGT CGAAAAAGAC GGTGCCACTT TCTTAAAGCC TGAAAACACT ACTTTGCGTG TAGCAAAAAA CACGTATACT ACAAAATCGC AGCTTCCTGC GCTCGCTACT GAATACTACT ATCTCGATGG TTCCGGAAAT CAAAAAGGAA ACCCCCTCTT CCCGGGTTGG GACACAGGTT ATGCTGCAAC TTTAGTTGGC GCACCTCATG CAGATGACGC AACAGCAGAT ATTGCAATTC AGCAAGTTAC TGGACCAATG AACGGGCGCA TTCTGCTTTG GACTACCGAT GGTGCTGGTA AAAATTCGAA GAAATTATCG TTTGAAGAGA ACGATTTAGA CGACGAACCT GATACTGATG GCAGTCGATT TATGCTACCT GGCGTAATTC ACCAACATAC TGCTGGCCAC GTGCATGCAA ACTGGGGATT TACGCAGCCT GGCGTTTACA AACTAAAAGT AGCAGCAACT ATTACAAATA AGAACACTAA GAAGCAGATT ACAACCGAAC CTGCAGAATA CACTTTTGAA GTAGAAGATA CATATTCAGG CGAAGTACCT GCTGGCATTA CGGAAACGCT TGATTTGCAC CGTCGCGGTG ATTCTATTAA TCCTGACGAT GATGAAGAAG CACACGAAGC TTCCAAAGAT CATAAAGACA CGGATAAAGA CGATGGCGGC GATATGCGCA TCGGCAATAT TCGTGATTCC GGACCTCATC CGCATTATCA TTCATACGAA GGTTATGGTG GTTTAGACTT AAAAGTAGTT AACAAACCAA AGGGTGCTCG TATTGAGTGG AGATATGTGC GTGCTGACGA AGGTCCTGAC GCGTATGGAA CCACTCTTTT TGCTGAGAGA TTGCAGCTTC CAGCTGAGCC TGCAATGAAC AAGATGAAAG TGTATGCTCA CGCAACTGAA GGTGAAACGC AAGTTGGCAA AGATACTGCG TCTGCAACCA TCGCCGTTGA AGATCACGGT GCTGACGGAC ATCCTGTAGT AAAAGCAATT GCTCCTTATA AGCGTTTCAA ACCAGGGGAT ACTTTACACG CGAAAACAGT ATTGCTTAAC CCTCATGTTG CAACCGACGG TGTTACCGGT GCGCCAATTG ACGATCCGAC AAGTCCAGTA ACTTCCATAG TCAAAGATTA TGTTTGGTTG ATAAAAAAGG AAGGAGAAAG CGAATTTAAG CGCATTCCTG GTGCCGTGAA CAGCAAGTTG GAGCTAAAAC TTGATGCGTC AATGCAAGGT GCAACGATTC GTCCTTCCTT AGTTTTAAAG AACGGTGAAC TTTACCGTAA TAAAATGTTT GATGAGTTTT GCGATTATGT TATTGAGATG AAAGGTGTTC CTCACTCTCA TAACCATGAT GGCGATGATG ATCAAAGTGG TTCTGATTCT GAAGATGATG AGAACTCTCA CGGGAAAAAG CACCATGAGA AGCGTCGTAA TAAGCGCAAA AATAAGACTA AGCATTTTGT TGGTGGAAAG AACTTTTTGA AAGGTGTATT CGGCGGCACT GCAAATTCGG GACTATTTGG ATCTAGTTCT AATGGTGGAT TCCAATTCGA ATCTAATTTC 
TTTAAGAAAT CCAACAGAAA ATTCAAGCGC ACAAAGAAGA ATCGCAATAA GACTAACCGC ACTAATCGTA AGAATAATTC TAAAAATAAT ACGCAAAGTG GTGCTAGTTC TGGTTCTAGT TCTTTTACAA GAACAGAAAA TTCAAGCGGA ACATCGTTAC ATAATTCATC GCGACGAACT TCTGGCGGAA CTCATACTAA GAACAAGAGC TCTAAGAAAA GTGGACAGAC TATTCGTAAT TTCGTAAGAA CAGATAATAA TTCTGGTAAT AAGTCTAAAA ATAGTTCTGT CAATAACTCT GATTTTGTTA AGAATACTGA ACGTCATGCT TTGCAAGGTG CAAGGCAATA CAACGAAGAA TCTGATGAAG CGTATGAAGA TGATTCAGAT GATACAAAAG GTGATGGTTC CTCGACTAAG TGGGTTGCGG TTGCAGCTTC AGGTGCGAGC GTCGGCTTAT GCACAATAAC TGGCGCATGC GGAGCATTGA TACGTTCTAA ATTGAAATTG TTGTAA
|
Protein sequence | MTQVIRDIVK RAFAMASVAI LAIAVLASGF SAALPAVALE NSNPGEKYDP TKKEVLSHGH TDVFYPIQYN GKFIMAVEKD GATFLKPENT TLRVAKNTYT TKSQLPALAT EYYYLDGSGN QKGNPLFPGW DTGYAATLVG APHADDATAD IAIQQVTGPM NGRILLWTTD GAGKNSKKLS FEENDLDDEP DTDGSRFMLP GVIHQHTAGH VHANWGFTQP GVYKLKVAAT ITNKNTKKQI TTEPAEYTFE VEDTYSGEVP AGITETLDLH RRGDSINPDD DEEAHEASKD HKDTDKDDGG DMRIGNIRDS GPHPHYHSYE GYGGLDLKVV NKPKGARIEW RYVRADEGPD AYGTTLFAER LQLPAEPAMN KMKVYAHATE GETQVGKDTA SATIAVEDHG ADGHPVVKAI APYKRFKPGD TLHAKTVLLN PHVATDGVTG APIDDPTSPV TSIVKDYVWL IKKEGESEFK RIPGAVNSKL ELKLDASMQG ATIRPSLVLK NGELYRNKMF DEFCDYVIEM KGVPHSHNHD GDDDQSGSDS EDDENSHGKK HHEKRRNKRK NKTKHFVGGK NFLKGVFGGT ANSGLFGSSS NGGFQFESNF FKKSNRKFKR TKKNRNKTNR TNRKNNSKNN TQSGASSGSS SFTRTENSSG TSLHNSSRRT SGGTHTKNKS SKKSGQTIRN FVRTDNNSGN KSKNSSVNNS DFVKNTERHA LQGARQYNEE SDEAYEDDSD DTKGDGSSTK WVAVAASGAS VGLCTITGAC GALIRSKLKL L
|
| |
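The gene and protein sequences above can be related with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-") and uses the standard codon assignments, which translation table 11 shares; table 11 differs mainly in which start codons are permitted, and this sketch does not model that. The helper names are illustrative only, and the example uses just the first 30 bases of the record.

```python
BASES = "TCAG"
# Standard codon table amino acids, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


prefix = "ATGACTCAGG TCATTAGGGA TATTGTGAAA"  # first 30 bases of the gene sequence
print(translate(prefix))            # MTQVIRDIVK
print(f"{gc_percent(prefix):.1f}")  # 36.7
```

The translated prefix matches the first ten residues of the protein sequence in the record. Running `gc_percent` on the full gene sequence would be the way to compare against the reported GC content field.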