Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0698 |
Symbol | |
ID | 8709199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 788219 |
End bp | 790423 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 646482803 |
Product | phage prohead protease, HK97 family |
Protein accession | YP_003373925 |
Protein GI | 283783171 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02543] Listeria/Bacterioides repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGA GAACTGGTCT TATGTTGACT AGCACTATGA TGGGTATTGC TACTGCTGCT TGTGCTTTCT GCATGGCGCA AAGTGCGAAT GCTGCCGAAA CTTCAGCTTC TTCTGGTGTT AATACGACTG TAAATTCTTC TACAAATAAT TCTTCTACCA CAGCTTCTAA TACAGAAACT TCCTCTACAG AAGGTTCTTC TTCCGTAACT TCTTATAACG CAGCTTCTAA TACGCCCAAT GGGGGGGGGT CTGCTTTAGA TAGCACTTCC GCAAAACCTT CTGGAAATAC TGTACCTTCC AAGTCTAGCG AAACTGCAAA AACTGCCGAA ACCACAAAAT CTACTAAATC TTCTAAAGCT TCTGAATCTG TAACTGATGC TGAAAATTCT GAAAATGCTA GACTTACACA AACTCTTATA GATGTAAGCA ATACTAATCT TCAAACAAAT AATCGTGAAG ATAGAAGCGT ACACGCATCT GCAGAATCTA AAGCAGCATC AACTGATTCG GGTACTACAG ATGATGCATC TTTAACTTGG ACGAAGGACG ACTTTAACAT TTCTGACGAT GGTAAAACTA TTGGAGGTAG GAAAAATGTT TATTATCCGG AGACCGGTCA AACGGTAAAT GAGTATGTGG ACGGACTTAG TGCCAAAGGC AAGGAGAAAC TTAAGAAAAA TCATCATGTT GTGATTCCTG AGGGTATTGA GGTAATTCAT GAATGTGCTT TTACCGGTCA GGCTGACAAA GTCGGGAATA AGCATAAACA CGTTGACGGC GAGACTTATA TTGAGGGTGT CACTCTTCCG CAATCTCTTA GAATAATAGA ATATGGAGCT TTTGGTTGGA ATAAAATTAA AGGAACCGTT GTTATTCCAA AAAATGTGGT TTCAGTAAGC GATGCAGCCT TTGTTGCTAA TGAAATTGAA AAAGTTGTGT TTGAAGGCGT GCTTGATGAT AAGGGAAAAG AGCACGATAC AGATACTAAT CCGTATTATC TTGCTGGAGT TGGTAGTGTT GCTTTCCAAG GTAATAAAAT TTCTGAAATT GTCGTAAAAG GTAATTTAGG ACAATATAAG TTTCGCTCAG GGGAAAATGA TTCACCGTTT GACAATCAAA ATCCTGGACC TTTCACTATT GAAGTGGGAG AGGAATACAA AAGTCCTATC AAAATTACGC AAGAAAGCGC TGGTCATACA ATCAGTGTTA TGGAGGGTTT TGAGGAGAAT GGAAATCCTG TAACGATTGA CCTCTCACCA TATTTTAAGA AAAATGCTGA TGGCAAATAT GTAGCTGTAA AAACTGGCAC ACTTGAAGGT CAGTGCATTT TTTATGACTT TGTTCAGCGT GTGCCTAGGA TGATTGGTAG ATCGCGTTTT ACATACAACA TTGTTCCTAA AGCTACGATT TACACAGTTA CTTTTGTAAA TGGCGCTAAT TCTCATTATG CTTCTGTGCA AGTTAAGGAA GGCAAGAGCA TAAACGATAA GAGTGTTGAT AGTCAGTCTA TGCCTAAAGA TCCTACGAAG ATTGGGTACA CTTTTGAAGG GTGGAATGAA GATCAAGACG GTACTGGTGC AGTGTTCCTT GCCAGCACTA GTGTAAACAA AAATTTGAAA GTTTACTCTA TTTTTAAGAA TTACTTACCA GTTATTACAG TTCAGAATAA GACTATTACT GAGGGTGATT CACTCGATTT GAATAGTCTT GTTGTGTCTG TTAGTGATAT TGAGGATAAT TCTATAGCAA CTAAGGTTAA GCGTATTAGT GACGGTGGTT TTAATAACAA TGTTCCAGGT GTTTACACTA TCACGTTTAG CGTAACGGAC AATGGCGGTG CTACTGCAAC AGCAGTTGCA ACAGTAACTG TTAATAAGAA GCCAACGCCT CTAATTCCAC CAGCTCCTGC GCCGGAACCG GAGCTAACTC CAACTCCTGT TCCACTTCCT GAGTCGTCGG CCCCTGTCGC TCCAGGTCCG ATTCTGACAC CCTCTGAGTC TGAGACAGAA CCTGAGTCAG CTCCAGCTCT TTCGCAACCC GAGCAACCGG AAGCTAAGCG CGTTGTTAAG CATCTTCCTA AAACTGGCTC TTCAGTAGCT ATGGTAATAA ATTATTTGTT TGCATCTGTG ACTGCAGGAA TTTTTGCACT TGTAGAAGCT AGAAAATCAC TTCGCAGATA TTCCAAGCAT GCTCGCAAAA ACTAG
|
Protein sequence | MNKRTGLMLT STMMGIATAA CAFCMAQSAN AAETSASSGV NTTVNSSTNN SSTTASNTET SSTEGSSSVT SYNAASNTPN GGGSALDSTS AKPSGNTVPS KSSETAKTAE TTKSTKSSKA SESVTDAENS ENARLTQTLI DVSNTNLQTN NREDRSVHAS AESKAASTDS GTTDDASLTW TKDDFNISDD GKTIGGRKNV YYPETGQTVN EYVDGLSAKG KEKLKKNHHV VIPEGIEVIH ECAFTGQADK VGNKHKHVDG ETYIEGVTLP QSLRIIEYGA FGWNKIKGTV VIPKNVVSVS DAAFVANEIE KVVFEGVLDD KGKEHDTDTN PYYLAGVGSV AFQGNKISEI VVKGNLGQYK FRSGENDSPF DNQNPGPFTI EVGEEYKSPI KITQESAGHT ISVMEGFEEN GNPVTIDLSP YFKKNADGKY VAVKTGTLEG QCIFYDFVQR VPRMIGRSRF TYNIVPKATI YTVTFVNGAN SHYASVQVKE GKSINDKSVD SQSMPKDPTK IGYTFEGWNE DQDGTGAVFL ASTSVNKNLK VYSIFKNYLP VITVQNKTIT EGDSLDLNSL VVSVSDIEDN SIATKVKRIS DGGFNNNVPG VYTITFSVTD NGGATATAVA TVTVNKKPTP LIPPAPAPEP ELTPTPVPLP ESSAPVAPGP ILTPSESETE PESAPALSQP EQPEAKRVVK HLPKTGSSVA MVINYLFASV TAGIFALVEA RKSLRRYSKH ARKN
|
| |