Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0780 |
Symbol | |
ID | 8710066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 878385 |
End bp | 881837 |
Gene Length | 3453 bp |
Protein Length | 1150 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 646482881 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_003373998 |
Protein GI | 283783244 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.516639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATTA TGACTAACAA CATGCAAACT ACTGAAAATT CAGAATCTCA GGAAGATAGT ATGTTGCAAC TACAAGAATC TCTTAACTCG TTGTGGTCAG AAGGCAATCC AATAGATATT TCGTCGCGCG CTAAAAAGTA CGTTATTAAA GATTTTCCAC AAACAAATAC CAACAATCAA TTAGGCTGGG TTCAACTTAA CAAGAAAAAA GAATACACGT CAATTGGATT ATTTAAAGAA GACGGAAGAA AATTCCTTCA ACTAGCAGTA GCAGATTCAG AAGGGCGAAT ATTTCTCGGC AGCAAACTTA GCTACGAAGA TATTCAAGAC ATGAGACCGC AACTTTCATT TGTAGAAAAA AATAATTCTT CACTGAAAGA TTTACACCGT TTTTATGGAA AACAATATAA AAAAACAAAA ATTACTCTAA AATGTGCATT TTTGGAAGCG TTTGACGAAA CGTACACCAA ACATACAGAC TATGATATGG ATAATATCGT AGAATCCTAT TTAAATGGCG ACGCAACCGT AGAAGATTTT CTCGCTTCTT TAGAAAGCAG TAAGTCTAAA AGCGCATTAA AACCAGAACA TCGCTCTAAA ACTTTTGCCG GAAAAATAAA CCACTTTTCT AAAACTTTTT CAAAAATTAG AAAAGAAAAC GTATTTTGCG TATCAAACGG AGACTTAGTT TTACAACTTT CAGAAGATAA CGGTTTAATT AAAATCGGAT CATGGATCTC ACACGGGTTC GGAACTAAAA AAACTTATAC TCTCGAAGAG TTACGAACTA AACTTCCAGA AAACTTACGA TGCAATCTTA CATACAGAAA TGCAATAACA ATAGCATTCT TTGCAGCATT ATTCATTCCA GACATGCACA AGTCTTTAAC TGGATTTGGT TACAGCAATG TACTTCGCAG ACTAGAACAC GAAGCACCTT TAGGAGCAAT ACGAAGAAGT GTAGAAGACA TAAAAATAGC GAAAAAAGAG AATTTAAGAG TTTCAGGTTT AGAAGAATAT TTCGAAAAAT TAATGCGCTC AACTGGAGCT TTAGAAACAG AACGCAGCCT GGAAGCAGTG CATGGTGCAG AACGCCTACA CATGCAAAAA TCTCAGCAAA CAGAAGCAAA CTACTTACTT TGGGATGATG CTTTGGAACC AAATGCTGCA CTCAAAGTAC TTAAAATAGA AGGCGCTATT AACCGTTTAT ATTCAATTTA TGATTGTCTG CAAATTAACT CCATTATGGG AGGAAAAACT TGCGAAAATA CTATAAAAGA AGAGCAGGCA TCACAAATCG ACATGGCAAT GATTAAAGAT GTTGCTTTTG CAGCAAGCGG TTCAATAGGT ATGCAATTTT TTGGCAATCC TCTGATGGAC GACAAAATAA GACCTATTTT GCACATGTGG AGAAGAGCAA AGCAGGTAAA AAATAACGAT AAGTATTTCA ATAAGATAGA TAACGACAGT GAATGGTCCT ATAGGCAGCG TCTTTCTAAT CTTATTAGAA GTGTATACTT GCCGTTCCGT TTTGACGCGG AATTCCGATC TAATTTAGAA GATGGTAACG TGGCAATAAA CCTTACTACA GCTGGAGCTG CACTTATGCC ATCTGTAGCT TACGCTAATG AGGATAAATC TTGGAAAAAT CTGAACGACG ACGATAAAGA AAAACTAGCT ACAAACTACA ATCTTCAAGT TGGAATGATA ATGTCTACTC TAGCTTTTGG AGCAAGCGGT AAAGTAAAAA ATGTTTCTAT ACGCTTAGAT TCAATAGGCT TAGAAGAGAT GGTTACAGCA CAAAACAATG CTATGAACAA TCTTGTAAAT CGTACTCTTA ATGCGCTTAA CGTTATGAAT AGTGACGCTG CAAATAATGA TTCATCCAAG CCAAAAGGAG ATCCAAAAGA CGGTGACATT CATGGAGACC CGTCTAAATT GCAAGAAATG CAGCAGACTA ATATTAGTAA CGCAAACAAG ACAAACAATA ATGTCATATC AATGCATGCG AGCGCAAGCA GCCAAGAATT AAACTCAGAA ACAGACACAA ACGCAGATTC TAACGAAAAT CAACTGTCCG AAAACAGCAT GGATAAGTCT AAAGAAGAAA ATTCATTTGC AGCTTTTACT ACAGCACCAT CCATACGAGC TCTAACTACA GTTACGTTTA GCAGAGACAG ATTTATAAAA CTTATTCACC AAAACGGATT AAATAATCCT ATAAAAACAT ACGAAAAATT TAATGCCAAA TTAAACATTG ATTCTCATGG AAAATTAAAT ACCATCGAAC CAGATTTTGA TGTTCACAGT AGCCGCTTTT CTCCTCATGG ATCACAAGAA GAACCTGAAT TTTCTGACAA AATTTTTACA GCAGATCAGG AAAGCATACT GGGAACACAA TACGCACAAG GGCTTTCTAT ACAACGTGAA GATCTTCTGC AACAAGCTGT TGCCGACTTC CATCATATTG CTTCAGAACT TATGCCAACA ACTGCACAAA AAGCTCACGA AGCTATGAGT ATTATTGAAA GCATTGACGA TCCTGAACTT AACGCTCAAG CAGATTCAAT AACGCGAGCG CTAATTGACG AAACAGATTT ACCCAATCTT TCATTTACTA CAGCAAAACG TATAAGAGAT ATTCGTACAA AAGCTCACGA ACAGTTTATG AATGGGGACC TTAGCGAGGC TTTAAAAGAG TACGAAAATT CTGTAGCACA ATTCGATACT ATGTTTACTA GCAGCGAAGC TGTACCAAGA TACTTTAATT CCTACGCTGA GAGAGTAATT TATAATCATC TATTTGCAAC TAAAGAAGAA CGAACAAGAC TTATTCCTGA CGGTCTTTTC TACGCGCATA TGGAACTTGC TGATTTACTT TCGCAATTAA ACCAACATGA CGAAGCTTTA CGCCATTTAA ATATTATGGT TTCGTACGCT CCTACTTACG CTCTTTCTCA TTTAAGATTA GCTGATATAC TTGCGCAAAA AGAGGATTGG TCATCTGTTA TTGCAGCTTG CATTAACGCT TTGAATGTGT CTCTCGATAG GGATGACGCT GCTTTTGCTT ACTATAAGCT CGCTTATGCG GAATGGATGC AGAATAATTT CCTCATTGCT GCATCTTCAT ATAGAATGGC ACAATACTTA GCTCCAGGAA AGATTGAGCC GCTAGAAATG GAACTTGATG AGCTTCTTTC TAGAATGAGA TCGCAGTGCA AGCTAATACC GAGCAATATA CATGAAGCAC AGATGGCTCT TCTTTCTGAA GACGTACCAG CTTGGCCTGA TATTGAAGCA GAAGAGATTA TTGATAAAGC AGCAAAATTA ACTGTGAACG ATGGAATGTT TGTAATTGCA AGAACTTTAT GTGTTGCAAA TATTCGAATG ACTTCTAACG AAGATCTAAG TAGCACAGTA CAAACACAGT TCTTAAGATC TTTAAACGCT TAA
|
Protein sequence | MGIMTNNMQT TENSESQEDS MLQLQESLNS LWSEGNPIDI SSRAKKYVIK DFPQTNTNNQ LGWVQLNKKK EYTSIGLFKE DGRKFLQLAV ADSEGRIFLG SKLSYEDIQD MRPQLSFVEK NNSSLKDLHR FYGKQYKKTK ITLKCAFLEA FDETYTKHTD YDMDNIVESY LNGDATVEDF LASLESSKSK SALKPEHRSK TFAGKINHFS KTFSKIRKEN VFCVSNGDLV LQLSEDNGLI KIGSWISHGF GTKKTYTLEE LRTKLPENLR CNLTYRNAIT IAFFAALFIP DMHKSLTGFG YSNVLRRLEH EAPLGAIRRS VEDIKIAKKE NLRVSGLEEY FEKLMRSTGA LETERSLEAV HGAERLHMQK SQQTEANYLL WDDALEPNAA LKVLKIEGAI NRLYSIYDCL QINSIMGGKT CENTIKEEQA SQIDMAMIKD VAFAASGSIG MQFFGNPLMD DKIRPILHMW RRAKQVKNND KYFNKIDNDS EWSYRQRLSN LIRSVYLPFR FDAEFRSNLE DGNVAINLTT AGAALMPSVA YANEDKSWKN LNDDDKEKLA TNYNLQVGMI MSTLAFGASG KVKNVSIRLD SIGLEEMVTA QNNAMNNLVN RTLNALNVMN SDAANNDSSK PKGDPKDGDI HGDPSKLQEM QQTNISNANK TNNNVISMHA SASSQELNSE TDTNADSNEN QLSENSMDKS KEENSFAAFT TAPSIRALTT VTFSRDRFIK LIHQNGLNNP IKTYEKFNAK LNIDSHGKLN TIEPDFDVHS SRFSPHGSQE EPEFSDKIFT ADQESILGTQ YAQGLSIQRE DLLQQAVADF HHIASELMPT TAQKAHEAMS IIESIDDPEL NAQADSITRA LIDETDLPNL SFTTAKRIRD IRTKAHEQFM NGDLSEALKE YENSVAQFDT MFTSSEAVPR YFNSYAERVI YNHLFATKEE RTRLIPDGLF YAHMELADLL SQLNQHDEAL RHLNIMVSYA PTYALSHLRL ADILAQKEDW SSVIAACINA LNVSLDRDDA AFAYYKLAYA EWMQNNFLIA ASSYRMAQYL APGKIEPLEM ELDELLSRMR SQCKLIPSNI HEAQMALLSE DVPAWPDIEA EEIIDKAAKL TVNDGMFVIA RTLCVANIRM TSNEDLSSTV QTQFLRSLNA
|
| |