Gene HMPREF0424_0780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0780 
Symbol 
ID8710066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp878385 
End bp881837 
Gene Length3453 bp 
Protein Length1150 aa 
Translation table11 
GC content37% 
IMG OID646482881 
Producttetratricopeptide repeat protein 
Protein accessionYP_003373998 
Protein GI283783244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.516639 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATTA TGACTAACAA CATGCAAACT ACTGAAAATT CAGAATCTCA GGAAGATAGT 
ATGTTGCAAC TACAAGAATC TCTTAACTCG TTGTGGTCAG AAGGCAATCC AATAGATATT
TCGTCGCGCG CTAAAAAGTA CGTTATTAAA GATTTTCCAC AAACAAATAC CAACAATCAA
TTAGGCTGGG TTCAACTTAA CAAGAAAAAA GAATACACGT CAATTGGATT ATTTAAAGAA
GACGGAAGAA AATTCCTTCA ACTAGCAGTA GCAGATTCAG AAGGGCGAAT ATTTCTCGGC
AGCAAACTTA GCTACGAAGA TATTCAAGAC ATGAGACCGC AACTTTCATT TGTAGAAAAA
AATAATTCTT CACTGAAAGA TTTACACCGT TTTTATGGAA AACAATATAA AAAAACAAAA
ATTACTCTAA AATGTGCATT TTTGGAAGCG TTTGACGAAA CGTACACCAA ACATACAGAC
TATGATATGG ATAATATCGT AGAATCCTAT TTAAATGGCG ACGCAACCGT AGAAGATTTT
CTCGCTTCTT TAGAAAGCAG TAAGTCTAAA AGCGCATTAA AACCAGAACA TCGCTCTAAA
ACTTTTGCCG GAAAAATAAA CCACTTTTCT AAAACTTTTT CAAAAATTAG AAAAGAAAAC
GTATTTTGCG TATCAAACGG AGACTTAGTT TTACAACTTT CAGAAGATAA CGGTTTAATT
AAAATCGGAT CATGGATCTC ACACGGGTTC GGAACTAAAA AAACTTATAC TCTCGAAGAG
TTACGAACTA AACTTCCAGA AAACTTACGA TGCAATCTTA CATACAGAAA TGCAATAACA
ATAGCATTCT TTGCAGCATT ATTCATTCCA GACATGCACA AGTCTTTAAC TGGATTTGGT
TACAGCAATG TACTTCGCAG ACTAGAACAC GAAGCACCTT TAGGAGCAAT ACGAAGAAGT
GTAGAAGACA TAAAAATAGC GAAAAAAGAG AATTTAAGAG TTTCAGGTTT AGAAGAATAT
TTCGAAAAAT TAATGCGCTC AACTGGAGCT TTAGAAACAG AACGCAGCCT GGAAGCAGTG
CATGGTGCAG AACGCCTACA CATGCAAAAA TCTCAGCAAA CAGAAGCAAA CTACTTACTT
TGGGATGATG CTTTGGAACC AAATGCTGCA CTCAAAGTAC TTAAAATAGA AGGCGCTATT
AACCGTTTAT ATTCAATTTA TGATTGTCTG CAAATTAACT CCATTATGGG AGGAAAAACT
TGCGAAAATA CTATAAAAGA AGAGCAGGCA TCACAAATCG ACATGGCAAT GATTAAAGAT
GTTGCTTTTG CAGCAAGCGG TTCAATAGGT ATGCAATTTT TTGGCAATCC TCTGATGGAC
GACAAAATAA GACCTATTTT GCACATGTGG AGAAGAGCAA AGCAGGTAAA AAATAACGAT
AAGTATTTCA ATAAGATAGA TAACGACAGT GAATGGTCCT ATAGGCAGCG TCTTTCTAAT
CTTATTAGAA GTGTATACTT GCCGTTCCGT TTTGACGCGG AATTCCGATC TAATTTAGAA
GATGGTAACG TGGCAATAAA CCTTACTACA GCTGGAGCTG CACTTATGCC ATCTGTAGCT
TACGCTAATG AGGATAAATC TTGGAAAAAT CTGAACGACG ACGATAAAGA AAAACTAGCT
ACAAACTACA ATCTTCAAGT TGGAATGATA ATGTCTACTC TAGCTTTTGG AGCAAGCGGT
AAAGTAAAAA ATGTTTCTAT ACGCTTAGAT TCAATAGGCT TAGAAGAGAT GGTTACAGCA
CAAAACAATG CTATGAACAA TCTTGTAAAT CGTACTCTTA ATGCGCTTAA CGTTATGAAT
AGTGACGCTG CAAATAATGA TTCATCCAAG CCAAAAGGAG ATCCAAAAGA CGGTGACATT
CATGGAGACC CGTCTAAATT GCAAGAAATG CAGCAGACTA ATATTAGTAA CGCAAACAAG
ACAAACAATA ATGTCATATC AATGCATGCG AGCGCAAGCA GCCAAGAATT AAACTCAGAA
ACAGACACAA ACGCAGATTC TAACGAAAAT CAACTGTCCG AAAACAGCAT GGATAAGTCT
AAAGAAGAAA ATTCATTTGC AGCTTTTACT ACAGCACCAT CCATACGAGC TCTAACTACA
GTTACGTTTA GCAGAGACAG ATTTATAAAA CTTATTCACC AAAACGGATT AAATAATCCT
ATAAAAACAT ACGAAAAATT TAATGCCAAA TTAAACATTG ATTCTCATGG AAAATTAAAT
ACCATCGAAC CAGATTTTGA TGTTCACAGT AGCCGCTTTT CTCCTCATGG ATCACAAGAA
GAACCTGAAT TTTCTGACAA AATTTTTACA GCAGATCAGG AAAGCATACT GGGAACACAA
TACGCACAAG GGCTTTCTAT ACAACGTGAA GATCTTCTGC AACAAGCTGT TGCCGACTTC
CATCATATTG CTTCAGAACT TATGCCAACA ACTGCACAAA AAGCTCACGA AGCTATGAGT
ATTATTGAAA GCATTGACGA TCCTGAACTT AACGCTCAAG CAGATTCAAT AACGCGAGCG
CTAATTGACG AAACAGATTT ACCCAATCTT TCATTTACTA CAGCAAAACG TATAAGAGAT
ATTCGTACAA AAGCTCACGA ACAGTTTATG AATGGGGACC TTAGCGAGGC TTTAAAAGAG
TACGAAAATT CTGTAGCACA ATTCGATACT ATGTTTACTA GCAGCGAAGC TGTACCAAGA
TACTTTAATT CCTACGCTGA GAGAGTAATT TATAATCATC TATTTGCAAC TAAAGAAGAA
CGAACAAGAC TTATTCCTGA CGGTCTTTTC TACGCGCATA TGGAACTTGC TGATTTACTT
TCGCAATTAA ACCAACATGA CGAAGCTTTA CGCCATTTAA ATATTATGGT TTCGTACGCT
CCTACTTACG CTCTTTCTCA TTTAAGATTA GCTGATATAC TTGCGCAAAA AGAGGATTGG
TCATCTGTTA TTGCAGCTTG CATTAACGCT TTGAATGTGT CTCTCGATAG GGATGACGCT
GCTTTTGCTT ACTATAAGCT CGCTTATGCG GAATGGATGC AGAATAATTT CCTCATTGCT
GCATCTTCAT ATAGAATGGC ACAATACTTA GCTCCAGGAA AGATTGAGCC GCTAGAAATG
GAACTTGATG AGCTTCTTTC TAGAATGAGA TCGCAGTGCA AGCTAATACC GAGCAATATA
CATGAAGCAC AGATGGCTCT TCTTTCTGAA GACGTACCAG CTTGGCCTGA TATTGAAGCA
GAAGAGATTA TTGATAAAGC AGCAAAATTA ACTGTGAACG ATGGAATGTT TGTAATTGCA
AGAACTTTAT GTGTTGCAAA TATTCGAATG ACTTCTAACG AAGATCTAAG TAGCACAGTA
CAAACACAGT TCTTAAGATC TTTAAACGCT TAA
 
Protein sequence
MGIMTNNMQT TENSESQEDS MLQLQESLNS LWSEGNPIDI SSRAKKYVIK DFPQTNTNNQ 
LGWVQLNKKK EYTSIGLFKE DGRKFLQLAV ADSEGRIFLG SKLSYEDIQD MRPQLSFVEK
NNSSLKDLHR FYGKQYKKTK ITLKCAFLEA FDETYTKHTD YDMDNIVESY LNGDATVEDF
LASLESSKSK SALKPEHRSK TFAGKINHFS KTFSKIRKEN VFCVSNGDLV LQLSEDNGLI
KIGSWISHGF GTKKTYTLEE LRTKLPENLR CNLTYRNAIT IAFFAALFIP DMHKSLTGFG
YSNVLRRLEH EAPLGAIRRS VEDIKIAKKE NLRVSGLEEY FEKLMRSTGA LETERSLEAV
HGAERLHMQK SQQTEANYLL WDDALEPNAA LKVLKIEGAI NRLYSIYDCL QINSIMGGKT
CENTIKEEQA SQIDMAMIKD VAFAASGSIG MQFFGNPLMD DKIRPILHMW RRAKQVKNND
KYFNKIDNDS EWSYRQRLSN LIRSVYLPFR FDAEFRSNLE DGNVAINLTT AGAALMPSVA
YANEDKSWKN LNDDDKEKLA TNYNLQVGMI MSTLAFGASG KVKNVSIRLD SIGLEEMVTA
QNNAMNNLVN RTLNALNVMN SDAANNDSSK PKGDPKDGDI HGDPSKLQEM QQTNISNANK
TNNNVISMHA SASSQELNSE TDTNADSNEN QLSENSMDKS KEENSFAAFT TAPSIRALTT
VTFSRDRFIK LIHQNGLNNP IKTYEKFNAK LNIDSHGKLN TIEPDFDVHS SRFSPHGSQE
EPEFSDKIFT ADQESILGTQ YAQGLSIQRE DLLQQAVADF HHIASELMPT TAQKAHEAMS
IIESIDDPEL NAQADSITRA LIDETDLPNL SFTTAKRIRD IRTKAHEQFM NGDLSEALKE
YENSVAQFDT MFTSSEAVPR YFNSYAERVI YNHLFATKEE RTRLIPDGLF YAHMELADLL
SQLNQHDEAL RHLNIMVSYA PTYALSHLRL ADILAQKEDW SSVIAACINA LNVSLDRDDA
AFAYYKLAYA EWMQNNFLIA ASSYRMAQYL APGKIEPLEM ELDELLSRMR SQCKLIPSNI
HEAQMALLSE DVPAWPDIEA EEIIDKAAKL TVNDGMFVIA RTLCVANIRM TSNEDLSSTV
QTQFLRSLNA