Gene HMPREF0424_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0391 
Symbol 
ID8708851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp420040 
End bp422046 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content50% 
IMG OID646482507 
Producttetratricopeptide repeat protein 
Protein accessionYP_003373641 
Protein GI283782887 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGACG CTGCGCAATT TTATAACGAG CTTGACGAAA TGTTCGCGAA TCACGCGAGC 
GCAGACGCGA TTGAAACGTA TCTTTTGCGA AAACTTACCG AAGCCAATCC CCTGCAACTG
TCTATTCTCA ACGAACTCAT GGGCTTTTAC CGTTCGCGCG GAGAGCACAC AAAAAATCAG
CCAATCATCA ATCGCGCTCT AAATCTTGCA AACAAAATGC AGCTAGCTGG CACCGAAGCT
GGCACAACCA CACTAATTAA CGCCGCCACA AGCCTCCGCG CGGCTGGCGA CTACGATCGC
GCCGAAAAAA TTTACACTCA GGCTCTTAAC GAATCCGCCA TGACTCTTGG CGCCAAAAAT
CGCAAGCTCG CCGCGCTTCA CAACAATCTT TCGATGCTTT ACAGCGAAAC CGGTCGCACG
CATGATGCAA TTGAGGAGCT TAATCAGGCG CTTGAAATTT TGCAAAATAC GAGCACGGAT
CCTGAGCGCG ACATCGATAT TGCCGCCACG CACACGAATC TTGCGCTTGC GATGCTGCAG
GAGTGTACGA AGGAGTGCTC GCATCCCAAC ACCAGCACAA ACAGCAAATC CGCAACTCTT
GAATCCGCTT TTGAACACGC TTCGACCAGC GTTCGCATGT ATATTGCGGG GAATAACGAA
AATCAGCCGC ACTACGCTTC TGCATTGGCA GGTTTTGCGC AAGTACAGTG CGCGCGCGGC
GAGTACGCTC AAGCGGAGGA GAGTTACAGT AAGGCACTTG ATTTGATTGC GAGATGCTAC
GGAAAAGATA GCGAATCTTA CGCGATTACG ACGGAAAATT TGCGGCAGAC GCGCGAGTTG
GCAGAAAAAG TTACGGAAGA ATCTATAGAA ATTCAAGAAT CAGCAAACCA AGATATAACA
GAACAACACT CAACGAACTC ACACGCAGCA AACCAGAACG CAGCAACCCA GAACGCAGCA
AACAATAACA TAAAAACCGG CATGCAATTA GCGCAATCAT ACTGGCAAAC TTACGGAAAG
CCGTTACTTG ATCAGCCAAA ATTCGCGAGG TACAAAAATC GCATTGCAGC AGGATTAGTT
GGTCACGGTT CGGAATGCTA CGGTTTTGAC GACGAAATTT CGCGCGACCA CGATTTTGGT
CCCGGATTCT GCTTGTGGCT TACTGACGAA GATTACGCGG AAATTGGCGC GGATTTGCAA
AACGCTTACA ACGCACTTCC GCAAAAATAC GCCGGTTTTG AATCTCGCAA CGAAACGCAG
CGCGCCAAAT CATGCGAAAG CAGCAAGCGC GTCGGAATAT TTCGCATAAG CGAATTTTTC
GAAAATATAA CCGGCTTCCC CACTGCCCCG GCCGCAAACG AGCCGCACTT ATGGCTCTCA
TTAAGCGAGT CAACGCTTGC AGCAGCGACG AACGGAAAAA TTTTCGCAGA TCCTCTTGGC
GAGTTCTCAA AAGCGCGCCA AAGCTTCAAA CTCATGCCGA ACGACGTGCG AATCTCACTA
ATCTCGCGCA GACTTGGCAT GATTTCGCAA GCCGGCCAAT ACAATTTCCC ACGCATGATT
GCGCGCAAAG ACGCATCCGC AGCATGGCTT TCTATCAACG AATTCGTGCG CGCAACTGCT
TCCCTCGTTT TTCTGCTCAA CAATCCTGTA ACCGCAGGCT ACTTGCCTTA CTACAAGTGG
CAGTTTGCGG CATTGCGCAA GCTCAGCAAT CGCATGGCAT CTAGGCTTCC GGAAGTATGC
AGCAAACTTG AGTCGGTAAT GCGACTTTCT TCCGCTGCAT GCTTTGGCGG AGACGGTTCC
GGCGGAGACG GTTTTGGCGA AGGCGGCAAG GGCGCTGGAC TTGCGCAAAA GCAAGTTACG
CAAATAATTG ACAGCATTTG CGAAGATATT GTGCGAGAAT TGCAATATCA AGGCTTAAGC
GATTGCAGCG AAACTTTTTT GGAATGGCAG CGGCCGTACG TTGAAGCACA TATTCACTCG
CGTGCAGCAT GTTTGAAGAG CTTATGA
 
Protein sequence
MEDAAQFYNE LDEMFANHAS ADAIETYLLR KLTEANPLQL SILNELMGFY RSRGEHTKNQ 
PIINRALNLA NKMQLAGTEA GTTTLINAAT SLRAAGDYDR AEKIYTQALN ESAMTLGAKN
RKLAALHNNL SMLYSETGRT HDAIEELNQA LEILQNTSTD PERDIDIAAT HTNLALAMLQ
ECTKECSHPN TSTNSKSATL ESAFEHASTS VRMYIAGNNE NQPHYASALA GFAQVQCARG
EYAQAEESYS KALDLIARCY GKDSESYAIT TENLRQTREL AEKVTEESIE IQESANQDIT
EQHSTNSHAA NQNAATQNAA NNNIKTGMQL AQSYWQTYGK PLLDQPKFAR YKNRIAAGLV
GHGSECYGFD DEISRDHDFG PGFCLWLTDE DYAEIGADLQ NAYNALPQKY AGFESRNETQ
RAKSCESSKR VGIFRISEFF ENITGFPTAP AANEPHLWLS LSESTLAAAT NGKIFADPLG
EFSKARQSFK LMPNDVRISL ISRRLGMISQ AGQYNFPRMI ARKDASAAWL SINEFVRATA
SLVFLLNNPV TAGYLPYYKW QFAALRKLSN RMASRLPEVC SKLESVMRLS SAACFGGDGS
GGDGFGEGGK GAGLAQKQVT QIIDSICEDI VRELQYQGLS DCSETFLEWQ RPYVEAHIHS
RAACLKSL