Gene Information |
Locus tag | HMPREF0424_0391 |
Symbol | |
ID | 8708851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 420040 |
End bp | 422046 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646482507 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_003373641 |
Protein GI | 283782887 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
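The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the Gene Length includes the stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 420040
end_bp = 422046
gene_length_bp = 2007
protein_length_aa = 668


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions, 422046 - 420040 + 1 = 2007 bp, and 2007 / 3 = 669 codons, one of which is the stop codon, leaving 668 residues.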
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGACG CTGCGCAATT TTATAACGAG CTTGACGAAA TGTTCGCGAA TCACGCGAGC GCAGACGCGA TTGAAACGTA TCTTTTGCGA AAACTTACCG AAGCCAATCC CCTGCAACTG TCTATTCTCA ACGAACTCAT GGGCTTTTAC CGTTCGCGCG GAGAGCACAC AAAAAATCAG CCAATCATCA ATCGCGCTCT AAATCTTGCA AACAAAATGC AGCTAGCTGG CACCGAAGCT GGCACAACCA CACTAATTAA CGCCGCCACA AGCCTCCGCG CGGCTGGCGA CTACGATCGC GCCGAAAAAA TTTACACTCA GGCTCTTAAC GAATCCGCCA TGACTCTTGG CGCCAAAAAT CGCAAGCTCG CCGCGCTTCA CAACAATCTT TCGATGCTTT ACAGCGAAAC CGGTCGCACG CATGATGCAA TTGAGGAGCT TAATCAGGCG CTTGAAATTT TGCAAAATAC GAGCACGGAT CCTGAGCGCG ACATCGATAT TGCCGCCACG CACACGAATC TTGCGCTTGC GATGCTGCAG GAGTGTACGA AGGAGTGCTC GCATCCCAAC ACCAGCACAA ACAGCAAATC CGCAACTCTT GAATCCGCTT TTGAACACGC TTCGACCAGC GTTCGCATGT ATATTGCGGG GAATAACGAA AATCAGCCGC ACTACGCTTC TGCATTGGCA GGTTTTGCGC AAGTACAGTG CGCGCGCGGC GAGTACGCTC AAGCGGAGGA GAGTTACAGT AAGGCACTTG ATTTGATTGC GAGATGCTAC GGAAAAGATA GCGAATCTTA CGCGATTACG ACGGAAAATT TGCGGCAGAC GCGCGAGTTG GCAGAAAAAG TTACGGAAGA ATCTATAGAA ATTCAAGAAT CAGCAAACCA AGATATAACA GAACAACACT CAACGAACTC ACACGCAGCA AACCAGAACG CAGCAACCCA GAACGCAGCA AACAATAACA TAAAAACCGG CATGCAATTA GCGCAATCAT ACTGGCAAAC TTACGGAAAG CCGTTACTTG ATCAGCCAAA ATTCGCGAGG TACAAAAATC GCATTGCAGC AGGATTAGTT GGTCACGGTT CGGAATGCTA CGGTTTTGAC GACGAAATTT CGCGCGACCA CGATTTTGGT CCCGGATTCT GCTTGTGGCT TACTGACGAA GATTACGCGG AAATTGGCGC GGATTTGCAA AACGCTTACA ACGCACTTCC GCAAAAATAC GCCGGTTTTG AATCTCGCAA CGAAACGCAG CGCGCCAAAT CATGCGAAAG CAGCAAGCGC GTCGGAATAT TTCGCATAAG CGAATTTTTC GAAAATATAA CCGGCTTCCC CACTGCCCCG GCCGCAAACG AGCCGCACTT ATGGCTCTCA TTAAGCGAGT CAACGCTTGC AGCAGCGACG AACGGAAAAA TTTTCGCAGA TCCTCTTGGC GAGTTCTCAA AAGCGCGCCA AAGCTTCAAA CTCATGCCGA ACGACGTGCG AATCTCACTA ATCTCGCGCA GACTTGGCAT GATTTCGCAA GCCGGCCAAT ACAATTTCCC ACGCATGATT GCGCGCAAAG ACGCATCCGC AGCATGGCTT TCTATCAACG AATTCGTGCG CGCAACTGCT TCCCTCGTTT TTCTGCTCAA CAATCCTGTA ACCGCAGGCT ACTTGCCTTA CTACAAGTGG CAGTTTGCGG CATTGCGCAA GCTCAGCAAT CGCATGGCAT CTAGGCTTCC GGAAGTATGC AGCAAACTTG AGTCGGTAAT GCGACTTTCT TCCGCTGCAT GCTTTGGCGG AGACGGTTCC 
GGCGGAGACG GTTTTGGCGA AGGCGGCAAG GGCGCTGGAC TTGCGCAAAA GCAAGTTACG CAAATAATTG ACAGCATTTG CGAAGATATT GTGCGAGAAT TGCAATATCA AGGCTTAAGC GATTGCAGCG AAACTTTTTT GGAATGGCAG CGGCCGTACG TTGAAGCACA TATTCACTCG CGTGCAGCAT GTTTGAAGAG CTTATGA
|
Protein sequence | MEDAAQFYNE LDEMFANHAS ADAIETYLLR KLTEANPLQL SILNELMGFY RSRGEHTKNQ PIINRALNLA NKMQLAGTEA GTTTLINAAT SLRAAGDYDR AEKIYTQALN ESAMTLGAKN RKLAALHNNL SMLYSETGRT HDAIEELNQA LEILQNTSTD PERDIDIAAT HTNLALAMLQ ECTKECSHPN TSTNSKSATL ESAFEHASTS VRMYIAGNNE NQPHYASALA GFAQVQCARG EYAQAEESYS KALDLIARCY GKDSESYAIT TENLRQTREL AEKVTEESIE IQESANQDIT EQHSTNSHAA NQNAATQNAA NNNIKTGMQL AQSYWQTYGK PLLDQPKFAR YKNRIAAGLV GHGSECYGFD DEISRDHDFG PGFCLWLTDE DYAEIGADLQ NAYNALPQKY AGFESRNETQ RAKSCESSKR VGIFRISEFF ENITGFPTAP AANEPHLWLS LSESTLAAAT NGKIFADPLG EFSKARQSFK LMPNDVRISL ISRRLGMISQ AGQYNFPRMI ARKDASAAWL SINEFVRATA SLVFLLNNPV TAGYLPYYKW QFAALRKLSN RMASRLPEVC SKLESVMRLS SAACFGGDGS GGDGFGEGGK GAGLAQKQVT QIIDSICEDI VRELQYQGLS DCSETFLEWQ RPYVEAHIHS RAACLKSL
|
| |
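The relationship between the Gene sequence and Protein sequence fields can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: it uses only the first 30 bases of the gene sequence shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The displayed gene sequence is assumed to be in coding orientation already, since it begins with ATG even though the gene lies on the minus strand. The names `strip_blocks` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the Gene sequence field.
gene_prefix = "ATGGAGGACG CTGCGCAATT TTATAACGAG"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MEDAAQFYNE
```

The result matches the first block of the Protein sequence field. Applying `translate` to the full gene sequence, pasted with or without the block spacing, should reproduce the 668-residue protein under the same assumptions.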