Gene HMPREF0424_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0066 
Symbol 
ID8709038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp78418 
End bp79389 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content40% 
IMG OID646482187 
Producthypothetical protein 
Protein accessionYP_003373334 
Protein GI283782580 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1651] Protein-disulfide isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0355055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAA TTCAGATGCG TGATAATAAG CAGGGAGATT CATCTGCTAC TCGCCTACAA 
AGAACTATAG AAAATCGCGA ACGTGCAGCG CGTGAGGCGC GTGAGCGTCG TCAGCAAGCA
ATTATCGGCG TTATTGTAGT AATTATTGTT GCGGCTATGT TGGCTGCTGT AGGCATTACT
GCGTATCAGG CAGCTTACGT TCAGCCTCAG AAAAAAATAG AGCAATCTGC AAAATCTAAT
CATAATTTGG ATAATATTGA TCCAAGTATT CGTCCAAGCG GTGTTAATTC TAAGAATGGC
ATTCTTTTTA GCAAGGATGG GTACGGCAAG CAAGCTGCTG GTGCTCCTAC AGTTGCCACT
TATTTTGATC CTTTGTGCCC TGGTTGCGGA TCCTTTAATC GTACTGTTGA TGAAACTTTA
ATTAAAATGG TTGAAGCAGG TCAAATTAAC CTAGAACTTC ACCCAATGTC ATTCTTGGAT
GGACTTTCTA CTGACCATTA CTCGACTCGT GTTTCTAGTG CTATTGCTTA TATCGCTTCA
TATGATAACA ATCCAAAGCA TTTGCTTCAG TTTATCAATG GCATTTTTAA CGAAAAATTC
CAGCCGGAAG AAAGCGAGGG ATATAAGCCT GTAAGCAATA AAGAGTTGAT TAAACTGGCT
AAGAAATCTG GAATTCCAAA CGAAATTGCA AGTAAAGCTT TTAACCGCCA ATATTTGAAG
TGGCAGCTGC TAGTTAATAA ATACACTCCT GATCGTAAGG AATTGTGGAA TGTCAGCGGT
CCAAATAAAG GATCTATGAC AACGCCGACT GTAACTATTA ACGATAAATT GCTTGATATG
AATGCCATTA ATGAGAAAAA AATGAAGGTG CTTGATGCTC TGCTTCATTG CATCGGCCTG
GATAAAAAAC AAGTTGGAGT TGCAGGTCAG ATGCCTAAAG TGTCTGACAC TAGTTCCCCT
ATTGCATTGT AA
 
Protein sequence
MPKIQMRDNK QGDSSATRLQ RTIENRERAA REARERRQQA IIGVIVVIIV AAMLAAVGIT 
AYQAAYVQPQ KKIEQSAKSN HNLDNIDPSI RPSGVNSKNG ILFSKDGYGK QAAGAPTVAT
YFDPLCPGCG SFNRTVDETL IKMVEAGQIN LELHPMSFLD GLSTDHYSTR VSSAIAYIAS
YDNNPKHLLQ FINGIFNEKF QPEESEGYKP VSNKELIKLA KKSGIPNEIA SKAFNRQYLK
WQLLVNKYTP DRKELWNVSG PNKGSMTTPT VTINDKLLDM NAINEKKMKV LDALLHCIGL
DKKQVGVAGQ MPKVSDTSSP IAL