Gene HMPREF0424_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1000 
Symbol 
ID8709173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1132215 
End bp1134185 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content50% 
IMG OID646483093 
Producthypothetical protein 
Protein accessionYP_003374208 
Protein GI283783454 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG3096] Uncharacterized protein involved in chromosome partitioning 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATT TTCAGAATGC TACGTGGAGC GTGATCCAAT TGGATTATGC GAACGCGTTT 
ATTCCTGACG TTCGTTTGAA CGGTGGCGAC GAGAATGGGC GTATTATCCG CGTGCAATTG
TTAGATAATG GTGTGCCGGT TGATGATTCG ACTGTTGAAG TGTTCTTGTG CTGGAATCGC
CAGCCGGGGG TGCTTATTGG TGACCGCGTA AAAATGGAAG CGAAGGATTC TGACGATGGT
CGCATTTGGC AGGTGGCGGT GCCGGTTGCG GCTTGTCGTA TGCCGGGCAC AGTGACTCTT
GGTTTTGAAG TGAAACGTGA TAAGACGATC GTTTGCTCGC GTAGTTTTAC TGCTATTGTC
GAGCAGCCGG TTTTTGACGC TGGTTCGCCT GAGGGTAAGT CTTACAGGCA GGAGCTTGAA
GACACGGCAC AACAAGCAAA GGACGCTACT GGTAAAGCTA ACGCTTTAAC TGATAAAGTT
TCACACCTGA TTGAACAGAA TGAAACAGTA TCGCGGAATG CTCAAAATGC GGCTGATGCC
GCTAATAATG CTACTAGCAT AGCGCAACAG GCGGCACAAC AAGCAAAGGA CGCGGCAAGT
GAAGCATCAC AAGCTGCTCA AAACACTCAA AACGTTATTA GTCACGCTAC TGAAGTTGCC
CAACAGTGTG ACGCTAGTAA ACAGACTGCT GACCAAGCAG CAAAGCGCGC TGATGATGCT
GTGAGTGGGT TAAAGCAAAC TGTGCAGAAT GCTGCTGCTG ATGCGGCTAG TAAAGTGCAG
CAGGCTGTTG AGCGCGCTAA CAGTGCGACT CAAGCCGTGG ACGCGGTTCG TGAGAAAACA
GAAGCTGCAA ACAAACAAAC TGAAACCGAT CTAGCAGCAT TAAGAGAAGA GGTCGTTAAA
GCTCAGCGCG CCGGCTTTAC CGCGTCATCT AGCGCGCAAA AATGCGATGA AGCAGCACAA
GCTTATAGGA ACGTGTCCGG TGAGGTAGCT CAAGCTAAAC AGACTAGCGA ACAGGCGGTT
GAAGCGGCTA ATCAGGCGTT GCATACGGCT CAGGAGTCTG CTGCTGCGGT GGCTCAAGCT
CAGAGCGTTC TTGACCAAGT AAAGGATGCT AGCGAGACTG CTAAGCGTGT GGTGAGTGCT
GTTGAAGAGT TGAAGCAGAC GAATAACGCG GCACTTGAAG CTACTCGTAC GGCTAACGCT
CAAGCGTCTG CAGCCGCTGA TGCGGCTGGC AAGGCTAATA ATGCTACTAG CACAGCCAAC
AGCGCAGCGC AAGCGGCGAA TGATGCGGCT GGCAAGGTTA CGCAGACTTT GCAAGAGTCT
GAAACACGTT TTAAAGCTGT TGAACAGGCA GCGAATGATG CTAAAAGCGT GGCTGGTACG
GCGAATAGTA CAGCTGAAGC GGCGCGCTCT ACAGCTGAGC AGGCTCAGAG CAAAGCTAAC
GACGCGGCGG GATCAGCTCA ACGCGCTCAA AACACAGCTA ACAGTGCTAT TGAAGCTACT
GATAACAATA AAAATCGCAT TGACTCTATG GAGTCGGATG TTAGCTCGTT AAAAAACTCT
TGTAGCGCAG CACAGAGTAA GGCGAACGAT GCGGCTCAGA CTGCTAGTAA AGCACAGTCG
GTAGCGGATA GTGCTAATAG TGCGGCGCAA GCAGCTGCTA GTAAAGCTGA TAGCGCACAA
CAAGCAGTAA ACAACATTCG CACACCAATA GTAAAACCCC AATCATTGAC CGGTTACACA
ACCCCAAGTT CTTGGGATTG GACTCTAACA GATTTAAAAG AATTACCACA TGGGGGGCAT
ATTCTCATCT ACCCACAAGC TGACAGTATT AAAGAGTACA TGCGCCAGCA GCCAGAGTTC
GAGATTGAAG AACAGAATGG TGTGCGAACA GGGAAAATAA CTGTAATAAC TCACCAGCCG
AATACTAACG GTTCTGCTCT AAAGCTTGTT TTTGTTTGGT TTGCGGACTA G
 
Protein sequence
MSDFQNATWS VIQLDYANAF IPDVRLNGGD ENGRIIRVQL LDNGVPVDDS TVEVFLCWNR 
QPGVLIGDRV KMEAKDSDDG RIWQVAVPVA ACRMPGTVTL GFEVKRDKTI VCSRSFTAIV
EQPVFDAGSP EGKSYRQELE DTAQQAKDAT GKANALTDKV SHLIEQNETV SRNAQNAADA
ANNATSIAQQ AAQQAKDAAS EASQAAQNTQ NVISHATEVA QQCDASKQTA DQAAKRADDA
VSGLKQTVQN AAADAASKVQ QAVERANSAT QAVDAVREKT EAANKQTETD LAALREEVVK
AQRAGFTASS SAQKCDEAAQ AYRNVSGEVA QAKQTSEQAV EAANQALHTA QESAAAVAQA
QSVLDQVKDA SETAKRVVSA VEELKQTNNA ALEATRTANA QASAAADAAG KANNATSTAN
SAAQAANDAA GKVTQTLQES ETRFKAVEQA ANDAKSVAGT ANSTAEAARS TAEQAQSKAN
DAAGSAQRAQ NTANSAIEAT DNNKNRIDSM ESDVSSLKNS CSAAQSKAND AAQTASKAQS
VADSANSAAQ AAASKADSAQ QAVNNIRTPI VKPQSLTGYT TPSSWDWTLT DLKELPHGGH
ILIYPQADSI KEYMRQQPEF EIEEQNGVRT GKITVITHQP NTNGSALKLV FVWFAD