Gene HMPREF0424_0422 details


Gene Information

Locus tag: HMPREF0424_0422
Symbol:
ID: 8708777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 454072
End bp: 455217
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 48%
IMG OID: 646482537
Product: sortase family protein
Protein accession: YP_003373669
Protein GI: 283782915
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3764] Sortase (surface protein transpeptidase)
TIGRFAM ID: [TIGR01076] LPXTG-site transpeptidase (sortase) family protein
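
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(454072, 455217)
print(length)                     # 1146
print(protein_length_aa(length))  # 381
```

Both values agree with the Gene Length and Protein Length fields of this record.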


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.102959
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAA ACGACGCTGT TGCTAAAGTA TATGCTTTCC TCAACAAAAT CTCCCAAAAT 
AAGCTTTTCA AAAACCAAAC AAACGCAACC CAAACTAACG AAACCAACTT AACCGAAACC
AACTTAACCG AAACCAACTT AACCGAATCA AACGCAACCG AAAATAACGC AACTCAAGCC
CACGCAACCG AAGCAAAATC CGCCGAAAGT AATCAAAAAT CTAAGCAAAA AACCAATCGA
AAATCTAAGC TCCGCCGCAT CGTCGAGCCA ATCGCATTCG TGTTGGCAGG CATTTTGTGC
TTCAGTTACC CTGTTGTTTC AACACTTTGG AACAATCGCG TGTCGAAGGA AATTTCTAAC
GCGTACGACA AGTATAACCA CGATCAGGCT GGTGATGTGC GCCGCGCTCA CATTCGCGCA
GCGAAGCTTT ACAATAAAAG TCGCAAGAAT ATGCTTACCA CGGATCCGTA TGGTCCGGAT
GGTCAAAAAG ACGTAACTAA CACGCCTGAA TACAAGCGTT ATCTTAAGGC ACTTGAGGAG
CCTATGGGCA TTATCGGCAT CGTAAAAATT CCGAAAATTG GCGTAAAACT TCCTATTTAT
CACGGCAGTT CGCAGGAAGT TTTAGCGCAC GGCGCTGGTC ATTTGTACGG CACAGATTTG
CCGGTTGGTG GCAAGAACCG CCACACAGTT ATTACCGCGC ACACGGGTCT TGCGGATGCA
ACCATGTTCG ATGATTTGGT GAATTTAAAG AAGGGCGACT ACTTCTACCT CGACGTGCAA
GGCGAAACTT TACGATACAA AGTGTTCCGC ATCAGCGTGG TTGAGCCACA CGATATTAGT
TTGTTGCAGC GCGAAAAGGG TCGCGACTTG GCGACGCTGC TAACGTGTAC CCCGTATGGT
GTGAACTCGC ATAGGCTTTT GGTGACGGGG TATCGTGTGC TGCCGGACCC TGTGAAGCCG
CCGGATGACC ACGTGCAATG GCCGCTTTGG ATGACGCTAT TCGTGATTGC AATGGCGTTC
TCATTGATTG TTTTGTCCAT GATGATTGCT GCTGCAACGT CTAAGCGAGG GCGACAGCTC
GACATTCGCG GCAAGCATTT GCTGATCCTT TCGCGCAAAA TGCTGCGTAA GTTGCGTCGC
GAGTAA
 
Protein sequence
MKLNDAVAKV YAFLNKISQN KLFKNQTNAT QTNETNLTET NLTETNLTES NATENNATQA 
HATEAKSAES NQKSKQKTNR KSKLRRIVEP IAFVLAGILC FSYPVVSTLW NNRVSKEISN
AYDKYNHDQA GDVRRAHIRA AKLYNKSRKN MLTTDPYGPD GQKDVTNTPE YKRYLKALEE
PMGIIGIVKI PKIGVKLPIY HGSSQEVLAH GAGHLYGTDL PVGGKNRHTV ITAHTGLADA
TMFDDLVNLK KGDYFYLDVQ GETLRYKVFR ISVVEPHDIS LLQREKGRDL ATLLTCTPYG
VNSHRLLVTG YRVLPDPVKP PDDHVQWPLW MTLFVIAMAF SLIVLSMMIA AATSKRGRQL
DIRGKHLLIL SRKMLRKLRR E
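
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that relationship, not the pipeline that produced this record. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, and this gene begins with ATG, so no special handling is attempted). The `translate` function and `CODON_TABLE` name are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, with the spacing used in this record.
first_line = "ATGAAATTAA ACGACGCTGT TGCTAAAGTA TATGCTTTCC TCAACAAAAT CTCCCAAAAT"
print(translate(first_line))  # MKLNDAVAKVYAFLNKISQN
```

The output matches the first 20 residues of the protein sequence. The final six bases of the gene, GAGTAA, translate to the last residue (E) followed by a stop codon.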