Gene HMPREF0424_0815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0815 
SymboluvrC 
ID8709052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp918567 
End bp920909 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content38% 
IMG OID646482916 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003374033 
Protein GI283783279 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGATA ATGATCTTAA TAAAAATTCT TATAAAGCAT ATGATTTGCA AGAAGAAAAT 
AGGATGAGAG ATAGTGAAAA ATGGCGTAAA ACGCAAGCAA TAATAGATGA ACAGTTTTCG
AAAGATTCAT CATTTGATGA TGAAAGAGTA AATGCATTAG GAGCTCCATT GCTTGGTGAT
ACCCGTGATT TATTCCGTCC AGCTTCTCGT GATATTCCAG CGCAACCTGG AGTATACAAG
TGGCGTGATG GCGAGGGGCG AGTTATTTAC GTTGGAAAAG CAAAAAATTT ACGCAATCGT
TTAGCGAATT ATTTTCAACC TCTTTACCAA CTTCATCCTA GAACTCAATC TATGGTTCTT
ACAGCTCGCA GTTTGGAATG GACTGTTGTA AGCACTGAAT TTGAGGCTCT TACACTTGAA
TATACTTGGA TTAAAGCTTT TAATCCTAGG TTTAATGTAG TTTTTCGTGA CGATAAAACA
TATCCTTATG TTGCAGTTTC GCTAGAGGAA ACTTTTCCAA GAGTATGGAT TACTAGAAAT
CGTAGTGCGC GGCATGCTAG ATATTTTGGT CCGTATGCAA AAGTGTGGCC TCTTCGTCAT
AGTTTAGATT CTTTACTTAA AACTTTTCCT ATTCGCACAT GCTCATCCTC TGTTTTTAGC
AAAGCTCATA GGACTGGTAG ACCTTGCTTA TTTGCTTCGA TTGGTAAATG TTCAGCTCCT
TGTATAGGAA ACATAGATCC TAGCGAACAT CGCAAGATGG TAGAACATGT AGTTGGCATA
CTTACAGGCA GTGTTGGAGA GTCTTATATT TCACAAATTA AGAATGAGAT GAAAGAAGCA
AGTGAAGAGC TAGAATTTGA GCGTGCAGCT AAATTGCGTG ACGAAATTGC TGTTTTGAAC
ACTATTTTGC AACAAAATGC AGTTGTGTTT GACAATGATG TAGAAGCAGA TGTTTTTGGA
TTCTGCGGAG ATGAGCTTGA AGCGTCAATT CATGTATTTT TTGTCAGAGC TGGAATGATT
CGTGGAGAGA AAAATTGGTC GGTAGAACGA AATGAGCATA TCAGTGACAG TGATCTTATA
GCTGATTTAT TAACTCGCGT ATACGCAGAA TATGAGCAAA ATTACACACA AAACCATGAA
GATACTATTA GCATTAAGAA AGTTCGCGAC GCTGTTAGCT CTACGCAGCA AGCTACTGCA
ACTGATGTAA TATTGCGAGC GCAAGCTACT AAAACTCGTC GTGAACGTCA AGAGAGAACA
GGTCGTGAGG ATTTACTTGC TCCTATTTCG CCTGTTCCTC GTGAGATTAT AGTTCCAATT
AAAATTTCTG ATGAACGTAA GCAAGAGCTT GAATTGTGGC TTAGCAATAT TCGTGGTTCT
CAAGTAAGTA TTAGAGTTGC AGAGCGCGGA GAAAAACGTG CTCTTATGGA TCGTGCAAAC
GATAATGCTA GTCAGGAATT GAAACGAATA AAATCTAGCC GTATTAACAG CATAGATATG
AGAACAGAAG CTATGAACGA AATAGCAAAA GCGCTTGGAA TGAGTAAATC TCCTTTGCGA
ATTGAATGTT ATGATATTTC AAATACAGTT GATGGCAGTT ATCAAGTTGC TTCTATGGTT
GTTTTTGAAG ATGGTGTTGC TCGTCCTAGT GAGTACCGTC ATTTTGCGGT TCGAGGTGAA
CATGGTGATG GTAAAATAGA TGATTTAAGT GCTGTTTATG AAACATTATT GCGAAGGTTT
AAGCATAAAG ATGTTTATGC AGAAAATAGG CTTGATGCAT CAGATTCATC AGATACATCA
GATACATCTG ACTCATCTGA TATTAATATT TGCGATGCTA ACAGTATTTC TAATAAGCAT
TTTTCAGGTA ATTCATTGAC TAGTGCGCGG CATTTTGCTT ATAAACCTCA ATTAATTATT
GTCGATGGAG GAAAAGAACA AGCAAAAGCT GCTAAAAGAG CTATGAGAGA TGCTGGCGTA
AGTGATATTA CTGTTTGTGG TTTAGCTAAG AAGCTTGAAG AAGTGTGGCT TCCTGACGAA
GATTATCCAA TCATTTTTAA GAGAAGCTCA GAAGGATTGT ATTTATTGCA GCGTGCTAGA
GACGAGTCTC ATCGTTTTGC TATTTCGTAT CATCGTAAAT TGCGAAGAAA AGGTTCATTG
CATTCTACTT TTGATGCAAT TCCTGGAATT GGAGCAGTGT ATCGCAAGCG TTTGCTTGCA
TCTTTTGGTT CAATAAAATC TTTGAAAAAT GCTTCATTAG AAGATTTGCA AAAAGTACCT
GGAATTGGCG AGAATAAAGC AAAAGCTATA TACAGTGCTT TGCGTAAAAA TAATGATAAA
TAG
 
Protein sequence
MKDNDLNKNS YKAYDLQEEN RMRDSEKWRK TQAIIDEQFS KDSSFDDERV NALGAPLLGD 
TRDLFRPASR DIPAQPGVYK WRDGEGRVIY VGKAKNLRNR LANYFQPLYQ LHPRTQSMVL
TARSLEWTVV STEFEALTLE YTWIKAFNPR FNVVFRDDKT YPYVAVSLEE TFPRVWITRN
RSARHARYFG PYAKVWPLRH SLDSLLKTFP IRTCSSSVFS KAHRTGRPCL FASIGKCSAP
CIGNIDPSEH RKMVEHVVGI LTGSVGESYI SQIKNEMKEA SEELEFERAA KLRDEIAVLN
TILQQNAVVF DNDVEADVFG FCGDELEASI HVFFVRAGMI RGEKNWSVER NEHISDSDLI
ADLLTRVYAE YEQNYTQNHE DTISIKKVRD AVSSTQQATA TDVILRAQAT KTRRERQERT
GREDLLAPIS PVPREIIVPI KISDERKQEL ELWLSNIRGS QVSIRVAERG EKRALMDRAN
DNASQELKRI KSSRINSIDM RTEAMNEIAK ALGMSKSPLR IECYDISNTV DGSYQVASMV
VFEDGVARPS EYRHFAVRGE HGDGKIDDLS AVYETLLRRF KHKDVYAENR LDASDSSDTS
DTSDSSDINI CDANSISNKH FSGNSLTSAR HFAYKPQLII VDGGKEQAKA AKRAMRDAGV
SDITVCGLAK KLEEVWLPDE DYPIIFKRSS EGLYLLQRAR DESHRFAISY HRKLRRKGSL
HSTFDAIPGI GAVYRKRLLA SFGSIKSLKN ASLEDLQKVP GIGENKAKAI YSALRKNNDK