Gene HMPREF0424_0781 details

Gene Information

Locus tag: HMPREF0424_0781
Symbol: ligA
ID: 8709562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 881971
End bp: 884859
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 40%
IMG OID: 646482882
Product: DNA ligase (NAD(+))
Protein accession: YP_003373999
Protein GI: 283783245
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID: [TIGR00575] DNA ligase, NAD-dependent
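
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 881971
end_bp = 884859

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2889, matching "Gene Length"
print(protein_length)   # 962, matching "Protein Length"
```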


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAA ATAGTATTCG CGAAGAGTCA ACAGAGAGCG AACAGTTATC GCTTTTTTAC 
GATTTTGACG AGCAAACTAT ATCTCATGAA GCGCCAAAAA ACGAAACTAA TAGCATAAGT
ACAGTTAAAA CAGAGACTGT TACTCAGGCT GATAGCAAAA CTGATAACAA AACTGACATG
CATGAAGCGA ATGAGTGGAA CTCTACAATA GACACCGAAA AGTGGATTGC AAACTTGAAA
CCAACAGACA CTGATGCGTT GGCTCTTGGA AACTTGCAGC CAGCGACGCT TACTATAGAG
CAAGCTGCGA AACTTTGGGC AAAGTTGGCA GCTTGGGCAC AATCTGATCA AATTGCTTAT
TACGTAGAAG ATTCTCCGAC TAGTTCAGAC GCAGCATATG ATGCAAGAAT GCGAGCGCTA
AGCGCTTTAG AAGCTGCCTT CCCAACTTTA GACACACCTC AATCTCCAAC ACATCGCGTA
GGTGGAACTT TTTCTAACGA TTTTACTTCT GTAAAGCATC CGTCTCGAAT GATGAGTTTG
GATGACGTAT TTTCAATTGA AGAATTACAT GATTGGTATA ACAGCGTAAT TCGCGATTTA
CAATGGGATG AGTCTCAACC TCTTCCTATG ACTTGCGAAG TAAAAATTGA TGGTCTTGCT
TTAAATCTTA TTTATCGCAA CGGTACTCTT GAACAAGGTC TTACTAGAGG CGATGGCGTA
ACTGGCGAGG ATATTACATT AAACGTAAGA ACGATTGAAG CAATACCAAC ACAGCTTCAT
TCTGATAATC CTGACGATAT TCCTGAGTTC GTGGAAATTC GTGGCGAAGT TTTTATGAAA
TGGGAAGACT TTAGAAAATT AAATGATGAA CAAGAAAATG CTGGACGAGT TGCTTTTGCA
AACCCTCGTA ACGCTGCAGC TGGCTCTTTG CGCCAAAAAG ACCCACGAAT TACAGCCACG
CGTCGATTAA GCTTTTTTGC ACACGGACTA GGTGAGCTTC GTTGGAAAGA AAATACAAGC
CATAACAATA GTGAAACAAA ATTTAATCAG TCTGACGCAT ATAAGCTCTA TCAACAGTGG
GGAATACCTG TTTCTCCGTA CACAAGAAAA GTCACAAATT TTTCTGAAAT AGAAGAAATG
ATTGACTATT ACGGAAAGCA TAGAGCAAAT ATTTTGCACG CTTTAGACGG CATAGTTGTA
AAAGTTGATG ACAGAGCATT ACAGCATCAA CTAGGAGCTA CTTCTAGAGC TCCACGATGG
GCTATTGCAT ACAAGTATCC TCCTGAAGAA GTTAATACTT ATTTGAAGGA TATTATTGTC
CAGGTTGGAA GAACTGGAAG AGTAACTCCT GTAGCTGTTC TTGAACCTGT TACTGTAGCA
GGATCGACAA TTTCTCGCAC AACTCTTCAT AACGCTTACG AAGTTGAACA TAAAGGTGTG
CTAATTGGCG ATACTGTCGT AGTTCGTAAA GCTGGGGATG TTATTCCAGA ATTAGTCGGA
CCGGTACTTA AAGCTAGGGA AGGGCGAGAA AATGAACTTA GAAAGTTTGT TATGCCAGAA
TACTGCCCAT CTTGTGGAAC TAAGCTAGCT CCTGCAAAAG AGGGAGATAA AGACATTCGT
TGCCCAAATG TAGAAAACTG CCCAGCTCAA CTTACAGAAC GAATAATTCA CTTGGCTTCC
AGACAAGCTT TTGATATTGA AAATTTGGGC GATAACGCTG CGCTAGCTCT TACAAATCCT
GAGGATTGCA GACCTACAAC TGCAGAAGTT TACTGCCCAG ATATGGATAA AATTATAATT
CAGCGAGGCG CAACTCAACA ACCATATATA CCGTCACCAG ATTTAACGCT ACCTGAACCA
CAAACTCCAG TACTTAGAAA TGAAGCTGGA TTATTTAGTA TTACTGCCGA CGATTTGCAA
AATGTAATGG TTTGGAAAGA AATTCCTCTT GTTGAAGAGT GGAAAGAAGT TAGTAAAGAC
GGTAGCTTTA AAAAGCGTAC ACGCAAAATA GGCGGATCTG GTCTTTGGCA TCAAGTTAGA
GCATTTTGGA CTCGCACAAT CGAAGCAAAG TTATCTACAA ATTACGAAGC AGAAGGAACT
TCCGAAAAAA CTACTTCACT AAACGAACAA TGGGATCCTC AATATCCTAA ATTCCAGGTT
CCAATTGACG CTAAAGTAGT ACTTTGGAAA AATAAGCGCA TTACTAGAAA TGCTAAAACT
AGCGACAACG CTAAAGAAAC AATAAATGTG CCATGGTATA CAAGACCTTC TGAAACTACT
CGTAGTATGC TTGAAGAAAT AGCTCAAAAA GGCAAGAATG CAGCACTGTG GCGAGTTTTG
GTTGCTTTAT CTATTCGTAG ACTAGGTCCA CCAACTGCAC GTCTTATAGC GGCAAATTTT
GGATCTTTAG ACAATATTTC CAAAGCATCT ATTGAAGAAC TTACACAAAT TGATGGAGTT
GGTCCTGAAA TTGCGCAAGC AGTATACAAC TGGTTCCAAC AAGCTAAAGA CCCTGCTAAT
TGGCAATTTG AAGTATTAAA ATCGTGGCAA GAAGCAGGTG TAGTAGGGAA AGTTGAAGCA
TCATCTTTCG CTCAAACATT AGTAGGTAAA ACAATAGTTG TTACAGGATC TTTGCAAGGA
TTTACACGGG ACAGCGCAAA AGAAGCTATA GTCTCAAGAG GCGGCAAAGC ATCAGGGTCA
GTAAGCAAGA ACACATATTG CGTTATTCTA GGCGAAAATG CAGGATCCAA GGCAACTAAA
GCGCAAGAAC TTGGCATTCC TATGCTTAAC GAACAACAGT TTAATACTCT TTTAAAAACT
GGGAACTTAG AAGAAATACT ACAAATTGCA AATAATACTG TGCCAAATCT CATTGCTGAG
GAATCATAA
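
The gene sequence is displayed in space-separated blocks of 10 bases, 60 bases per line, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names `join_blocks` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in an already-joined sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAGTGAAA ATAGTATTCG CGAAGAGTCA ACAGAGAGCG AACAGTTATC GCTTTTTTAC"
sequence = join_blocks(first_line)

print(len(sequence))         # 60 bases per displayed line
print(gc_percent(sequence))  # GC percentage of this line only, not the whole gene
```

Passing the full gene sequence to `join_blocks` and then `gc_percent` would give a value to compare against the rounded figure in the record.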
 
Protein sequence
MSENSIREES TESEQLSLFY DFDEQTISHE APKNETNSIS TVKTETVTQA DSKTDNKTDM 
HEANEWNSTI DTEKWIANLK PTDTDALALG NLQPATLTIE QAAKLWAKLA AWAQSDQIAY
YVEDSPTSSD AAYDARMRAL SALEAAFPTL DTPQSPTHRV GGTFSNDFTS VKHPSRMMSL
DDVFSIEELH DWYNSVIRDL QWDESQPLPM TCEVKIDGLA LNLIYRNGTL EQGLTRGDGV
TGEDITLNVR TIEAIPTQLH SDNPDDIPEF VEIRGEVFMK WEDFRKLNDE QENAGRVAFA
NPRNAAAGSL RQKDPRITAT RRLSFFAHGL GELRWKENTS HNNSETKFNQ SDAYKLYQQW
GIPVSPYTRK VTNFSEIEEM IDYYGKHRAN ILHALDGIVV KVDDRALQHQ LGATSRAPRW
AIAYKYPPEE VNTYLKDIIV QVGRTGRVTP VAVLEPVTVA GSTISRTTLH NAYEVEHKGV
LIGDTVVVRK AGDVIPELVG PVLKAREGRE NELRKFVMPE YCPSCGTKLA PAKEGDKDIR
CPNVENCPAQ LTERIIHLAS RQAFDIENLG DNAALALTNP EDCRPTTAEV YCPDMDKIII
QRGATQQPYI PSPDLTLPEP QTPVLRNEAG LFSITADDLQ NVMVWKEIPL VEEWKEVSKD
GSFKKRTRKI GGSGLWHQVR AFWTRTIEAK LSTNYEAEGT SEKTTSLNEQ WDPQYPKFQV
PIDAKVVLWK NKRITRNAKT SDNAKETINV PWYTRPSETT RSMLEEIAQK GKNAALWRVL
VALSIRRLGP PTARLIAANF GSLDNISKAS IEELTQIDGV GPEIAQAVYN WFQQAKDPAN
WQFEVLKSWQ EAGVVGKVEA SSFAQTLVGK TIVVTGSLQG FTRDSAKEAI VSRGGKASGS
VSKNTYCVIL GENAGSKATK AQELGIPMLN EQQFNTLLKT GNLEEILQIA NNTVPNLIAE
ES
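
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence, copied from the record.
print(translate("ATGAGTGAAA ATAGTATTCG CGAAGAGTCA"))  # MSENSIREES
print(translate("GAATCATAA"))                         # ES
```

Both outputs match the first ten and last two residues of the protein sequence listed above.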