Gene ECH_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0301 
SymbolligA 
ID3927385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp287917 
End bp289947 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content31% 
IMG OID637901425 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_507122 
Protein GI88658391 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.10932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGACC AGGAAAAAGC TAAATTAGAG TTAGATAAAT TAAATAAGCA AATACAGCAC 
CATGATGTTC TTTATTATGT ACAGGACAAT CCAGAAATCA GTGATGCTGA ATATGATGAG
TTATGTCGCA AAAGAAATTT AATTCTTAAC GCCTTTCCAA AACTCACACA AAGTAACGAT
TACCAAAACA ATATAGGTAG TTCTCCAGAC ACAAAGTTTG CTAAGGTTAA ACATGAAGAG
AAAATGTTTT CCTTAGATAA CGCTTTTAAT CAACAAGATG TAGAGAAATT CATCACAAGA
ACTAAAAAGT TCTTAAACAT CAATGACAAT CAATCAATAC CATTATCTTG CGAATTAAAA
ATTGATGGAC TATCATTTTC CGTCATATAT AAAAAAGGAG AAATCTATCA AGCATCCACA
AGAGGTAATG GATATTTTGG TGAAAATATT ACAACCAATG TCAAAACTAT AAAAAATCTA
CCACATATTA TTCACAATGC ACCAGATCTT CTAGAAGTCA GGGGAGAAAT ATATATAGAT
CGCTGTGATT TTATACAACT TAATAATGAT GGGAATAATT TTGCAAATCC ACGCAACGCC
GCAGCCGGAT CTGTAAGACA ACTAGATCAT AACATTACTG CCCAACGTAA ACTAAAATAT
TTTATGTATA CAATAGTCAA CACTGCATGT CTAACACAAG AAGAATTATT ATACCAATTA
AAAATCTGGG GCTTTTGTGT AAATGAACAT ACTATTACTA CAGATAATAT TGAAGAGGCT
TTTAATTTTT ACAATCAAGT TTATAACAAT AGAAGTAATA TCAACTATGA TATTGATGGT
ATAATATACA AAGTTAACGA TATCAAATCA CAACATATCT TAGGAACAAC AAGTAAATCA
CCAAGATGGG CTATAGCATA TAAGTTTCCT GCTGCAGAAG CAAAAACCCA ATTAAATAAA
ATTTCCATAC AAATTGGCAG GACTGGTGTA CTTACACCCA TTGCAGAACT TTCTCCTATA
AATATTGGTG GGGTCATAGT TACAAGAGCA AGTTTACACA ACAAAAATGA AATAGAGCGT
AAAGATATAC GTGAAGGCGA CTATGTTATT GTAAAAAGAG CAGGAGATGT AATACCTCAA
ATCGTGGATG TAGATAAAAA TTTAAGAACT CAAGAACTAA CAAAATTCGT TTTCCCAACA
ACATGTCCAT CATGTGGTAG TAGTGTATAT CAAGCAAAGC AAGAAGCATC AATATACTGT
ACTGGTGAAT TATTTTGTAA AGATCAAATC CTGGAAAAAA TAAAGCATTT TGTTTCAAAA
GACGCTTTTA ACATTATTGG ATTTGGGAAA AAACAGCTCT TATTTTTCTA TGAACAAAGA
TTAATAACAA ATATTACCGA CATATTCACA TTAGAAGAAA AAATAAGTAA TAGTGACGTA
AAATTAGAGT CATTACATGG GTGGGGTGAA AAATCCATGA GTAACCTATT TTCTGCTATA
AATAACAGTA AAGTAATCAG CTTGGAAAAT TTTATTTTTG CGTTAGGAAT AAGATTCATA
GGAAAGTACA TAGCAAAAAT ACTAGCTAAC CATTTTTCAT CATATGAAAC ATGGTATAAT
GAGATGCTCA AGTTAGCACA GAATGAAGAT TACTCGCTAA ACATACAACA GATAGGGTCC
AAAACTATTG GTTCACTCAA AATGTTTTTT AGTCAACCAC ATAATTTAAA TATGATTAAT
GATTTAGTAA AACACCTAAC AATAACAGAT ACACAGTCTA GCTCTTATCT ATCACTCATT
CATGGAAAGA TCATAGTTTT TACTGGAGAA TTATCAAGTA TGTCAAGATC TGAAGCAAAA
ATAAGGTCAG AAACTGCAGG AGCAAAAGTT TCTTCATCAT TATCAAAAAA TACAGATTTC
CTAATTGCAG GAAACAATCC AGGATCAAAA TATAAAAAAG CACAATCTCT CAATGTACAG
ATTTTAACTG AAGATTTATG GTTACAATAT ACCCAATCAA GTGAAAATTG A
 
Protein sequence
MVDQEKAKLE LDKLNKQIQH HDVLYYVQDN PEISDAEYDE LCRKRNLILN AFPKLTQSND 
YQNNIGSSPD TKFAKVKHEE KMFSLDNAFN QQDVEKFITR TKKFLNINDN QSIPLSCELK
IDGLSFSVIY KKGEIYQAST RGNGYFGENI TTNVKTIKNL PHIIHNAPDL LEVRGEIYID
RCDFIQLNND GNNFANPRNA AAGSVRQLDH NITAQRKLKY FMYTIVNTAC LTQEELLYQL
KIWGFCVNEH TITTDNIEEA FNFYNQVYNN RSNINYDIDG IIYKVNDIKS QHILGTTSKS
PRWAIAYKFP AAEAKTQLNK ISIQIGRTGV LTPIAELSPI NIGGVIVTRA SLHNKNEIER
KDIREGDYVI VKRAGDVIPQ IVDVDKNLRT QELTKFVFPT TCPSCGSSVY QAKQEASIYC
TGELFCKDQI LEKIKHFVSK DAFNIIGFGK KQLLFFYEQR LITNITDIFT LEEKISNSDV
KLESLHGWGE KSMSNLFSAI NNSKVISLEN FIFALGIRFI GKYIAKILAN HFSSYETWYN
EMLKLAQNED YSLNIQQIGS KTIGSLKMFF SQPHNLNMIN DLVKHLTITD TQSSSYLSLI
HGKIIVFTGE LSSMSRSEAK IRSETAGAKV SSSLSKNTDF LIAGNNPGSK YKKAQSLNVQ
ILTEDLWLQY TQSSEN