Gene ECD_01752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01752 
SymbolyeaG 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1811411 
End bp1813219 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content50% 
IMG OID 
Productconserved protein with nucleoside triphosphate hydrolase domain 
Protein accessionACT43606 
Protein GI253977936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATGGCTA TCGGTGAGCC TGTCATGGTC GATACAGCCC AGGAACCCAG ACTTTCTCGA 
CTCTTTTCTA ACCGGGTCAT TGCACGTTAT CCGGCGTTTG AAGAGTTTTA CGGCATGGAA
GACGCGATTG AACAGATTGT CTCTTATCTG AAACACGCGG CTCAGGGGCT GGAAGAGAAG
AAACAAATCC TGTATCTGCT GGGGCCTGTG GGTGGGGGTA AATCATCGCT TGCTGAGCGA
CTGAAATCAT TAATGCAGCT CGTACCGATT TATGTATTGA GCGCGAACGG TGAGCGTAGC
CCGGTCAACG ATCATCCGTT CTGTCTTTTC AATCCGCAGG AAGATGCGCA GATTCTGGAA
AAAGAGTATG GCATTCCTCG CCGTTATCTC GGCACCATCA TGTCGCCGTG GGCGGCAAAA
CGCCTGCATG AATTTGGTGG CGATATCACT AAGTTCCGGG TAGTGAAGGT CTGGCCGTCA
ATTCTGCAAC AAATTGCTAT CGCCAAAACG GAACCCGGCG ATGAGAACAA CCAGGACATC
TCCGCGCTGG TTGGGAAAGT CGATATTCGT AAACTCGAAC ACTACGCGCA GAATGACCCG
GACGCCTACG GCTATTCCGG TGCGCTGTGC CGCGCCAACC AGGGGATCAT GGAATTCGTT
GAGATGTTTA AAGCACCGAT TAAAGTGCTG CATCCCTTGT TAACCGCCAC TCAGGAAGGT
AACTACAACG GGACGGAAGG TATCTCCGCC CTGCCGTTCA ACGGGATTAT TCTTGCCCAC
TCGAACGAGT CCGAATGGGT CACTTTCCGT AATAACAAAA ACAACGAAGC CTTCCTCGAT
CGTGTTTACA TCGTGAAGGT GCCGTATTGC TTGCGCATTT CCGAAGAGAT CAAAATCTAC
GAGAAATTGC TTAATCACAG TGAATTGACT CACGCCCCAT GCGCCCCTGG CACGCTCGAA
ACACTGTCAC GTTTTTCCAT TCTTTCGCGC CTGAAAGAGC CAGAAAACTC CAGCATTTAT
TCAAAGATGC GGGTTTATGA TGGCGAAAGT CTGAAAGACA CCGATCCCAA AGCCAAGTCG
TATCAGGAAT ATCGTGACTA CGCCGGTGTC GATGAAGGGA TGAACGGTCT GTCGACGCGT
TTTGCGTTTA AGATCCTCTC CCGCGTGTTC AACTTCGATC ATGTAGAAGT GGCAGCAAAC
CCGGTCCATC TGTTCTACGT CCTGGAACAG CAGATTGAGC GCGAGCAGTT CCCACAAGAG
CAGGCAGAAC GCTATCTGGA GTTCCTGAAA GGTTATCTGA TCCCGAAATA TGCCGAGTTT
ATCGGCAAAG AGATCCAGAC GGCCTACCTT GAATCCTATT CCGAATATGG GCAAAACATT
TTCGACCGTT ATGTTACCTA CGCGGATTTC TGGATTCAGG ATCAGGAGTA TCGCGATCCG
GATACCGGGC AGCTGTTTGA CCGCGAGTCT CTTAACGCCG AGCTGGAGAA AATCGAGAAA
CCGGCGGGGA TCAGTAATCC AAAAGATTTC CGCAACGAGA TTGTTAACTT CGTACTGCGC
GCCAGAGCGA ATAACAGCGG ACGCAATCCG AACTGGACCA GCTATGAAAA ACTGCGCACG
GTCATCGAGA AGAAAATGTT CTCCAATACC GAGGAGCTGT TGCCGGTTAT CTCGTTTAAC
GCCAAAACGT CAACCGACGA GCAGAAGAAA CACGACGACT TTGTCGACCG TATGATGGAA
AAAGGCTACA CCCGTAAACA GGTGCGTTTA CTGTGCGAAT GGTATTTGCG CGTACGTAAA
TCGTCTTAA
 
Protein sequence
MMAIGEPVMV DTAQEPRLSR LFSNRVIARY PAFEEFYGME DAIEQIVSYL KHAAQGLEEK 
KQILYLLGPV GGGKSSLAER LKSLMQLVPI YVLSANGERS PVNDHPFCLF NPQEDAQILE
KEYGIPRRYL GTIMSPWAAK RLHEFGGDIT KFRVVKVWPS ILQQIAIAKT EPGDENNQDI
SALVGKVDIR KLEHYAQNDP DAYGYSGALC RANQGIMEFV EMFKAPIKVL HPLLTATQEG
NYNGTEGISA LPFNGIILAH SNESEWVTFR NNKNNEAFLD RVYIVKVPYC LRISEEIKIY
EKLLNHSELT HAPCAPGTLE TLSRFSILSR LKEPENSSIY SKMRVYDGES LKDTDPKAKS
YQEYRDYAGV DEGMNGLSTR FAFKILSRVF NFDHVEVAAN PVHLFYVLEQ QIEREQFPQE
QAERYLEFLK GYLIPKYAEF IGKEIQTAYL ESYSEYGQNI FDRYVTYADF WIQDQEYRDP
DTGQLFDRES LNAELEKIEK PAGISNPKDF RNEIVNFVLR ARANNSGRNP NWTSYEKLRT
VIEKKMFSNT EELLPVISFN AKTSTDEQKK HDDFVDRMME KGYTRKQVRL LCEWYLRVRK
SS