Gene ECD_03689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03689 
SymboluvrD 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3888260 
End bp3890422 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content58% 
IMG OID 
ProductDNA-dependent ATPase I and helicase II 
Protein accessionACT45482 
Protein GI253979812 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTTT CTTACCTGCT CGACAGCCTT AATGACAAAC AGCGCGAAGC GGTGGCCGCG 
CCACGCAGCA ACCTTCTGGT GCTGGCGGGC GCGGGCAGTG GTAAGACGCG CGTACTGGTG
CATCGTATCG CCTGGCTGAT GAGCGTGGAA AACTGCTCTC CATACTCGAT TATGGCGGTG
ACGTTTACCA ACAAAGCGGC GGCGGAAATG CGTCATCGTA TCGGGCAACT GATGGGCACC
AGCCAGGGCG GCATGTGGGT AGGCACCTTC CACGGGCTGG CGCACCGCCT GCTGCGTGCG
CACCATATGG ACGCCAATCT GCCGCAGGAT TTCCAGATCC TCGACAGTGA AGACCAGCTG
CGCCTGCTTA AGCGTTTGAT CAAGGCGATG AACCTCGACG AGAAGCAGTG GCCGCCGCGC
CAGGCAATGT GGTACATCAA CAGCCAGAAA GATGAAGGCC TGCGTCCGCA TCATATTCAA
AGCTACGGTA ATCCGGTGGA GCAGACCTGG CAGAAGGTGT ATCAGGCGTA TCAGGAAGCG
TGCGATCGTG CGGGTCTGGT GGACTTCGCC GAGTTGCTGC TGCGCGCTCA CGAGTTGTGG
CTTAACAAGC CGCATATCCT GCAACACTAC CGCGAACGTT TTACCAATAT CCTGGTGGAC
GAATTCCAGG ATACCAACAA CATTCAGTAC GCGTGGATCC GCCTGCTGGC GGGCGACACC
GGCAAAGTGA TGATCGTCGG TGATGACGAC CAGTCAATCT ACGGCTGGCG CGGGGCGCAG
GTGGAGAATA TTCAGCGTTT CCTTAATGAT TTCCCCGGTG CCGAAACTAT TCGTCTGGAG
CAAAACTACC GCTCTACCAG CAATATTCTG AGCGCCGCTA ACGCTCTGAT TGAAAACAAT
AACGGGCGTC TGGGTAAAAA ACTGTGGACC GATGGCGCGG ACGGTGAGCC TATTTCCCTC
TATTGTGCTT TTAACGAACT CGATGAAGCG CGTTTTGTGG TTAACCGCAT CAAAACCTGG
CAGGACAACG GCGGGGCGCT TGCCGAGTGC GCCATTCTCT ACCGCAGCAA CGCCCAGTCG
CGTGTACTGG AAGAGGCCCT ATTACAGGCG AGTATGCCGT ACCGTATTTA CGGCGGGATG
CGCTTCTTCG AACGCCAGGA AATCAAAGAT GCGCTCTCGT ATCTGCGCCT GATTGCCAAC
CGCAACGACG ACGCGGCCTT TGAGCGCGTA GTGAATACAC CAACGCGGGG TATTGGTGAC
CGGACGCTGG ACGTGGTACG TCAGACATCG CGCGATCGCC AGTTAACACT CTGGCAGGCA
TGTCGTGAAC TGTTGCAGGA AAAAGCCCTC GCCGGACGTG CTGCCAGCGC CTTACAGCGG
TTTATGGAAC TGATCGACGC CTTAGCGCAG GAAACTGCCG ATATGCCGCT GCATGTACAG
ACTGACCGGG TAATTAAAGA CTCCGGCCTG CGCACCATGT ACGAGCAGGA GAAGGGCGAA
AAAGGTCAGA CGCGTATCGA AAACTTAGAG GAACTGGTGA CGGCAACGCG CCAGTTCAGC
TACAACGAAG AAGACGAAGA TTTAATGCCG CTGCAGGCAT TCCTCTCCCA TGCGGCGCTG
GAAGCGGGCG AGGGGCAGGC GGATACCTGG CAGGACGCGG TGCAGTTGAT GACGCTACAC
TCGGCGAAAG GGCTGGAGTT CCCGCAGGTG TTTATCGTCG GTATGGAAGA GGGCATGTTC
CCAAGCCAGA TGTCGCTGGA TGAAGGCGGA CGTCTGGAAG AAGAACGCCG TCTGGCCTAC
GTTGGCGTAA CCCGTGCGAT GCAGAAACTG ACGCTGACCT ACGCGGAAAC TCGCCGTCTG
TATGGCAAAG AGGTTTACCA TCGCCCGTCG CGCTTTATCG GTGAGTTGCC GGAAGAGTGT
GTGGAAGAGG TGCGCCTGCG CGCCACGGTA AGCCGCCCGG TCAGCCATCA GCGTATGGGT
ACGCCGATGG TCGAGAACGA CAGCGGCTAC AAGCTCGGCC AGCGTGTACG CCACGCTAAG
TTTGGTGAAG GCACCATCGT CAATATGGAA GGCAGCGGTG AACATAGCCG TTTGCAGGTG
GCATTCCAGG GCCAGGGAAT CAAATGGCTG GTGGCGGCTT ACGCCCGGCT GGAGACGGTG
TAA
 
Protein sequence
MDVSYLLDSL NDKQREAVAA PRSNLLVLAG AGSGKTRVLV HRIAWLMSVE NCSPYSIMAV 
TFTNKAAAEM RHRIGQLMGT SQGGMWVGTF HGLAHRLLRA HHMDANLPQD FQILDSEDQL
RLLKRLIKAM NLDEKQWPPR QAMWYINSQK DEGLRPHHIQ SYGNPVEQTW QKVYQAYQEA
CDRAGLVDFA ELLLRAHELW LNKPHILQHY RERFTNILVD EFQDTNNIQY AWIRLLAGDT
GKVMIVGDDD QSIYGWRGAQ VENIQRFLND FPGAETIRLE QNYRSTSNIL SAANALIENN
NGRLGKKLWT DGADGEPISL YCAFNELDEA RFVVNRIKTW QDNGGALAEC AILYRSNAQS
RVLEEALLQA SMPYRIYGGM RFFERQEIKD ALSYLRLIAN RNDDAAFERV VNTPTRGIGD
RTLDVVRQTS RDRQLTLWQA CRELLQEKAL AGRAASALQR FMELIDALAQ ETADMPLHVQ
TDRVIKDSGL RTMYEQEKGE KGQTRIENLE ELVTATRQFS YNEEDEDLMP LQAFLSHAAL
EAGEGQADTW QDAVQLMTLH SAKGLEFPQV FIVGMEEGMF PSQMSLDEGG RLEEERRLAY
VGVTRAMQKL TLTYAETRRL YGKEVYHRPS RFIGELPEEC VEEVRLRATV SRPVSHQRMG
TPMVENDSGY KLGQRVRHAK FGEGTIVNME GSGEHSRLQV AFQGQGIKWL VAAYARLETV