Gene ECD_02401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02401 
SymbolxseA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2498712 
End bp2500082 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID 
Productexodeoxyribonuclease VII large subunit 
Protein accessionACT44221 
Protein GI253978551 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000149221 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTT TGGATCAGCG GCGAAATTTC TAATTTCACG
CAACCAGCTT CCGGTCACTG GTACTTTACA CTCAAAGACG ACACCGCCCA GGTACGCTGC
GCGATGTTCC GCAACAGCAA CCGCCGGGTG ACCTTCCGCC CACAGCATGG GCAACAAGTT
TTAGTTCGCG CCAATATTAC GCTCTACGAG CCGCGCGGCG ACTACCAGAT AATCGTTGAG
AGTATGCAGC CGGCCGGTGA AGGGCTGCTG CAACAGAAGT ACGAACAGCT CAAAGCGAAG
TTGCAGGCTG AAGGTTTGTT CGATCAGCAA TACAAAAAAC CACTTCCCTC CCCTGCGCAT
TGCGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCTTCTCT GCCGGTGATC ATCTACCCTG CCGCCGTTCA GGGCGATGAC
GCGCCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGCGCAATGA GTGCGACGTA
TTGATCGTCG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CGATTTTTAC CAGCCGCATT CCGGTTGTCA GCGCCGTCGG GCATGAGACG
GATGTGACCA TTGCCGATTT TGTTGCCGAT CTGCGTGCGC CAACGCCGTC TGCCGCCGCT
GAAGTAGTGA GCCGTAATCA GCAAGAGTTA CTGCGCCAGG TGCAATCGAC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGCACAC GTCGCTTTAC GCAAATTCAT
CACCGATTAC AGCAACAGCA TCCGCAGCTC CGGCTGGCAC GCCAGCAAAC CATGCTTGAG
CGCCTGCAAA AGCGAATGAG CTTTGCGCTG GAAAATCAAC TTAAGCGTAC CGGGCAACAG
CAGCAGCGGT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA CCCTGCGCGC ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCACCTCG AAGCCGTAAG CCCACTGTCA
ACGCTGGCGC GTGGATACAG CGTTACTACT GCTACTGACG GCAATGTACT GAAAAAAGTG
AAGCAAGTTA AAGCGGGTGA AATGCTAACC ACACGTCTGG AAGACGGCTG GATAGAAAGT
GAAGTAAAAA ACATCCAGCC AGTAAAAAAA TCGCGTAAAA AGGTGCATTA A
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDQQ YKKPLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPAAVQGDD
APGQIVRAIE LANQRNECDV LIVGRGGGSL EDLWSFNDER VARAIFTSRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSTRQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLQKRMSFAL ENQLKRTGQQ QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAETLRAQLS ATRERFGNAV THLEAVSPLS TLARGYSVTT ATDGNVLKKV
KQVKAGEMLT TRLEDGWIES EVKNIQPVKK SRKKVH