Gene EcolC_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1168 
SymbolxseA 
ID6066491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1277147 
End bp1278517 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID641600584 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_001724162 
Protein GI170019208 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000633307 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0126087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCTT CTCAATCCCC TGCAATTTTT ACCGTTAGTC GCCTGAATCA AACGGTTCGT 
CTGCTGCTTG AGCATGAGAT GGGACAGGTT TGGATCAGCG GCGAAATCTC TAATTTCACA
CAACCGGCTT CCGGTCACTG GTACTTTACA CTCAAAGACG ACACCGCCCA GGTACGCTGC
GCGATGTTCC GCAACAGCAA CCGCCGGGTG ACCTTCCGCC CACAGCATGG GCAACAAGTT
TTAGTTCGCG CCAATATTAC GCTCTACGAG CCGCGCGGCG ACTACCAGAT AATCGTTGAG
AGTATGCAGC CGGCCGGTGA AGGGCTGCTG CAACAGAAGT ACGAACAGCT CAAAGCGAAG
TTGCAGGCTG AAGGTTTGTT CGATCAGCAA TACAAAAAAC CACTTCCCTC CCCTGCGCAT
TGCGTTGGTG TGATCACCTC AAAAACCGGT GCTGCGCTAC ATGATATTTT GCATGTGTTA
AAACGTCGCG ATCCTTCTCT ACCGGTGATC ATCTACCCCA CCGCCGTTCA GGGCGATGAC
GCGCCGGGGC AAATTGTTCG CGCCATTGAA CTGGCGAATC AGCGCAATGA GTGCGACGTG
TTGATCGTTG GGCGCGGCGG CGGTTCGCTG GAAGATTTAT GGAGTTTTAA CGACGAACGC
GTAGCGCGGG CTATTTTTGC CAGCCGCATT CCGGTCGTCA GCGCCGTCGG GCATGAGACG
GATGTGACCA TTGCCGATTT TGTTGCCGAT CTGCGTGCGC CAACGCCGTC TGCCGCCGCT
GAAGTAGTAA GCCGTAATCA GCAAGAGTTA CTGCGCCAGG TGCAATCGGC CCGTCAACGG
CTGGAGATGG CGATGGATTA TTATCTCGCC AACCGTACGC GTCGCTTTAC GCAGATTCAT
CATCGCTTGC AGCAGCAGCA TCCGCAGCTC CGGCTGGCAC GCCAGCAAAC CATGCTTGAA
CGCCTGAAAA AACGGATGAG CTTTGCGCTG GAAAATCAGC TTAAGCGTGC CGGGCAACAG
CAGCAGCGAT TAACACAGCG GCTGAATCAG CAAAATCCAC AGCCGAAGAT TCATCGCGCG
CAAACGCGCA TTCAGCAACT GGAATATCGT TTAGCAGAAA TCCTGCGCGC ACAGCTTAGC
GCCACGCGTG AACGTTTCGG TAATGCAGTA ACGCACCTCG AAGCCGTAAG CCCACTGTCA
ACGCTGGCGC GTGGATACAG CGTTACTACT GCTACTGACG GCAATGTACT GAAAAAAGTG
AAGCAAGTTA AAGCGGGTGA AATGCTAACC ACACGTCTGG AAGACGGCTG GATAGAAAGT
GAAGTAAAAA ACATCCAGCC GGTAAAAAAA TCGCGTAAAA AGGTTCATTA A
 
Protein sequence
MLPSQSPAIF TVSRLNQTVR LLLEHEMGQV WISGEISNFT QPASGHWYFT LKDDTAQVRC 
AMFRNSNRRV TFRPQHGQQV LVRANITLYE PRGDYQIIVE SMQPAGEGLL QQKYEQLKAK
LQAEGLFDQQ YKKPLPSPAH CVGVITSKTG AALHDILHVL KRRDPSLPVI IYPTAVQGDD
APGQIVRAIE LANQRNECDV LIVGRGGGSL EDLWSFNDER VARAIFASRI PVVSAVGHET
DVTIADFVAD LRAPTPSAAA EVVSRNQQEL LRQVQSARQR LEMAMDYYLA NRTRRFTQIH
HRLQQQHPQL RLARQQTMLE RLKKRMSFAL ENQLKRAGQQ QQRLTQRLNQ QNPQPKIHRA
QTRIQQLEYR LAEILRAQLS ATRERFGNAV THLEAVSPLS TLARGYSVTT ATDGNVLKKV
KQVKAGEMLT TRLEDGWIES EVKNIQPVKK SRKKVH