Gene EcolC_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2054 
Symbol 
ID6067715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2265423 
End bp2267900 
Gene Length2478 bp 
Protein Length825 aa 
Translation table11 
GC content49% 
IMG OID641601466 
Productexonuclease RNase T and DNA polymerase III 
Protein accessionYP_001725025 
Protein GI170020071 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0160649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAG TCTTTATTTG CGCCGCTATT CCTGACGAAC TGGCAACAAG GGAAGAAGGC 
GCTGTGGCTG TAGCCACAGC CATTGAAGCT GGCGACGAAC GCCGTGCTCG AGCAAAATTT
CACTGGCAAT TCCTGGAACA TTATCCGGCT GCTCAGGACT GCGCTTATCA ATTTATTGTC
TGCGAGGATA AACCTGGCAT ACCCCGCCCT GCCCTCGATT CATGGGATGC TGAATATATG
CAGGAAAACC GCTGGGATGA GGAGTCTGCT TCTTTTGTCC CGGTTGAGAC TGAATCCGAT
CCGATGAACG TCACTTTTGA CAAGCTGGCC CCTGAAGTAC AGAACGCTGT CATGGTTAAG
TTCGACACAT GTGAAAACAT CACCGTTGAT ATGGTTATTA GCGCACAGGA ATTGTTGCAG
GAAGACATGG CAACATTCGA CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGAAGTT
AACGCCATGT ATCCGGAGCT TAAGTTGCAC GCCATTGGGT GGGTTAAGCA TAAATGTATT
CCTGGTGCTA AATGGCCCGA AATTCAGGCA GAGATGCGCA TCTGGAAAAA ACGTCGCGAA
GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCGCCCG CGCCAGAGCC
AGAGCCAATC AACAGTACAC TGAAAATTCA ACAGGAAAAA TCAGCCCGGT CATTGCTGCC
ATTCATCGCG AATACAAGCA GACATGGAAA ACACTGGATG ACGAACTGGC CTACGCTCTC
TGGCCTAGTG ATGTGGATGC CGGAAACATT GACGGCAGCA TCCATCGCTG GGCAAAAAAA
GAAGTTATCG ACAACGACCG CGAAGACTGG AAGCGTATCT CGGCATCAAT GCGCAAACAG
CCTGATGCCC TTCGCTACGA CCGCCAAACT ATTTTTGGCC TTGTCCGTGA GCGTCCGATC
GACATTCACA AAGATCCCGT AGCACTGAAC AAATATATCT GCGAATACCT GACGACAAAG
GGCGTGTTTG AGAATGAAGA AACAGACCTG GGCACTGTTG ATGTTCTCCA GTCATCAGAA
ACACAAACTG ATGCAGTGGA AACTGAGGTA TCTGATATCC CAAAAAATGA AACCGCGCCG
GAAGCTGAAC CATCTGTAGA GCGTGAGGGG CCGTTCTATT TCCTCTTCGC AGATAAGGAC
GGAGAAAAAT ACGGTCGCGC AAACAAACTC TCTGGTCTGG ATAAGGCACT GGCTGCTGGC
GCCACTGAAA TCACAAAAGA AGAATATTTT GCCCGAAAAA ATGGCACATA CACGGGCTTA
CCGCAAAATG TAGATACCGC TGAAGATTCA GAACAACCAG AGCCGATAAA AGTTACCGCT
GACGAAGTAA ACAAAATTAT GCAGGCAGCC AATATCAGCC AGCCTGACGC CGATAAGTTG
CTTGCTGCAT CACGTGGTGA ATTTGTTGAA GGGATTAGTG ACCCGAATGA TCCGAAATGG
GTTAAGGGGC TCCAGACCCG CGATTCTGTG AACCAGAACC AGCATGAATC GGAACGGAAC
TACCAAAAAG CGGAACAAAA CAGTCCAAAT GCGTTACAAA ACGAGCCAGA AACGAAACAG
CCTGAACCAG TAGCGCAACA GGAAGTGGAA AAAGTCTGCA CCGCCTGCGG TCAGACCGGC
GGCGGCAACT GCCCTGATTG TGGCGCGGTG ATGGGCGACG CAACATACCA AGAAACATTC
GATGAAGAGT ATCAGGTTGA AGTTCAGGAA GATGATCCGG AGGAAATGGA AGGCGCTGAA
CATCCACACA AGGAGAACAC TGGCGGCAAT CAGCATCACA ATAGCGATAA TGAAACTGGC
GAGACGGCAG ATCACCCAAT TAAGGTGAAC GGTCATCACG AAATCACATC CACCAGCAGG
ACGTGTGACC ATCTAATGAT CGACCTTGAA ACCATGGGAA AAAATCCTGA TGCCCCGATC
ATCTCAATAG GTGCAATATT TTTCGATCCG CAAACCGGAG ATATGGGACC GGAATTTAGT
AAGACTATCG ATCTGGAAAC TGCTGGCGGA GTCATTGATC GGGACACCAT TAAATGGTGG
CTTAAGCAAT CACGCGAAGC GCAATCTGCC ATTATGACCG ATGAAATCCC GTTAGATGAT
GCACTGTTAC AATTGCGAGA ATTTATCGAC GAAAACTCCG GTGAATTTTT TGTTCAGGTT
TGGGGAAATG GAGCCAACTT CGACAACACG ATTTTGCGCC GTTCATACGA ACGGCAGGGG
ATCCCCTGCC CGTGGCGTTA CTACAACGAT CGCGATGTAC GCACAATCGT TGAGCTGGGG
AAAGCCATAG ACTTCGATGC CAGAACGGCT ATTCCATTCG AAGGTGAGCG CCATAATGCA
CTTGATGACG CCCGTTACCA GGCAAAATAC GTTTCAGTTA TCTGGCAAAA ACTGATCCCG
AGTCAGGCTG ATTCTTAA
 
Protein sequence
MSKVFICAAI PDELATREEG AVAVATAIEA GDERRARAKF HWQFLEHYPA AQDCAYQFIV 
CEDKPGIPRP ALDSWDAEYM QENRWDEESA SFVPVETESD PMNVTFDKLA PEVQNAVMVK
FDTCENITVD MVISAQELLQ EDMATFDGHI VEALMKMPEV NAMYPELKLH AIGWVKHKCI
PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLARARA RANQQYTENS TGKISPVIAA
IHREYKQTWK TLDDELAYAL WPSDVDAGNI DGSIHRWAKK EVIDNDREDW KRISASMRKQ
PDALRYDRQT IFGLVRERPI DIHKDPVALN KYICEYLTTK GVFENEETDL GTVDVLQSSE
TQTDAVETEV SDIPKNETAP EAEPSVEREG PFYFLFADKD GEKYGRANKL SGLDKALAAG
ATEITKEEYF ARKNGTYTGL PQNVDTAEDS EQPEPIKVTA DEVNKIMQAA NISQPDADKL
LAASRGEFVE GISDPNDPKW VKGLQTRDSV NQNQHESERN YQKAEQNSPN ALQNEPETKQ
PEPVAQQEVE KVCTACGQTG GGNCPDCGAV MGDATYQETF DEEYQVEVQE DDPEEMEGAE
HPHKENTGGN QHHNSDNETG ETADHPIKVN GHHEITSTSR TCDHLMIDLE TMGKNPDAPI
ISIGAIFFDP QTGDMGPEFS KTIDLETAGG VIDRDTIKWW LKQSREAQSA IMTDEIPLDD
ALLQLREFID ENSGEFFVQV WGNGANFDNT ILRRSYERQG IPCPWRYYND RDVRTIVELG
KAIDFDARTA IPFEGERHNA LDDARYQAKY VSVIWQKLIP SQADS