Gene EcolC_1031 details


Gene Information

Locus tag: EcolC_1031
Symbol:
ID: 6066721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1120678
End bp: 1122783
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 52%
IMG OID: 641600444
Product: ribonucleotide-diphosphate reductase subunit alpha
Protein accession: YP_001724027
Protein GI: 170019073
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0209] Ribonucleotide reductase, alpha subunit
TIGRFAM ID: [TIGR02506] ribonucleoside-diphosphate reductase, alpha subunit
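
The coordinates and lengths listed above are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption about the coordinate convention, though it is the one that reproduces the stated 2106 bp) and that the final codon is a stop codon excluded from the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1120678, 1122783)
print(length_bp)                  # 2106
print(protein_length(length_bp))  # 701
```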


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.578338
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACC ACGCGCTGAA TGCGATGCTT AACCTCTACG ATAGCGCAGG TCGCATTCAG 
TTCGATAAAG ACCGCCAGGC CGTTGACGCC TTTATTGCGA CGCATGTGCG TCCGAACAGT
GTGACCTTCA GTAGCCAGCA GCAGCGCCTG AACTGGCTGG TCAACGAAGG TTACTATGAT
GAAAGCGTTC TTAATCGCTA CTCTCGCGAC TTTGTCATTA CGCTGTTTGC CCACGCACAC
ACCAGCGGTT TTCGTTTCCA GACATTCCTC GGGGCATGGA AGTTTTACAC CAGCTATACG
TTGAAGACAT TCGACGGTAA ACGTTATCTG GAAGATTTTG CCGATCGAGT CACGATGGTG
GCGCTGACGC TGGCACAAGG CGATGAGACG CTGGCGTTGC AACTGACAGA TGAAATGCTG
TCAGGACGCT TTCAGCCAGC CACGCCAACA TTCCTCAACT GCGGTAAGCA GCAGCGCGGC
GAACTGGTTT CCTGTTTTTT GCTGCGTATT GAAGACAATA TGGAGTCGAT TGGTCGGGCG
GTAAATTCCG CACTGCAACT GTCGAAACGC GGCGGCGGCG TAGCATTTTT GCTGTCGAAT
CTGCGAGAAG CGGGCGCGCC AATTAAACGT ATTGAAAATC AATCTTCTGG CGTAATTCCG
GTGATGAAAA TGCTGGAAGA CGCATTTTCC TATGCCAACC AACTCGGCGC TCGTCAGGGG
GCTGGTGCAG TCTATTTACA TGCTCATCAT CCCGATATTC TGCGTTTTCT CGACACGAAA
CGGGAAAATG CCGACGAAAA AATCCGCATT AAAACACTGT CGCTTGGCGT GGTGATCCCG
GATATCACTT TCCATCTGGC AAAAGAGAAT GCGCAGATGG CGCTGTTTTC GCCTTATGAC
GTAGAGCGAG TTTATGGCAA GCCGTTTGCC GATGTGGCCA TCAGCCAACA CTATGACGAA
CTGGTTGCCG ATGAACGCAT TCGCAAAAAA TACCTCAACG CCCGTGATTT CTTCCAGCGA
CTGGCAGAAA TCCAGTTTGA GTCCGGCTAT CCCTACATCA TGTATGAAGA CACGGTAAAC
CGTGCTAACC CTATCGCCGG GCGCATAAAT ATGAGTAATC TCTGCTCAGA AATTTTGCAG
GTTAACAGCG CCTCAGAGTA TGACGAGAAT CTCGACTATA CCCGCACAGG CCATGATATT
TCCTGCAATT TAGGTTCGTT GAATATTGCG CACACCATGG ATTCCCCCGA TTTTGCCCGC
ACGGTAGAGA CTGCCGTGCG CGGTTTAACG GCAGTATCAG ATATGAGTCA TATCCGCAGC
GTGCCGTCCA TCGAAGCCGG AAATGCCGCC TCGCACGCCA TCGGACTGGG GCAGATGAAT
TTACACGGCT ATCTGGCGCG AGAAGGCATC GCTTATGGTT CGCCGGAAGC ACTGGATTTC
ACCAATCTCT ATTTCTATGC CATCACCTGG CATGCACTGC GTACCTCGAT GTTGCTGGCA
CGCGAACGCG GTGAAACCTT CGCCGGGTTC AAACAGTCAC GCTATGCCAG TGGTGAATAT
TTTAGCCAAT ATCTGCAAGG GAACTGGCAG CCGAAAACGG CGAAAGTTGG CGAACTGTTT
ACCCGTAGCG GTATTACGTT ACCTACCCGT GAGATGTGGG CGCAGCTGCG CGACGACGTG
ATGCGCTACG GCATATACAA CCAGAATCTT CAGGCGGTGC CGCCAACCGG TTCTATCTCT
TATATCAACC ATGCTACGTC GAGTATTCAT CCGATTGTGG CGAAAGTAGA GATACGCAAA
GAGGGCAAAA CAGGACGCGT TTACTACCCT GCCCCGTTTA TGACTAACGA GAATCTGGCG
CTGTATCAGG ACGCTTACGA AATTGGCGCA GAAAAGATCA TCGACACCTA CGCGGAAGCG
ACTCGCCATG TCGATCAGGG GCTGTCGCTG ACGCTTTTTT TCCCCGATAC CGCCACCACT
CGCGATATCA ACAAAGCGCA GATTTACGCC TGGCGCAAGG GTATCAAAAC GCTCTATTAC
ATCCGCCTGC GTCAGATGGC GCTGGAAGGC ACTGAAATTG AAGGCTGCGT CTCCTGTGCA
CTTTAA
 
Protein sequence
MDYHALNAML NLYDSAGRIQ FDKDRQAVDA FIATHVRPNS VTFSSQQQRL NWLVNEGYYD 
ESVLNRYSRD FVITLFAHAH TSGFRFQTFL GAWKFYTSYT LKTFDGKRYL EDFADRVTMV
ALTLAQGDET LALQLTDEML SGRFQPATPT FLNCGKQQRG ELVSCFLLRI EDNMESIGRA
VNSALQLSKR GGGVAFLLSN LREAGAPIKR IENQSSGVIP VMKMLEDAFS YANQLGARQG
AGAVYLHAHH PDILRFLDTK RENADEKIRI KTLSLGVVIP DITFHLAKEN AQMALFSPYD
VERVYGKPFA DVAISQHYDE LVADERIRKK YLNARDFFQR LAEIQFESGY PYIMYEDTVN
RANPIAGRIN MSNLCSEILQ VNSASEYDEN LDYTRTGHDI SCNLGSLNIA HTMDSPDFAR
TVETAVRGLT AVSDMSHIRS VPSIEAGNAA SHAIGLGQMN LHGYLAREGI AYGSPEALDF
TNLYFYAITW HALRTSMLLA RERGETFAGF KQSRYASGEY FSQYLQGNWQ PKTAKVGELF
TRSGITLPTR EMWAQLRDDV MRYGIYNQNL QAVPPTGSIS YINHATSSIH PIVAKVEIRK
EGKTGRVYYP APFMTNENLA LYQDAYEIGA EKIIDTYAEA TRHVDQGLSL TLFFPDTATT
RDINKAQIYA WRKGIKTLYY IRLRQMALEG TEIEGCVSCA L
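
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they can be used. The sketch below does that and then translates the first three blocks of the gene sequence, which reproduces the first block of the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean` and `translate` are hypothetical, not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGATTACC ACGCGCTGAA TGCGATGCTT"
print(translate(clean(first_blocks)))  # MDYHALNAML
```

Passing the full gene sequence through the same two functions should yield the complete protein sequence, since the final codon `TAA` is a stop codon.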