Gene EcolC_3667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3667 
Symbol 
ID6066137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4016602 
End bp4017984 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content58% 
IMG OID641603082 
ProductDNA repair protein RadA 
Protein accessionYP_001726605 
Protein GI170021651 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.696085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAAAAG CTCCAAAACG CGCCTTTGTT TGTAATGAAT GCGGGGCCGA TTATCCGCGC 
TGGCAGGGGC AGTGCAGTGC CTGTCATGCC TGGAACACCA TCACCGAGGT GCGTCTTGCT
GCGTCGCCAA CGGTGGCGCG TAACGAGCGT CTCAGCGGCT ATGCCGGTAG CGCCGGGGTG
GCAAAAGTCC AGAAACTCTC CGATATCAGC CTTGAAGAGC TGCCGCGTTT TTCCACCGGA
TTTAAAGAGT TCGACCGCGT ACTAGGCGGC GGCGTGGTGC CAGGAAGTGC CATTCTGATT
GGCGGTAACC CTGGTGCGGG GAAATCCACG CTGTTGCTGC AAACGCTGTG CAAACTGGCC
CAGCAGATGA AAACGCTGTA TGTCACCGGC GAAGAGTCGC TGCAACAGGT GGCAATGCGC
GCTCATCGCC TTGGCCTGCC GACTGACAAT CTCAATATGT TGTCGGAAAC CAGCATCGAG
CAGATCTGCC TGATTGCCGA AGAAGAGCAA CCGAAGCTGA TGGTAATTGA CTCGATCCAG
GTGATGCATA TGGCGGATGT ACAGTCATCG CCTGGCAGCG TGGCGCAGGT GCGTGAAACG
GCGGCTTATC TGACGCGCTT CGCCAAAACG CGCGGTGTGG CGATTGTCAT GGTTGGGCAC
GTAACCAAAG ATGGCTCGCT GGCTGGCCCT AAAGTGCTGG AACACTGTAT CGACTGTTCG
GTGCTTCTGG ATGGTGATGC CGACTCCCGT TTTCGCACCT TGCGCAGCCA TAAAAACCGC
TTCGGCGCGG TGAATGAGCT GGGCGTCTTC GCGATGACCG AACAGGGGCT GCGTGAAGTC
AGCAACCCTT CGGCAATTTT CTTAAGTCGC GGAGATGAAG TGACCTCCGG CAGCTCCGTG
ATGGTGGTGT GGGAAGGAAC GCGTCCGTTG CTGGTGGAGA TTCAGGCGCT GGTCGATCAC
TCGATGATGG CGAATCCGCG CCGCGTGGCA GTGGGGCTGG AACAAAACCG TCTGGCAATC
CTGCTGGCTG TCTTGCACCG TCACGGTGGT CTGCAAATGG CCGATCAGGA TGTATTTGTG
AACGTGGTCG GCGGCGTGAA GGTAACCGAA ACCAGTGCCG ATTTAGCTTT ACTGCTGGCG
ATGGTTTCCA GCCTGCGTGA CAGACCGCTG CCACAGGATC TGGTGGTGTT TGGTGAAGTC
GGGCTGGCAG GGGAGATCCG CCCGGTGCCC AGCGGTCAGG AACGAATTTC AGAAGCGGCG
AAACACGGTT TTCGCCGGGC GATTGTTCCG GCGGCTAACG TACCGAAAAA AGCGCCGGAA
GGGATGCAGA TTTTTGGCGT TAAAAAACTC TCCGACGCGC TTAGCGTGTT CGACGACTTA
TAA
 
Protein sequence
MAKAPKRAFV CNECGADYPR WQGQCSACHA WNTITEVRLA ASPTVARNER LSGYAGSAGV 
AKVQKLSDIS LEELPRFSTG FKEFDRVLGG GVVPGSAILI GGNPGAGKST LLLQTLCKLA
QQMKTLYVTG EESLQQVAMR AHRLGLPTDN LNMLSETSIE QICLIAEEEQ PKLMVIDSIQ
VMHMADVQSS PGSVAQVRET AAYLTRFAKT RGVAIVMVGH VTKDGSLAGP KVLEHCIDCS
VLLDGDADSR FRTLRSHKNR FGAVNELGVF AMTEQGLREV SNPSAIFLSR GDEVTSGSSV
MVVWEGTRPL LVEIQALVDH SMMANPRRVA VGLEQNRLAI LLAVLHRHGG LQMADQDVFV
NVVGGVKVTE TSADLALLLA MVSSLRDRPL PQDLVVFGEV GLAGEIRPVP SGQERISEAA
KHGFRRAIVP AANVPKKAPE GMQIFGVKKL SDALSVFDDL