Gene EcolC_2069 details


Gene Information

Locus tag: EcolC_2069
Symbol:
ID: 6067550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2274292
End bp: 2275605
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 48%
IMG OID: 641601477
Product: integrase catalytic region
Protein accession: YP_001725036
Protein GI: 170020082
COG category:
COG ID:
TIGRFAM ID:
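
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1314 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 2274292
END_BP = 2275605


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1314
print(protein_length(length_bp))  # 437
```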


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAAG AGACTGTTAC GATGAGTCAT AAGGAACTCC ACCGACTTCA GATTATTCAG 
GAACAAGCTG CGGCACGCAT TGGCATTTCT ATTCGGCAGG TTAAACGTCT GGTGCAACGG
TATAGAAATG AAGGGCCTTC TGGTCTGGTT TCCCACCGAC GTGGAAAGCG TCCTAATAAT
TCCTTTTCTA CTGAATTCAG AGCAACAGTA ATTTCACTCC TCAAAGGCCG TTACGCTGAT
TTTGGACCTA CGTTTGCGTG CGAAAAATTG CGCGAGATAC ACGGTTTATC TTTATCCGTT
GAAACTCTCA GAAAGTGGAT GATAGAAGAG GGGTTATGGC GTGAACGCCG TCGTAAAATT
GCCCGTATAT ATCAACGCCG CATGCGACGA CCATCTTACG GTGAACTGAT CCAGGTTGAT
GGCTCACCTC ATGACTGGTT TGAAAATCGA GGCCCCAGAT GTACACTGAT CGTTTTCATT
GATGATGCCA CCAGTGCGTT GATGGCGTTG CGTTTTGTGC CTGCTGAAAC AACCCGGGCT
TACATGGAAA CCCTCCGGGG TTACCTTAAT GATCATGGCG TACCGCTCGC TCTCTACTCT
GATAGACACA GTATATTCAG GGTAAATAAC CCAGAGCGGG AAGGTGAGCT GACCCAGTTC
ACTCGTGCGA TAAAGACACT GGGCATCGAG CCAATCCATG CCAACAGCCC GCAGGCAAAA
GGGCGGGTAG AGCGCGCCAA TCAGACACTA CAGGACAGGC TGGTCAAAGA AATGCGGCTT
CAGAATATCA GTGATATTGA AACAGCAAAT GCATGGTTGC CGACCTTTAT TGAAGCCTAT
AACAACCGGT TCGCTACGTC GCCTCGTACT ACTGATAATG CTCATCTTGA TGTGCACCAT
TCTGAAGAGG AACTGGGTTA TATCTTCAGC CTACAGGCGA AGCGCGTTCT GTCTAAAAAT
CTCACTTTCC AGTACAAAAG CAGTGCGTTT CAGGTACGCA GTGAGGGCCG GGGATATCGA
CTTAGGCATT CGGTTGTTAC TGTATGCGAG AACTTTGACG GTGAAATTAA CGTTCTGTAT
GACGGGAAAG CGCTGGGCTG GGAAAAGTAT GTTGATGGCC CGGAGCCTAT ACCACTGGAT
GATGAAAAGA GTGTCCATGA ACGAGTGGAT AATGCCCGTA TTGATTTACG CTCAAAATAC
TATGTTAAAC CTAAAGCTGA CCATCCCTGG CTTACGCGCC GAACGCAAAG TCATCAGCAA
GTTAAGCCCC CGAAGTTACC TAAAAAGAAG CCTGATCCCG ATAAAAAAGA TTGA
 
Protein sequence
MIKETVTMSH KELHRLQIIQ EQAAARIGIS IRQVKRLVQR YRNEGPSGLV SHRRGKRPNN 
SFSTEFRATV ISLLKGRYAD FGPTFACEKL REIHGLSLSV ETLRKWMIEE GLWRERRRKI
ARIYQRRMRR PSYGELIQVD GSPHDWFENR GPRCTLIVFI DDATSALMAL RFVPAETTRA
YMETLRGYLN DHGVPLALYS DRHSIFRVNN PEREGELTQF TRAIKTLGIE PIHANSPQAK
GRVERANQTL QDRLVKEMRL QNISDIETAN AWLPTFIEAY NNRFATSPRT TDNAHLDVHH
SEEELGYIFS LQAKRVLSKN LTFQYKSSAF QVRSEGRGYR LRHSVVTVCE NFDGEINVLY
DGKALGWEKY VDGPEPIPLD DEKSVHERVD NARIDLRSKY YVKPKADHPW LTRRTQSHQQ
VKPPKLPKKK PDPDKKD
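
As a minimal sketch of how the protein sequence relates to the gene sequence, the code below translates a DNA string and computes its GC fraction. It uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 differs mainly in permitting alternative start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and only the first line of the gene sequence is pasted in to keep the example short.

```python
# Compact codon table in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGATCAAAG AGACTGTTAC GATGAGTCAT AAGGAACTCC ACCGACTTCA GATTATTCAG"
print(translate(first_line))  # MIKETVTMSHKELHRLQIIQ
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content listed in this record.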