Gene EcolC_1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1437 
Symbol 
ID6067560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1586142 
End bp1587206 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID641600856 
ProductAraC family transcriptional regulator 
Protein accessionYP_001724427 
Protein GI170019473 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0350] Methylated DNA-protein cysteine methyltransferase
[COG2169] Adenosine deaminase 
TIGRFAM ID[TIGR00589] O-6-methylguanine DNA methyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.195699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CCACATGCTT AACTGACGAT CAACGCTGGC AATCTGTCTT AGCCCGCGAC 
CCGAATGTCG ACGGCGAATT CGTTTTCGCC GTGCGTACCA CAGGCATCTT TTGCCGTCCG
TCTTGCCGCG CCAGACATGC TTTGCGGGAA AACGTCTCCT TCTACGCAAA TGCCAGCGAG
GCACTCGCCG CTGGTTTTCG CCCCTGCAAA CGTTGTCAGC CAGAAAAAGC CAATGCCCAG
CAACATCGGT TGGATAAAAT CACCCACGCG TGTCGACTGC TGGAACAGGA AACGCCTGTA
ACGCTGGAAG CCTTAGCCGA CCAGGTGGCG ATGAGTCCAT TTCATCTGCA TCGGTTGTTT
AAAGCGACTA CCGGAATGAC GCCTAAAGCC TGGCAACAGG CCTGGCGCGC TCGCCGTTTG
CGCGAATCGC TGGCGAAAGG GGAGAGCGTG ACGACGTCTA TTCTTAACGC CGGATTCCCC
GACAGCAGCA GTTACTATCG CAAAGCTGAC GAAACGCTGG GCATGACGGC TAAACAATTC
CGTCACGGTG GCGAAAATCT GGCGGTGCGT TACGCGCTGG CTGATTGTGA GCTGGGTCGT
TGCCTGGTGG CAGAAAGCGA GCGGGGGATT TGCGCGATAT TGCTGGGCGA TGATGACGCG
ACACTAATCA GCGAGTTGCA GCAGATGTTT CCCGCTGCCG ACAACGCGCC TGCCGATCTG
ATGTTTCAGC AACATGTGCG TGAAGTGATC GCCAGCCTCA ATCAACGCGA TACGCCGCTG
ACGTTACCGC TGGACATTCG CGGCACTGCT TTTCAGCAAC AAGTCTGGCA GGCACTGCGC
ACGATACCTT GCGGTGAAAC CGTCAGTTAT CAGCAACTGG CTAACGCCAT CGGCAAACCG
AAAGCGGTAC GGGCCGTTGC CAGCGCCTGT GCCGCCAACA AGCTGGCTAT CATAATACCC
TGTCATCGGG TGGTCCGTGG TGATGGCACA CTTTCCGGTT ACCGCTGGGG CGTGTCGCGT
AAAGCGCAAC TGCTGCGCCG CGAAGCTGAA AATGAGGAGA GGTAA
 
Protein sequence
MKKATCLTDD QRWQSVLARD PNVDGEFVFA VRTTGIFCRP SCRARHALRE NVSFYANASE 
ALAAGFRPCK RCQPEKANAQ QHRLDKITHA CRLLEQETPV TLEALADQVA MSPFHLHRLF
KATTGMTPKA WQQAWRARRL RESLAKGESV TTSILNAGFP DSSSYYRKAD ETLGMTAKQF
RHGGENLAVR YALADCELGR CLVAESERGI CAILLGDDDA TLISELQQMF PAADNAPADL
MFQQHVREVI ASLNQRDTPL TLPLDIRGTA FQQQVWQALR TIPCGETVSY QQLANAIGKP
KAVRAVASAC AANKLAIIIP CHRVVRGDGT LSGYRWGVSR KAQLLRREAE NEER