Gene EcolC_2891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2891 
Symbol 
ID6065339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3149803 
End bp3152064 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content51% 
IMG OID641602296 
Producthypothetical protein 
Protein accessionYP_001725845 
Protein GI170020891 
COG category[C] Energy production and conversion 
COG ID[COG1048] Aconitase A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0493763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGT TATCTGAAAA AGGCGTGTTT CTCGCCAGTA ATAACGAAAT AATTGCCGAA 
GAACATTTCA CCCGCGAAAT TAAAAAAGAA GAAGCCAAAA AAGGCACTAT TGCCTGGTCT
ATTCTCTCTT CTCATAATAC GTCCGGAAAT ATGGATAAAC TTAAAATTAA GTTTGATTCA
TTAGCCTCTC ACGATATTAC CTTTGTTGGT ATTGTACAGA CCGCTAAAGC GTCCGGAATG
GAACGTTTCC CGCTGCCGTA TGTGCTGACC AACTGCCATA ACTCACTCTG CGCCGTCGGC
GGCACCATTA ACGGTGATGA CCATGTTTTT GGTTTATCGG CAGCTCAGCG TTATGGCGGT
ATTTTTGTGC CTCCGCATAT TGCGGTCATC CATCAATATA TGCGTGAGAT GATGGCAGGC
GGCGGCAAAA TGATCCTCGG GTCAGACAGC CACACCCGTT ACGGTGCATT AGGGACAATG
GCAGTCGGTG AGGGCGGCGG TGAGTTGGTA AAACAGCTGC TTAATGACAC CTGGGATATC
GACTATCCGG GAGTTGTTGC GGTGCATCTG ACCGGAAAAC CAGCGCCGTA TGTGGGGCCG
CAGGATGTGG CGCTGGCTAT CATCGGTGCC GTGTTCAAAA ACGGCTACGT CAAAAACAAA
GTGATGGAAT TCGTAGGTCC CGGTGTTGCT GCGCTCTCTA CCGATTTCCG TAACAGCGTT
GACGTTATGA CCACTGAAAC GACCTGTTTA AGTTCTGTCT GGCAAACCGA TGAAGAAGTC
CATAACTGGC TGGCGCTGCA CGGTCGCGGC CAGGATTACT GCCAGCTTAA CCCTCAACCG
ATGGCGTACT ACGATGGCTG CATCAGCGTT GATTTAAGCG CCATCAAACC AATGATTGCG
CTGCCGTTCC ACCCGAGCAA CGTGTATGAA ATCGACACAC TGAACCAGAA CCTGACCGAC
ATTCTGCGTG AGATTGAAAT TGAGTCCGAA CGCGTGGCGC ACGGTAAAGC CAAACTCTCG
CTGCTGGATA AAGTGGAAAA TGGTCGCCTG AAAGTGCAGC AGGGGATTAT CGCGGGCTGT
TCTGGCGGTA ACTACGAAAA CGTCATCGCG GCGGCGAATG CACTGCGCGG TCAATCCTGT
GGCAATGACA CCTTCTCGCT GGCAGTTTAC CCGTCATCAC AGCCGGTGTT TATGGATCTC
GCCAAAAAAG GTGTGGTAGC AGATTTGATT GGCGCAGGCG CAATCATCAG AACCGCGTTC
TGCGGCCCAT GCTTTGGCGC GGGCGATACG CCAATCAACA ACGGATTAAG TATTCGCCAC
ACCACGCGCA ACTTCCCGAA CCGCGAAGGC TCTAAGCCAG CTAATGGGCA GATGTCAGCG
GTGGCGTTGA TGGACGCTCG TTCTATCGCT GCGACTGCGG CAAACGGTGG CTATTTAACC
TCTGCCAGCG AACTTGATTG CTGGGACAAC GTGCCGGAGT ACGCCTTCGA TGTAACGCCG
TATAAAAACC GTGTTTATCA GGGCTTTGTG AAAGGGGCAA CTCAGCAACC GCTGATTTAC
GGACCGAACA TTAAAGACTG GCCGGAATTG GGTGCGCTGA CTGACAATAT CGTCCTGAAA
GTGTGCTCGA AGATCCTCGA CGAAGTGACC ACCACCGACG AACTGATTCC TTCCGGTGAA
ACCTCTTCTT ATCGTTCAAA TCCGATTGGT CTGGCGGAGT TTACCCTGTC ACGCCGCGAT
CCCGGTTATG TTGGCAGAAG TAAAGCGACT GCTGAGCTGG AAAATCAGCG TCTGGCGGGG
AATGTCAGCG AGCTGACAGA GGTGTTTGCG CGCATTAAGC AGATTGCTGG TCAGGAGCAT
ATTGATCCGC TGCAAACTGA AATTGGCAGC ATGGTATATG CGGTGAAACC AGGCGATGGT
TCTGCGCGTG AACAGGCGGC GAGCTGCCAG CGTGTGATTG GCGGTCTGGC GAATATTGCC
GAAGAGTACG CGACTAAACG CTACCGTTCT AACGTCATCA ACTGGGGGAT GTTACCGCTG
CAGATGGCGG AAGTGCCAAC CTTTGAAGTG GGGGATTACA TTTACATCCC TGGCATTAAA
GCGGCGCTGG ATAATCCGGG TACGACGTTT AAAGGTTATG TGATCCATGA AGATGCGCCG
GTAACGGAAA TTACGCTCTA TATGGAAAGT CTGACTGCTG AAGAGCGCGA GATTATCAAG
GCGGGTAGTT TGATTAACTT CAATAAAAAC CGTCAGATGT AA
 
Protein sequence
MIKLSEKGVF LASNNEIIAE EHFTREIKKE EAKKGTIAWS ILSSHNTSGN MDKLKIKFDS 
LASHDITFVG IVQTAKASGM ERFPLPYVLT NCHNSLCAVG GTINGDDHVF GLSAAQRYGG
IFVPPHIAVI HQYMREMMAG GGKMILGSDS HTRYGALGTM AVGEGGGELV KQLLNDTWDI
DYPGVVAVHL TGKPAPYVGP QDVALAIIGA VFKNGYVKNK VMEFVGPGVA ALSTDFRNSV
DVMTTETTCL SSVWQTDEEV HNWLALHGRG QDYCQLNPQP MAYYDGCISV DLSAIKPMIA
LPFHPSNVYE IDTLNQNLTD ILREIEIESE RVAHGKAKLS LLDKVENGRL KVQQGIIAGC
SGGNYENVIA AANALRGQSC GNDTFSLAVY PSSQPVFMDL AKKGVVADLI GAGAIIRTAF
CGPCFGAGDT PINNGLSIRH TTRNFPNREG SKPANGQMSA VALMDARSIA ATAANGGYLT
SASELDCWDN VPEYAFDVTP YKNRVYQGFV KGATQQPLIY GPNIKDWPEL GALTDNIVLK
VCSKILDEVT TTDELIPSGE TSSYRSNPIG LAEFTLSRRD PGYVGRSKAT AELENQRLAG
NVSELTEVFA RIKQIAGQEH IDPLQTEIGS MVYAVKPGDG SAREQAASCQ RVIGGLANIA
EEYATKRYRS NVINWGMLPL QMAEVPTFEV GDYIYIPGIK AALDNPGTTF KGYVIHEDAP
VTEITLYMES LTAEEREIIK AGSLINFNKN RQM