Gene EcolC_2766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2766 
Symbol 
ID6064812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3036658 
End bp3038169 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content33% 
IMG OID641602172 
Producthypothetical protein 
Protein accessionYP_001725721 
Protein GI170020767 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.405053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000336411 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCCCCA AAATTGGTGA TTTTGAATTT GGTGAGGTTT ATGGAGAGAA TGAGGTATTA 
TTTCTCGATA ATTACTCGAA ATATTTTTAT GATATAAATA ATTCATTAAG TAAACTTGAT
AGAAAAAATA AAATGCTCGT AATTGGCAGG AAGGGGACGG GTAAGACATT GCTTGTTAAT
GTCTACTGTA ACGCCAAGCG AAAAAATAAT TATATTGCAG TTGTGGAATC ATTAAAAGAT
TTTGTTTTTC ATGAACTTAC TCACTTTCAG GGGCAGGATG TGTCTTCCAC AAAATATGTA
CCAATTTTCA AGTGGATGAT ATTGGTCAAT CTTGCTAAGA ATATTGTAAG TAATAAAAAA
GGATTTAGTG AGGATAAAAT AGTTCATTTG GAAAGTTTTT TACGTTCTTT TGGCCATGTC
GCGGGCGAGT TAAGGCCTGA GCAAACAGTT GAAATAACTA GGGAGTACCA AGCGTCGGGT
GAGGTTGGTA TCGGATTTAG ATTCCCAGTA TTACGTGGTG AAGCCAAAGC AAAAGATGGT
GAGGTCGAGA AAACAAAAGA AACAAAAAAG AATTATTTGG AATGTATGGA ATCTTTGCAG
TGTTTTATTG TTGACATGCT AAAAGAGAGT AATAAAAAAA TATATGTATT TTATGATGAG
TTAGATGATA AGTTTGATGC AACAGTTGAA TATAAAAATG CAATGATAAG TTTTTTAAAT
GCTGTTGTGT CAATTAATAA AACTCTAATG CAAAATAAAA TAGATGCTAA AATTGGTGCA
GTTATTCGTC ATGATATAAT AAATACTTTT TCATCGCCGA ATATTAATAA AATCATTGAA
GATAACTCTG TTACACTTGA TTGGTGTTCT GCTGGAGAAC GAGCAAGTGA TTCTGAGATT
TTTGATATGA TTGCTTTTAA GATCAAAAAC TCCACTGATT ATTATAATGA TTTAAATGGT
TCTAATTTGT TCGGGAAAAT TTTCACTGAG AGAGTTGCAG GTGAGCATAG CTCTATTTAT
ATTTTACATA GAACTCTAGG TCGCCCAAGA GATGCCGTTA GGATGCTAAC TTATATTCAA
GATGAATATG GAGAAAATAC TGAAAGATTT GAAAGTTCCA TGTTTACAAA GATTTCTAAA
AAATACTCAT CTTATCTTTT ACGTGAAATT AGATCTGAGC TTGCGGGACA TTTAAGTGAT
TCAGAAATAG ATGACAGTTT TTCTCTTTTA CGTTCATTAA AAAAAAGAGG TTTCACTCCA
CATTTAATCA AGGAAAAATT TGAAGAATTG AAGTTAGGAG ATGGTACATT AACGCTTAAT
AAAATACTGA GTTGTTTATT TAAAGTCGGT GCTATCGGGA ATGTACTCAG GAGATCTAAA
ACAGATGGCG GAGATGTTTA TTTGTGGTCA TTTAATGATG AAGATTTAGA AATGGACCCA
ACGTTGAATT TTGAAATACA CTGTGGTTTG TGGGATGCAC TAGGAATTAT CAAGCCTAAA
CTTAGGCAAT AA
 
Protein sequence
MRPKIGDFEF GEVYGENEVL FLDNYSKYFY DINNSLSKLD RKNKMLVIGR KGTGKTLLVN 
VYCNAKRKNN YIAVVESLKD FVFHELTHFQ GQDVSSTKYV PIFKWMILVN LAKNIVSNKK
GFSEDKIVHL ESFLRSFGHV AGELRPEQTV EITREYQASG EVGIGFRFPV LRGEAKAKDG
EVEKTKETKK NYLECMESLQ CFIVDMLKES NKKIYVFYDE LDDKFDATVE YKNAMISFLN
AVVSINKTLM QNKIDAKIGA VIRHDIINTF SSPNINKIIE DNSVTLDWCS AGERASDSEI
FDMIAFKIKN STDYYNDLNG SNLFGKIFTE RVAGEHSSIY ILHRTLGRPR DAVRMLTYIQ
DEYGENTERF ESSMFTKISK KYSSYLLREI RSELAGHLSD SEIDDSFSLL RSLKKRGFTP
HLIKEKFEEL KLGDGTLTLN KILSCLFKVG AIGNVLRRSK TDGGDVYLWS FNDEDLEMDP
TLNFEIHCGL WDALGIIKPK LRQ