Gene EcolC_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1870 
Symbol 
ID6064419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2071819 
End bp2072859 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content41% 
IMG OID641601283 
Producthypothetical protein 
Protein accessionYP_001724845 
Protein GI170019891 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00999711 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000115044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG TTTTATTACA AAACCATCCT GGGAGCGAGA AGTATTCTTT TAATGGCTGG 
GAAATATTTA ATAGTAATTT TGAACGGATG ATTAAAGAAA ATAAGGCCAT GCTGCTTTGT
AAGTGGGGGT TTTATTTAAC ATGTGTTGTC GCTGTAATGT TTGTGTTCGC AGCGATAACA
TCCAACGGTT TGAATGAAAG AGGCCTGATT ACCGCGGGAT GCTCTTTTCT TTATCTATTA
ATTATGATGG GGCTTATTGT TCGGGCCGGT TTTAAAGCAA AAAAAGAACA ACTGCATTAT
TATCAGGCTA AAGGTATTGA GCCGCTCAGT ATCGAGAAGT TACAGGCGCT ACAATTGATC
GCACCTTATC GATTCTATCA TAAGCAATGG TCTGAAACGC TGGAGTTCTG GCCGCGAAAG
CCTGAACCTG GCAAAGATAC CTTCCAATAT CATGTGCTTC CTTTTGACTC GATCGATATC
ATAAGTAAAA GACGCGAGTC TTTAGAGGAT CAATGGGGTA TCGAAGATAG CGAAAGTTAT
TGTGCCTTAA TGGAGCATTT TCTTTCTGGC GACCATGGAG CCAATACCTT TAAAGCAAAC
ATGGAGGAAG CCCCAGAGCA GGTTATCGCC TTGTTGAATA AATTTGCTGT TTTTCCCTCA
GACTATATCT CTGATTGCGC TAATCATAGC TCCGGTAAAT CCTCGGCGAA GCTAATATGG
GCGGCGGAAT TATCATGGAT GATCTCGATA TCAAGCACAG CTTTTCAAAA CGGGACAATT
GAAGAAGAAC TGGCCTGGCA TTATATAATG CTTGCTTCTC GAAAGGCGCA CGAGTTGTTC
GAAAGCGAAG AAGATTATCA AAAAAATAGT CAAATGGGAT TTCTTTACTG GCATATCTGC
TGCTATCGCA GAAAGTTAAC GGATGCTGAA CTTGAAGCGT GTTATCGTTA CGACAAGCAG
TTTTGGGAGC ACTACAGTAA GAAATGCCGT TGGCCCATAA GAAATGTTCC GTGGGGAGCA
TCATCCGTTA AATACTCATA A
 
Protein sequence
MKKVLLQNHP GSEKYSFNGW EIFNSNFERM IKENKAMLLC KWGFYLTCVV AVMFVFAAIT 
SNGLNERGLI TAGCSFLYLL IMMGLIVRAG FKAKKEQLHY YQAKGIEPLS IEKLQALQLI
APYRFYHKQW SETLEFWPRK PEPGKDTFQY HVLPFDSIDI ISKRRESLED QWGIEDSESY
CALMEHFLSG DHGANTFKAN MEEAPEQVIA LLNKFAVFPS DYISDCANHS SGKSSAKLIW
AAELSWMISI SSTAFQNGTI EEELAWHYIM LASRKAHELF ESEEDYQKNS QMGFLYWHIC
CYRRKLTDAE LEACYRYDKQ FWEHYSKKCR WPIRNVPWGA SSVKYS