Gene EcolC_2016 details


Gene Information

Locus tag: EcolC_2016
Symbol:
ID: 6068021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2225132
End bp: 2226640
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 49%
IMG OID: 641601428
Product: hypothetical protein
Protein accession: YP_001724987
Protein GI: 170020033
COG category: [S] Function unknown
COG ID: [COG5339] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
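
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length does not count the stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2225132, 2226640)
print(length_bp, protein_length(length_bp))  # 1509 502
```

Under those assumptions the values match the Gene Length (1509 bp) and Protein Length (502 aa) fields.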


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.235486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.970517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC 
GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG
AACGCGCAAC TCAAACTGAC AGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT
CATCGCGGCG TATTCAGCAG CCAGTTGCAA CTGTTGGTGA AACCCATTGC CGGGAAAGAA
AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC
TTCCCGCTTG CCCAGCTTAA AAAACTGAAC CTGATCCCGT CGATGGCATC AATTCAAACC
ACGCTGGTTA ATAACGAAGT AAGCAAACCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT
TTTGAGATTA ACTCGCGCAT TGGTTACAGC GGTGATTCCA GTTCCGATAT TTCGCTCAAG
CCACTGAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA
AATGCTGACA GAGACGGCAA AGCCATCTCC CTTTCCGGGG AGGCGCAAAG TGGTCGGATA
GACGCAGTTA ACGAATACAA CCAGAAAGTG CAGTTGACCT TTAATAATCT GAAAACCGAC
GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGAA
AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC
GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA
AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGCGGCA AGCTGACTTT AAAAGTCGGC
CAGATTGATG GTGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG
CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA
GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG
CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT
CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGGCGCAGG AAGTAGATCG TTCGGTTAAA
TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTTAT GACTCAGGTA
GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA
GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC
ACCAGCCTGC AATATACTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA
GATTTCGTTG GTATGTTTGC AATGCCGGCA TTAAATGTTC CGGTCGTACC CGCTATTCCG
CAGCAGTAA
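
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence can be measured. The following is a minimal sketch in Python of one way to do that and to compute a GC percentage. The helper names are hypothetical, and the database's own method for deriving its GC content field (including any rounding) is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, used here as a short example.
first_line = "ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a figure that can be compared with the GC content field.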
 
Protein sequence
MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY 
HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT
TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE
KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP
LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATEFMTQV
AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMPLE
DFVGMFAMPA LNVPVVPAIP QQ
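
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch in Python, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle. That is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGAATAAAT CGCTGGTAGC GGTAGGCGTC"))  # MNKSLVAVGV
print(translate("CAGCAGTAA"))  # QQ
```

The two results match the first block and the final two residues of the protein sequence.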