Gene EcolC_1607 details


Gene Information

Locus tag: EcolC_1607
Symbol:
ID: 6066178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1787581
End bp: 1788684
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 31%
IMG OID: 641601023
Product: hypothetical protein
Protein accession: YP_001724593
Protein GI: 170019639
COG category:
COG ID:
TIGRFAM ID:
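
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated in the record, although both agree with the listed numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1787581
END_BP = 1788684
GENE_LENGTH_BP = 1104
PROTEIN_LENGTH_AA = 367


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the gene length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1104
print(expected_protein_length(GENE_LENGTH_BP))  # 367
```

Under these assumptions both results match the Gene Length and Protein Length fields.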


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.634641
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAT TAAAAACTAA ATCAAGTAAT TGCGGAATTA TAACATTCCA TAAAGCACAT 
AATTATGGAG CTGTTTTACA GGCATATGCG TTAATGACAA CATTAAAAAA AAATGGATTA
CAAGTTAGTT TTATAGATTA TGAGAATGAA GTGCTTTCAA GGGGGTATAG CTTTTATCCT
ATATTAAAGG GAACGTCGAC TATAAATTAT ATTAAGGAAT GGATTCATTT GATTTTAGAC
TTAAAACGTA AATATAAAAG GTTTAAAGCA TTTAGTGATT TTATAAATAA ATATATTGTA
TGTACGCCAT TGAATAAAGA CACTAAATGT TTCGATATTA TCTTTCTTGG TAGTGATCAG
ATATGGAATG CAAATTATAC TAACGGAGTA GACCCTAACT ATTATGGACA AGGACCTTAT
TGTAAAGCGC ACAAATTAGT TTCTTATGCG GCAAGTATGG GTAAGCTGTG TTTGGGAAAA
TATGAGGAAC AAGCATTCTT ATCTTTGATA AATAATATAC AGCAGATTGG TGTTAGAGAA
AATTATTTAA AGACGTACAT AGAAGAAAAA ACCGATTTAA AATGTGATGT TAATCTTGAT
CCGACTCTTC TGCTTACTAA AAGCGATTGG GATAAACTGG CAGCACCTAA CGATACACAA
GAACGCTATC TTCTTATATA TGAAATGCAT ACTCACAGGA GTACGGATAT TATTGCTAAT
AAAATAGCAA AAATACTAAA TTTAAAGATT AAAAAATTAG CATGTCGAAC AAATTACAAA
ATTGAGAAGG ATGTAATAAC AAATGCAGGA CCACAAAATT TCTTAACTTT ATTCAAAAAT
GCAGCTTTCG TGGTGACCAC TTCTTTTCAC GGAACTGTAT TCTCAATTAT AAATCAGGTG
CCATTTTTTA CTTTGGAATT TGGTAACGAG ATAGACTTAA GAAGCCGTTC ACTTCTTGAA
ATGCTTAATT TGAATGAACG AATGATCAGT GACGATGCAA ATTTGAATTA TGAGAAGCTT
TTCTTGGAAT TTGATGAGGC TCATTCAATA TTAGAAAGTA AAAGGCAGGA TTCTTTAAGT
TTCATTGAGA GAGCTCTGAG TTAA
 
Protein sequence
MAKLKTKSSN CGIITFHKAH NYGAVLQAYA LMTTLKKNGL QVSFIDYENE VLSRGYSFYP 
ILKGTSTINY IKEWIHLILD LKRKYKRFKA FSDFINKYIV CTPLNKDTKC FDIIFLGSDQ
IWNANYTNGV DPNYYGQGPY CKAHKLVSYA ASMGKLCLGK YEEQAFLSLI NNIQQIGVRE
NYLKTYIEEK TDLKCDVNLD PTLLLTKSDW DKLAAPNDTQ ERYLLIYEMH THRSTDIIAN
KIAKILNLKI KKLACRTNYK IEKDVITNAG PQNFLTLFKN AAFVVTTSFH GTVFSIINQV
PFFTLEFGNE IDLRSRSLLE MLNLNERMIS DDANLNYEKL FLEFDEAHSI LESKRQDSLS
FIERALS
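
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a rough sketch rather than the method used to produce the record: it builds the codon-to-amino-acid mapping by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sequence is assumed to be pasted in the 10-base blocks shown above.

```python
# Codon table laid out in the conventional TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCTAAAT TAAAAACTAA ATCAAGTAAT"))  # MAKLKTKSSN
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare against the protein sequence and the GC content field.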