Gene EcolC_0588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0588 
Symbol 
ID6066210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp633909 
End bp635219 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content57% 
IMG OID641599994 
Producthypothetical protein 
Protein accessionYP_001723591 
Protein GI170018637 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA 
GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT
GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG
AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG
GCGCTGGGGG CGTTAGGTGG AAATGCCAAC GCCGGGCTGG AAGTGCTGAA AGACGCAACT
GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC CGTTAAGATC
CAGGAACCTT GCAATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG
GCGTGTGTCA CCATCGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACGATGGT
GTGGTGTTTA CCCAGCAGGC GTGTGTGGCA GAGGGCGAGC AAGAGTCTCC GCTGACGGTG
CTTTCCAGAA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG
ATCCGCTTTA TTCTCGATTC TGCGAAGCTA AATTGTGCGT TATCGCAGGA AGGTTTGAGC
GGTAAGTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGCGCG CGGTTTGCTG
GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG
GGCGGCGCTA CGCTTCCGGC AATGAGTAAC TCTGGCTCGG GTAACCAGGG GATCACCGCA
ACAATGCCTG TGGTGGTGGT AGCAGAACAC TTCGGAGCGG ATGATGAACG GCTGGCGCGT
GCGCTGATGC TTTCGCATTT GAGCGCAATT TACATCCATA ACCAGTTACC GCGTTTGTCT
GCACTTTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG
GATGGGCGTT ATGAAACTAT CTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC
ATGATTTGCG ATGGAGCGTC GAACAGCTGT GCGATGAAGG TTTCGACCAG TGCTTCGGCT
GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCCG TGACCGGCAA TGAAGGGATC
GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG
CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
 
Protein sequence
MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM 
KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI
QEPCNEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHDG VVFTQQACVA EGEQESPLTV
LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GKWGLHIGAT LEKQCARGLL
AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR
ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG
MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM
QQTDRQIIEI MASKAR