Gene EcolC_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4001 
Symbol 
ID6064552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4394179 
End bp4396275 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content54% 
IMG OID641603412 
Productprotein of unknown function DUF940 membrane lipoprotein putative 
Protein accessionYP_001726927 
Protein GI170021973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.501947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GACATCTGCT TAGCTTACTG GCGCTGGGCA TTAGCATGGC TTGCTACGGC 
GAAACATATC CTGCGCCCAT TGGCCCGTCG CAGTCGGATT TCGGTGGCGT AGGATTATTA
CAAACGCCCA CCGCACGCAT GGCGCGGGAA GGGGAGTTGA GCCTGAACTA TCGCGATAAC
GATCAGTACC GTTATTACTC AGCTTCAGTG CAACTCTTCC CGTGGCTGGA AACAACGCTG
CGCTACACCG ACGTGCGCAC CCGGCAGTAC AGCAGCGTCG AAGCGTTCTC TGGCGATCAA
ACGTATAAAG ATAAAGCCTT CGATCTCAAA CTGCGTTTGT GGGAAGAGAG TTACTGGCTG
CCGCAAGTGG CGGTTGGTGC GCGGGATATC GGCGGTACGG GGCTGTTTGA TGCGGAATAT
CTTGTTGCCA GCAAAGCCTG GGGGCCGTTC GATTTTACGC TCGGCCTGGG CTGGGGATAT
TTGGGCACCA GCGGTAATGT GAAAAATCCG CTCTGTTCAG CCAGTGATAA ATATTGCTAT
CGCGATAACA GCTACAAACA GGCGGGATCT ATCGACGGTA GCCAGATGTT CCACGGTCCT
GCCTCACTGT TTGGCGGCGT GGAATACCAG ACGCCCTGGC AACCGCTGCG CCTGAAACTG
GAGTATGAAG GCAATAATTA TCAGCAGGAT TTTGCCGGGA AGCTGGAGCA AAAAAGTAAG
TTTAACGTCG GTGCGATTTA TCGCGTTACC GATTGGGCCG ACGTTAACCT TAGCTATGAA
CGTGGCAACA CCTTTATGTT TGGCGTTACG TTGCGCACCA ACTTTAACGA TCTGCGCCCG
TCTTACAACG ATAACGCCCG CCCGCAATAT CAACCGCAGC CGCAGGATGC CATTTTGCAG
CATTCGGTGG TGGCGAATCA GTTAACGCTG TTGAAATACA ATGCTGGACT TGCCGATCCA
CAGATCCAGG CGAAAGGCGA TACGCTGTAT GTTACCGGCG AGCAGGTGAA ATATCGTGAT
TCGCGCGAAG GGATCATCCG TGCGAATCGG ATCGTGATGA ACGATCTGCC GGATGGGATC
AAAACGATCC GCATTACGGA AAATCGCCTT AACATGCCGC AGGTGACGAC GGAAACCGAT
GTCGCCAGCC TGAAAAATCA TCTCGCCGGA GAGCCGTTGG GCCACGAAAC GACGCTGGCG
CAAAAACGCG TCGAGCCAGT GGTTCCGCAG TCCACCGAGC AGGGCTGGTA TATCGACAAA
TCACGCTTTG ATTTCCATAT CGATCCGGTG CTGAACCAGT CGGTCGGTGG CCCGGAAAAC
TTTTACATGT ATCAGCTGGG CGTGATGGGA ACGGCAGATT TGTGGCTGAC GGACCATCTG
CTGACCACCG GCAGCCTGTT TGCAAATCTT GCCAACAACT ACGACAAGTT TAACTACACT
AATCCTCCGC AGGACTCGCA CTTACCGCGC GTGCGTACCC ATGTGCGCGA GTATGTGCAG
AACGATGTCT ATGTGAATAA CCTGCAAGCC AACTACTTCC AGCATCTGGG CAACGGCTTC
TACGGTCAGG TCTACGGTGG TTATCTCGAA ACCATGTTTG GCGGTGCGGG GGCAGAAGTG
TTGTATCGCC CGCTGGACAG CAACTGGGCG TTTGGTCTGG ATGCCAACTA CGTTAAACAG
CGCGACTGGC GTAGTGCAAA AGATATGATG AAATTCACCG ACTACAGCGT GAAAACCGGA
CATCTGACCG CCTACTGGAC GCCATCTTTC GCTCAGGATG TGTTAGTTAA AGCCAGCGTC
GGGCAGTATC TGGCAGGGGA TAAAGGCGGC ACGCTGGAGA TCGCCAAACG CTTTGATAGC
GGCGTGGTGG TGGGTGGCTA TGCCACGATC ACTAATGTTT CGAAAGAGGA GTACGGCGAA
GGGGACTTCA CCAAAGGCGT GTATGTCTCG GTACCGTTGG ATCTCTTCTC GTCTGGCCCG
ACACGCAGCC GTGCGGCGAT TGGCTGGACG CCGCTGACGC GTGACGGTGG TCAGCAACTT
GGGCGTAAGT TCCAGTTGTA TGACATGACC AGCGACCGTA GCGTCAATTT CCGCTAA
 
Protein sequence
MKKRHLLSLL ALGISMACYG ETYPAPIGPS QSDFGGVGLL QTPTARMARE GELSLNYRDN 
DQYRYYSASV QLFPWLETTL RYTDVRTRQY SSVEAFSGDQ TYKDKAFDLK LRLWEESYWL
PQVAVGARDI GGTGLFDAEY LVASKAWGPF DFTLGLGWGY LGTSGNVKNP LCSASDKYCY
RDNSYKQAGS IDGSQMFHGP ASLFGGVEYQ TPWQPLRLKL EYEGNNYQQD FAGKLEQKSK
FNVGAIYRVT DWADVNLSYE RGNTFMFGVT LRTNFNDLRP SYNDNARPQY QPQPQDAILQ
HSVVANQLTL LKYNAGLADP QIQAKGDTLY VTGEQVKYRD SREGIIRANR IVMNDLPDGI
KTIRITENRL NMPQVTTETD VASLKNHLAG EPLGHETTLA QKRVEPVVPQ STEQGWYIDK
SRFDFHIDPV LNQSVGGPEN FYMYQLGVMG TADLWLTDHL LTTGSLFANL ANNYDKFNYT
NPPQDSHLPR VRTHVREYVQ NDVYVNNLQA NYFQHLGNGF YGQVYGGYLE TMFGGAGAEV
LYRPLDSNWA FGLDANYVKQ RDWRSAKDMM KFTDYSVKTG HLTAYWTPSF AQDVLVKASV
GQYLAGDKGG TLEIAKRFDS GVVVGGYATI TNVSKEEYGE GDFTKGVYVS VPLDLFSSGP
TRSRAAIGWT PLTRDGGQQL GRKFQLYDMT SDRSVNFR