Gene EcolC_3175 details

Gene Information

Locus tag: EcolC_3175
Symbol:
ID: 6066572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3479143
End bp: 3481275
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table: 11
GC content: 50%
IMG OID: 641602591
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_001726125
Protein GI: 170021171
COG category: [D] Cell cycle control, cell division, chromosome partitioning
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning
        [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
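
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is an illustrative check only, assuming 1-based inclusive coordinates and a single stop codon at the end of the CDS; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3479143, 3481275)
print(length)                  # 2133
print(protein_length(length))  # 710
```

Both results agree with the Gene Length and Protein Length fields of this record.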


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.659876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT CAATAGTGGA AAATGGAAAA AAACGGGAAG AGACCGTCGA CGTTTCCAGA 
TTTGCCAAAG AAATTAAAAA GAACGCCTGT AAGATTGTAC TTGCGGGAAT TATCAGCGGT
GCAGTTGCCT ACCCATTAAT CAGCATGCTG TCATCAAAAT ATGTCTCAAC AGCTACGGTG
TTGCTAAAGG CTCAGGCTGA TAACGTTTCG CCGTTCCCAC AGGTGGAAGA TTTTGATTCC
ACGCGCACCG GCTACTATGA GACGCAATAT GCCTTGATGC AGTCGCGTAT TGTTCTGGAG
AAAGCGGTTC GCGAGTTAAA GCTGGATCAA AACCCAGACT TTATTGGCAA AAAAGCGGAT
GAAAAGGCCA GCAACAGCGA AGATGCTGAA CAGCAGCGCA TTGAGCGCGC GCTGAACACG
CTGCAAAAAA ATCTTACCGT TAGCGGTATT CGAACCACTA ATCTGGCGAC AGTCTCTTAT
GAGTCGACAT CGCCACAACT CTCCTCTGAG ATTGCCAACG GCGTCGCACA GGCGTTTATC
GATTATACGT TGGACCAAAA GCGGCTGAAG ACAGAAAAAG CCAGAGAAGT AAACCTTCAG
AAAATGGAGG AAGTGCAGAA AGAGATCGCG CAGCAGAAAG CCGATATCGA TAACTTCCTG
GCGAAAGAGG GCTTATTAAC GTTCCGCGGC ATTGATGGCT TCGAAACCGA GCAACTCAGC
ATTGTTACCA ACCGTCTGGC CGATGCTACC CAACGGCGTA TTGCGGCAGA ATCTCTGGAA
AAAGCCGTCA GCGCTGGGGG CCGGGTCTCT CTGGATAACA TCATTTCATT ACCGACGATC
TCTAACCATG CGCAAATTCA GGATTTGCGT ATCGCCATGA TTCAGGCGCA GCGGTCTTTG
TATGAGTTAC AAAAATCATA TGGCCCGAAA CATGCGAAGA TCCTGGAAGC GCAGGCTCAG
GTGAAGGCTA TTCAGGATCA GATGGGCGTG GTGCTCAGTG AGCTTAAAAA AGGCATTCAT
CAGCAATATC TGGCCGCGCT GGCGGATGAA AAGGATTATC AGGCGCAACT TGATCAACAG
AAAGAAATTT TTCAGAAACT GGCTGAAAAA CGCAGCCTGT ATAACAGCCA GAAATTGTCA
CTGGATAAAC TGGAAGATCT TTATAAAACC CTGTATCAGC GGACCCAGGA ACTGTCTCTG
TCCGGCATTA ATGCGGATGC AGTGCTGTAC GATCCGGCTG TCCCGGCAGT GAAGCCATCT
AAGCCAAATA AAGCGTTACT GTTAGTGATG GTGGTGGCGC TGGCCATGGC CTTCTTCTTT
ATGTACGTCA TTGTAAAAGC GGCGATGGAT AATTCCATCA GGACGCTCGG ACAAGTGACA
AAACGACTGG GCGTCGTCTC ACTGGGTGAG ATCCGCCGCA TTGCGGGGGC CGGGGACCGT
GCACAGGTTC GCGATTTGAT CACGCGAAAC CCCTTGAACG CCGACATTAT CCACAGCATT
CGTACACAGA TTTTGTTGGA TAACCGCCCG CAGCAGGTTC TGGCAATCTC CTCTGCAAAG
CAGGGTGAGG GGCGCTCTTT ACTGGCCAGT CTGCTGGCAA ACTCCTTCAG CTTTGATCAG
AAAACCTTAC TGCTTGATTT GGATTTCTTT AACCGTGATG GCCTGTCCGC CGAGTTTTCA
ACATCGACCT CTGCGGGAGT TGCAGAGCTG TTGCGTGGAG AAGTGACACT TGACGCTGCG
CGGATCACGC TTAGTGACAC GCTGGACTTT TTACCCCGCG GAAAAGCGAA CGCTTCGTCT
TTGCTGATGC TGTCTTCGGA ACGTTTTGAA CCTCTCATTC GTGACCTGCG AAATCGCTAC
CAGCGGATCA TCGTCGATGT CTCTGCGGTG AGCCAGAGTC AGGACATCGA GCTGATTAGT
CGGGTGGTTG ATGGTGTGGT TTTCGTTGTG CAAGCGGGGG CTGCGTCCGT GGAGACGCTG
CGCGCGGCGC TGGCGAAAGT TGACGCCAAC CAGGAAGTGG TCATGGGAGC GGTACTCAAT
CTGGTTGAGG AAAAAAATCT GCAGACGAAA GAGAGTCTTC GCTCGCTCAA TATCACTACT
GACGAATTGA TGAATACCAC AGGTCGGTTA TGA
 
Protein sequence
MKLSIVENGK KREETVDVSR FAKEIKKNAC KIVLAGIISG AVAYPLISML SSKYVSTATV 
LLKAQADNVS PFPQVEDFDS TRTGYYETQY ALMQSRIVLE KAVRELKLDQ NPDFIGKKAD
EKASNSEDAE QQRIERALNT LQKNLTVSGI RTTNLATVSY ESTSPQLSSE IANGVAQAFI
DYTLDQKRLK TEKAREVNLQ KMEEVQKEIA QQKADIDNFL AKEGLLTFRG IDGFETEQLS
IVTNRLADAT QRRIAAESLE KAVSAGGRVS LDNIISLPTI SNHAQIQDLR IAMIQAQRSL
YELQKSYGPK HAKILEAQAQ VKAIQDQMGV VLSELKKGIH QQYLAALADE KDYQAQLDQQ
KEIFQKLAEK RSLYNSQKLS LDKLEDLYKT LYQRTQELSL SGINADAVLY DPAVPAVKPS
KPNKALLLVM VVALAMAFFF MYVIVKAAMD NSIRTLGQVT KRLGVVSLGE IRRIAGAGDR
AQVRDLITRN PLNADIIHSI RTQILLDNRP QQVLAISSAK QGEGRSLLAS LLANSFSFDQ
KTLLLDLDFF NRDGLSAEFS TSTSAGVAEL LRGEVTLDAA RITLSDTLDF LPRGKANASS
LLMLSSERFE PLIRDLRNRY QRIIVDVSAV SQSQDIELIS RVVDGVVFVV QAGAASVETL
RAALAKVDAN QEVVMGAVLN LVEEKNLQTK ESLRSLNITT DELMNTTGRL
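
The relationship between the gene sequence and the protein sequence listed above can be checked with a short script. The sketch below is a minimal illustration, not the method used to produce this record: it strips the 10-base block spacing, translates codons with the standard codon assignments (which translation table 11 shares; table 11 differs mainly in the start codons it permits), and computes GC percentage. All function and variable names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last stretches of the gene sequence from this record.
print(translate("ATGAAATTAT CAATAGTGGA AAATGGAAAA"))      # MKLSIVENGK
print(translate("GACGAATTGA TGAATACCAC AGGTCGGTTA TGA"))  # DELMNTTGRL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.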