Gene EcolC_0551 details


Gene Information

Locus tag: EcolC_0551
Symbol:
ID: 6064908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 592924
End bp: 594960
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 54%
IMG OID: 641599958
Product: LppC family lipoprotein
Protein accession: YP_001723555
Protein GI: 170018601
COG category: [R] General function prediction only
COG ID: [COG3107] Putative lipoprotein
TIGRFAM ID:
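
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive positions on the replicon and that Protein Length excludes the terminal stop codon. Neither convention is stated in the record, but both agree with the listed numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 592924
end_bp = 594960

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon and is not counted in the protein length (assumption).
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2037
print(codon_count)     # 679
print(protein_length)  # 678
```

Because the arithmetic uses only the two endpoints, it does not depend on the Strand field, which is blank in this record.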


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.551185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA 
GCCCTGATTT TCGCCGGTTG TGGCACCCAT ACTCCCGATC AGTCCACTGC TTATATGCAG
GGCACGGCGC AGGCTGATTC TGCCTTTTAT CTTCAGCAGA TGCAGCAAAG CTCTGATGAT
ACCAGGATCA ACTGGCAATT ACTCGCCATT CGTGCACTGG TGAAAGAAGG TAAAACCGGG
CAGGCGGTTG AGTTGTTTAA CCAACTACCG CAAGAACTGA ACGATGCTCA GCGTCGCGAG
AAAACACTGC TGGCGGTAGA GATTAAACTG GCGCAGAAAG ATTTTGCTGG CGCGCAAAAC
TTGCTGGCGA AAATCACACC TGCCGATTTA GAACAAAACC AGCAAGCGCG TTACTGGCAG
GCAAAAATCG ATGCCAGCCA GGGGCGTCCT TCCATTGATT TACTGCGCGC GTTAATTGCT
CAGGAACCGC TGCTTGGCGC GAAAGAAAAA CAGCAGAATA TTGATGCCAC CTGGCAGGCG
CTCTCCTCCA TGACTCAGGA ACAGGCGAAT ACGCTGGTGA TCAACGCCGA CGAAAATATT
CTGCAAGGCT GGCTGGATCT GCAGCGCGTC TGGTTTGATA ACCGTAACGA TCCCGACATG
ATGAAAGCCG GGATCGCCGA CTGGCAGAAA CGTTATCCGA ACAATCCGGG CGCGAAAATG
CTGCCAACGC AGTTGGTTAA CGTAAAAGCG TTTAAACCAG CCTCGACCAA CAAAATCGCC
CTGCTGTTGC CACTGAATGG CCAGGCAGCG GTATTTGGTC GCACTATTCA GCAAGGCTTT
GAAGCGGCGA AAAATATCGG CACTCAGCCA GTGGCAGCTC AGGTAGCTGC CGCACCTGCC
GCAGACGTAG CAGAACAACC TCAACCGCAA ACCGTGGATG GCGTTGCCAG CCCGGCACAA
GCCTCGGTTA GCGATCTGAC CGGTGAACAG CCTGCAGCCC AGCCGGTGCC TGTAAGCGCC
CCGGCGACAA GCACCGCAGC GGTAAGCGCA CCCGCAAATC CATCCGCAGA GCTGAAAATC
TACGATACCT CATCACAACC ACTTAGCCAG ATCTTAAGCC AGGTTCAGCA GGATGGCGCG
AGTATTGTGG TCGGTCCGTT GCTGAAAAAT AACGTTGAAG AGTTGCTGAA GAGCAACACT
CCGCTGAACG TACTGGCACT GAACCAGTCG GAGAATATCG AAAATCGCGT CAATATTTGT
TACTTCGCGC TTTCACCGGA AGACGAAGCG CGCGATGCAG CGCGTCATAT TCGTGACCAG
GGTAAACAAG CGCCGCTGGT GCTGATCCCA CGCAGTTCAT TGGGCGATCG CGTAGCCAAT
GCGTTTGCGC AAGAGTGGCA GAAACTGGGC GGCGGCACCG TTCTGCAACA AAAATTTGGT
TCCACCAGCG AATTACGCGC GGGTGTTAAC GGCGGTTCTG GTATTGCTTT AACGGGTAGC
CCGATTACTC TCAGAGCGAC AACCGACTCC GGCATGACGA CCAACAATCC AACGCTGCAA
ACCACGCCAA CCGATGACCA GTTCACCAAT AATGGCGGTC GTGTCGATGC GGTGTACATT
GTGGCAACGC CGGGTGAAAT CGCTTTTATC AAACCGATGA TCGCCATGCG TAACGGTAGC
CAGAGCGGTG CAACGCTGTA CGCCAGCTCC CGCAGTGCGC AAGGGACCGC TGGCCCGGAT
TTCCGACTGG AGATGGAAGG CTTGCAGTAC AGCGAAATCC CGATGCTGGC AGGCGGTAAT
CTACCGTTAA TGCAGCAGGC ACTCAGCGCG GTGAATAACG ATTATTCACT GGCTCGCATG
TATGCGATGG GCGTCGATGC CTGGTCGCTG GCAAATCATT TCTCACAAAT GCGCCAGGTT
CAGGGTTTTG AAATCAACGG TAATACCGGA AGCCTGACGG CTAACCCGGA TTGCGTGATT
AACAGGAAGT TATCATGGCT ACAGTACCAA CAAGGTCAGG TAGTCCCCGT CAGTTAA
 
Protein sequence
MVPSTFSRLK AARCLPVVLA ALIFAGCGTH TPDQSTAYMQ GTAQADSAFY LQQMQQSSDD 
TRINWQLLAI RALVKEGKTG QAVELFNQLP QELNDAQRRE KTLLAVEIKL AQKDFAGAQN
LLAKITPADL EQNQQARYWQ AKIDASQGRP SIDLLRALIA QEPLLGAKEK QQNIDATWQA
LSSMTQEQAN TLVINADENI LQGWLDLQRV WFDNRNDPDM MKAGIADWQK RYPNNPGAKM
LPTQLVNVKA FKPASTNKIA LLLPLNGQAA VFGRTIQQGF EAAKNIGTQP VAAQVAAAPA
ADVAEQPQPQ TVDGVASPAQ ASVSDLTGEQ PAAQPVPVSA PATSTAAVSA PANPSAELKI
YDTSSQPLSQ ILSQVQQDGA SIVVGPLLKN NVEELLKSNT PLNVLALNQS ENIENRVNIC
YFALSPEDEA RDAARHIRDQ GKQAPLVLIP RSSLGDRVAN AFAQEWQKLG GGTVLQQKFG
STSELRAGVN GGSGIALTGS PITLRATTDS GMTTNNPTLQ TTPTDDQFTN NGGRVDAVYI
VATPGEIAFI KPMIAMRNGS QSGATLYASS RSAQGTAGPD FRLEMEGLQY SEIPMLAGGN
LPLMQQALSA VNNDYSLARM YAMGVDAWSL ANHFSQMRQV QGFEINGNTG SLTANPDCVI
NRKLSWLQYQ QGQVVPVS
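
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It is not the method used to build this record. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in the start codons it permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above, as printed.
first_row = "ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MVPSTFSRLKAARCLPVVLA
```

The translated row matches the first two blocks of the protein sequence (MVPSTFSRLK AARCLPVVLA). Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare with the GC content field.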