Gene EcolC_0648 details

Gene Information

Locus tag: EcolC_0648
Symbol:
ID: 6065681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 701381
End bp: 703042
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 50%
IMG OID: 641600055
Product: band 7 protein
Protein accession: YP_001723651
Protein GI: 170018697
COG category: [S] Function unknown
COG ID: [COG2268] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
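
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 701381
end_bp = 703042
gene_length_bp = 1662
protein_length_aa = 553

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)  # 1662 554 553
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.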


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGATA TTGTTAATTC TGTGCCCTCC TGGATGTTTA CCGCGATTAT TGCCGTATGC 
ATTCTGTTTA TTATTGGAAT TATTTTCGCC AGGCTCTATC GTCGCGCTTC GGCAGAGCAA
GCTTTTGTTC GTACTGGTTT AGGTGGGCAA AAAGTGGTAA TGAGCGGTGG CGCAATCGTG
ATGCCGATCT TTCATGAAAT AATCCCCATC AATATGAATA CTCTGAAGCT GGAAGTCAGC
CGCTCAACCA TTGATAGCCT GATTACGAAA GATCGTATGC GCGTCGATGT AGTAGTCGCT
TTCTTTGTGC GGGTAAAACC TTCAGTAGAA GGGATTGCCA CCGCTGCCCA GACGCTGGGG
CAACGCACCC TGTCGCCTGA AGACTTACGT ATGTTGGTTG AAGATAAATT TGTCGATGCC
CTCCGTGCAA CAGCTGCGCA AATGACCATG CATGAGTTAC AGGATACCCG CGAGAACTTC
GTGCAGGGGG TGCAAAATAC AGTGGCAGAA GACCTGTCGA AAAACGGTCT GGAACTGGAG
AGCGTTTCAC TTACCAACTT TAACCAGACC TCGAAAGAAC ATTTCAATCC GAACAATGCC
TTTGACGCCG AAGGTTTAAC CAAACTGACT CAGGAAACAG AGCGCCGTCG CCGCGAACGT
AACGAAGTTG AACAGGATGT AGAAGTTGCG GTGCGTGAAA AAAATCGCGA TGCGCTATCG
CGCAAGCTGG AGATTGAACA GCAAGAAGCG TTTATGACGC TTGAGCAGGA GCAGCAGGTT
AAAACCCGTA CTGCCGAACA GAATGCACGT ATTGCGGCTT TTGAAGCTGA ACGTCGTCGT
GAAGCAGAGC AGACACGAAT TCTGGCTGAA CGACAGATTC AGGAAACAGA AATCGACCGC
GAACAGGCCG TCCGCTCAAG AAAGGTTGAA GCTGAACGTG AAGTTCGCAT TAAAGAGATC
GAACAGCAGC AGGTCACCGA AATCGCTAAC CAGACGAAAT CGATCGCTAT TGCCGCCAAA
TCGGAACAAC AGTCCCAGGC AGAAGCGCGT GCTAATCTCG CACTTGCAGA AGCGGTAAGC
GCCCAACAAA ACGTAGAAAC CACTCGCCAG ACTGCCGAAG CCGATCGTGC TAAACAAGTT
GCCCTAATCG CTGCCGCGCA GGATGCAGAA ACCAAAGCGG TTGAACTGAC CGTGCGGGCG
AAAGCAGAAA AAGAAGCCGC AGAGATGCAG GCGGCGGCTA TCGTTGAGTT AGCCGAAGCT
ACACGTAAAA AGGGTCTGGC GGAAGCAGAA GCACAACGTG CGCTGAACGA TGCTATCAAC
GTACTTTCTG ATGAACAAAC CAGCCTTAAA TTCAAACTGG CCTTGTTGCA GGCGCTGCCT
GCGGTAATAG AGAAATCCGT TGAGCCGATG AAGTCAATCG ACGGTATCAA GATTATTCAG
GTCGATGGTC TGAATCGTGG CGGCGCTGCG GGTGATGCGA GTACAGGTAG CGTTAGTGGA
GGAAACCTTG CAGAGCAGGC ATTGTCTGCC GCCCTTTCTT ACCGCACACA GGCACCGCTG
ATTGACTCCT TGCTCAATGA AATTGGTGTT TCAGGCGGCT CACTGACGGC ATTGACTTCA
CCCTTAACCT CAACAACTCC CGTCGCCGAA AACGTAGAAT AA
 
Protein sequence
MDDIVNSVPS WMFTAIIAVC ILFIIGIIFA RLYRRASAEQ AFVRTGLGGQ KVVMSGGAIV 
MPIFHEIIPI NMNTLKLEVS RSTIDSLITK DRMRVDVVVA FFVRVKPSVE GIATAAQTLG
QRTLSPEDLR MLVEDKFVDA LRATAAQMTM HELQDTRENF VQGVQNTVAE DLSKNGLELE
SVSLTNFNQT SKEHFNPNNA FDAEGLTKLT QETERRRRER NEVEQDVEVA VREKNRDALS
RKLEIEQQEA FMTLEQEQQV KTRTAEQNAR IAAFEAERRR EAEQTRILAE RQIQETEIDR
EQAVRSRKVE AEREVRIKEI EQQQVTEIAN QTKSIAIAAK SEQQSQAEAR ANLALAEAVS
AQQNVETTRQ TAEADRAKQV ALIAAAQDAE TKAVELTVRA KAEKEAAEMQ AAAIVELAEA
TRKKGLAEAE AQRALNDAIN VLSDEQTSLK FKLALLQALP AVIEKSVEPM KSIDGIKIIQ
VDGLNRGGAA GDASTGSVSG GNLAEQALSA ALSYRTQAPL IDSLLNEIGV SGGSLTALTS
PLTSTTPVAE NVE
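
The relationship between the gene sequence and the protein sequence can be sketched in Python. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the sequences are assumed to be pasted in as plain strings that still contain the display spaces and line breaks. It uses the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, not in the codon-to-residue mapping).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence above, followed by its final stop codon.
print(translate("ATGGATGATA TTGTTAAT TAA"))  # MDDIVN
```

If the record is internally consistent, applying `translate` to the full gene sequence should reproduce the protein sequence, and `gc_content` should come out close to the listed 50%.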