Gene EcolC_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1963 
Symbol 
ID6068321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2167956 
End bp2169560 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content48% 
IMG OID641601375 
Producthypothetical protein 
Protein accessionYP_001724936 
Protein GI170019982 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000030745 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT 
CTACAGCAAC AAACTCCACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC
GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT
GAAATACCGC CGATTAATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGCCAGCCAT
CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT
CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGATTAG TAGACCAGGC ACGAAAGCAA
AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT
GGCGTCATGC TCGCTACAAA TCAGGGTTTA CCCAGAGAGA CGTTTGATTT AGCGGTGATT
GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCCAGCCCG
TGGTCAGGCC TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC
TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT
GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATCACGTTG
ATGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC
TTACACATTG TCACCGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA AGAGGGCCTT
TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGC
CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG
CGCAAACAAC GCGACCCATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT
AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA
CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC
CGGGTTTTTA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG GCTGCTGGCC
TTACGTGAAG CGGGGATCAT TCATATTCTC GCCCTCGGTG AAGACTACGA AATGGAAATT
AATGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT
GATGCCCGCG GACAGCGTCC GCTTAAAGTG AAAGATATTC CTTTCCCTGG GCTACGCGAG
CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGCG AAGATTATAC GTTACAGCAA
CCCGAAGATA TTCGTGGACG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG
CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC
GTAAAACCTG CATCCCGTGC ACGTCGGCGT CTTTCGTTTG ATTAA
 
Protein sequence
MKKIAIVGAG PTGIYTLFSL LQQQTPLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI 
EIPPINCTYL EWLQKQEASH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARKQ
KFAVAVYESC QVTDLQITNA GVMLATNQGL PRETFDLAVI ATGHVWPDEE EATRTYFPSP
WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL
MSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEEGL LDRVFRLIVE EIKFADPDWS
QRIALESLNV DSFAQAWFAE RKQRDPFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV
QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYEMEI
NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ
PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD