Gene EcolC_2403 details

Gene Information

Locus tag: EcolC_2403
Symbol:
ID: 6068594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2650429
End bp: 2651859
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 50%
IMG OID: 641601812
Product: hypothetical protein
Protein accession: YP_001725364
Protein GI: 170020410
COG category:
COG ID:
TIGRFAM ID:
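
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2650429
end_bp = 2651859
gene_length_bp = 1431
protein_length_aa = 476

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is the stop codon.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("codons excluding stop:", coding_codons)
```

Under those assumptions, the span should equal the listed gene length and the codon count minus one should equal the listed protein length.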


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000223756
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTTTTC ATTTTTTACC GGAAGTTACC GACGTTTTGA GCCGTTTCGT TCCTCGCATT 
ATTTCGTTTT ATTTACTCTT GCTGGCGGCA GGCGGTACAG CTAACGCACA ATCTACCTTC
GAGCAAAAAG CGGCAAATCC CTTTGATAAT AACAATGATG GTCTGCCGGA TTTAGGCATG
GCACCTGAAA ATCATGATGG GGAAAAACAC TTTGCTGAAA TTGTGAAAGA TTTCGGCGAA
ACCAGTATGA ATGATAACGG GCTGGATACT GGCGAGCAGG CAAAAGCTTT CGCATTGGGA
AAAGTCCGCG ACGCGCTTAG TCAACAGGTT AATCAGCACG TAGAGTCCTG GCTATCACCG
TGGGGAAATG CCAGTGTTGA TGTCAAAGTG GATAACGAAG GTCATTTTAC CGGCAGTCGT
GGAAGCTGGT TTGTGCCGTT ACAAGATAAT GATCGTTATC TCACCTGGAG CCAGCTTGGT
CTTACTCAGC AGGATGATGG GTTGGTGAGC AATGTGGGCG TTGGGCAACG CTGGGCGCGC
GGCAACTGGC TGGTGGGTTA TAACACTTTT TATGACAACT TGCTGGACGA AAATCTTCAG
CGAGCGGGCT TTGGTGCCGA AGCGTGGGGC GAATATTTGC GACTATCGGC AAACTTTTAT
CAGCCATTTG CTGCATGGCA TGAACAGACA GCCACGCAGG AACAACGGAT GGCGCGCGGG
TACGACCTGA CAGCCCGGAT GCGCATGCCG TTCTATCAAC ACCTCAATAC CAGTGTCAGC
GTAGAACAGT ATTTTGGTGA TCGTGTTGAT TTGTTTAACT CTGGTACGGG TTATCACAAT
CCCGTCGCGT TGAGTCTGGG ATTAAATTAC ACCCCTGTGC CATTAGTCAC TGTGACGGCC
CAGCATAAAC AGGGTGAAAG TGGCGAGAAT CAAAATAACC TCGGGCTGAA TCTTAATTAC
CGCTTTGGTG TACCGCTCAA AAAACAACTT TCTGCGGGCG AGGTTGCCGA AAGTCAGTCG
TTACGTGGTA GTCGCTATGA TAATCCGCAG CGAAATAATC TACCGACTCT TGAGTACCGA
CAGCGAAAAA CGTTAACGGT GTTTCTGGCG ACACCGCCGT GGGATCTAAA ACCTGGCGAA
ACAGTGCCGC TGAAATTACA AATCCGCAGT CGTTACGGTA TTCGGCAACT GATTTGGCAG
GGCGATACGC AGATATTAAG TTTGACGCCA GGCGCACAAG CCAACAGCGC GGAGGGCTGG
ACGCTGATCA TGCCTGACTG GCAGAACGGG GAAGGGGCGA GCAATCACTG GCGATTGTCG
GTGGTGGTGG AAGATAACCA GGGGCAGCGT GTCTCCTCCA ATGAGATCAC GCTAACGCTT
GTCGAACCGT TCGACGCATT GTCAAACGAC GAACTGCGCT GGGAACCGTA A
 
Protein sequence
MVFHFLPEVT DVLSRFVPRI ISFYLLLLAA GGTANAQSTF EQKAANPFDN NNDGLPDLGM 
APENHDGEKH FAEIVKDFGE TSMNDNGLDT GEQAKAFALG KVRDALSQQV NQHVESWLSP
WGNASVDVKV DNEGHFTGSR GSWFVPLQDN DRYLTWSQLG LTQQDDGLVS NVGVGQRWAR
GNWLVGYNTF YDNLLDENLQ RAGFGAEAWG EYLRLSANFY QPFAAWHEQT ATQEQRMARG
YDLTARMRMP FYQHLNTSVS VEQYFGDRVD LFNSGTGYHN PVALSLGLNY TPVPLVTVTA
QHKQGESGEN QNNLGLNLNY RFGVPLKKQL SAGEVAESQS LRGSRYDNPQ RNNLPTLEYR
QRKTLTVFLA TPPWDLKPGE TVPLKLQIRS RYGIRQLIWQ GDTQILSLTP GAQANSAEGW
TLIMPDWQNG EGASNHWRLS VVVEDNQGQR VSSNEITLTL VEPFDALSND ELRWEP
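
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short fragment used here is just the first 30 bases of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("ATGGTTTTTC ATTTTTTACC GGAAGTTACC")
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full gene sequence through `clean_sequence` gives a string whose length can be compared with the Gene Length field, and `gc_percent` on that string can be compared with the GC content field.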