Gene EcSMS35_2325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2325 
Symbol 
ID6145387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2356384 
End bp2357940 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content50% 
IMG OID641617199 
Producthypothetical protein 
Protein accessionYP_001744372 
Protein GI170681845 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.802767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00255877 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATAC GCGCTCCCAA TTCTGGACGT AAGCTCCTGC TTACCTGCAT TGTTGCAGGC 
GTGATGATTG CGATACTGGT GAGCTGCCTT CAGTTTTTAG TGGCCTGGCA TAAGCACGAA
GTCAAATACG ACACACTGAT TACCGACGTA CAAAAGTATC TCGATACCTA TTTTGCCGAC
CTGAAATCCA CTACTGACCG GCTTCAGCCG CTGACCTTAG ATACCTGCCA GCAGGCTAAC
CCCGAACTGA CCGCTCGCGC GGCGTTTAGC ATGAATGTCC GTACGTTTGT GCTGGTGAAA
GATAAAAAAA CATTCTGTTC ATCTGCGACT GGTGAGATGG ACATTCCACT AAAAGAATTG
ATTCCGGCGC TCGACATTAA TAAAAATGTC GATATGGCGA TCTTACCCGG TACGCCGATG
GTGCCGAACA AACCCGCAAT CGTCATCTGG TATCGCAACC CTTTGCTGAA AAATAGCGGC
GTCTTTGCCG CTCTGAATCT CAACCTGACG CCTTCACTCT TTTATAGTTC ACGGCAGGAA
GATTACGATG GCCTCGCCCT CATTATTGGT AATACTGCGC TATCTACCTT TTCTTCACGT
TTGATGAATG TTAATGAATT AACCGACATG CCGGTCCGTG AAACTAAAAT TGCGGGCATT
CCTCTGACCG TTCGGCTTTA TGCGGATGAC TGGACATGGA ACGATGTGTG GTACGCATTT
TTACTGGGTG GCATGAGTGG AACTTTCGTT GGACTTCTCT GCTATTACCT GATGAGTGTG
CGTATGCGCC CAGGCAGAGA AATCATGACC GCCATCAAGC GCGAACAATT TTACGTGGTA
TATCAACCGG TGGTTGATAC ACAAGCTTTG CGGGTAACGG GCCTGGAAGT ACTGCTACGC
TGGCGGCATC CAGTAGCAGG AGAAATCCCC CCGGATGCCT TCATTAACTT TGCCGAAGCG
CAAAAGATGA TTGTGCCACT GACTCAGCAC CTGTTTGAGT TGATTGCCCG CGATGCCGCA
GAATTAGAAA AAGTACTGCC GGTAGGCGTC AAATTTGGCA TTAACATTGC GCCGGCCCAC
TTGCACAGCG AAAGCTTTAA AGCGGATATC CAGAAACTGC TCACTTCCCT GCCCGCACAC
CATTTCCAGA TTGTGCTGGA AATTACCGAG CGCGATATGC TGAAAGAGCG AGAAGCCACA
CAACTCTTCG CCTGGCTGCA TTCGGTCGGC GTAGAAATTG CTATTGATGA CTTCGGCACC
GGGCACAGCG CGCTTATCTA TCTTGAGCGT TTTACGCTCG ATTATCTGAA AATTGATCGT
GGATTTATCA ACGCCATCGG TACGGAAACG ATCACTTCAC CCGTACTTGA CGCGGTGCTG
ACGCTGGCGA AACGTCTCAA TATGCTGACA GTTGCTGAAG GGGTCGAAAC GCCAGAACAG
GCACGATGGC TAAGCGAACG CGGCGTTAAT TTCATGCAAG GCTACTGGAT TAGTCGCCCG
TTACCGCTGG ACGATTTTGT TCGCTGGCTG AAGAAACCGT ATACGCCGCA GTGGTAA
 
Protein sequence
MFIRAPNSGR KLLLTCIVAG VMIAILVSCL QFLVAWHKHE VKYDTLITDV QKYLDTYFAD 
LKSTTDRLQP LTLDTCQQAN PELTARAAFS MNVRTFVLVK DKKTFCSSAT GEMDIPLKEL
IPALDINKNV DMAILPGTPM VPNKPAIVIW YRNPLLKNSG VFAALNLNLT PSLFYSSRQE
DYDGLALIIG NTALSTFSSR LMNVNELTDM PVRETKIAGI PLTVRLYADD WTWNDVWYAF
LLGGMSGTFV GLLCYYLMSV RMRPGREIMT AIKREQFYVV YQPVVDTQAL RVTGLEVLLR
WRHPVAGEIP PDAFINFAEA QKMIVPLTQH LFELIARDAA ELEKVLPVGV KFGINIAPAH
LHSESFKADI QKLLTSLPAH HFQIVLEITE RDMLKEREAT QLFAWLHSVG VEIAIDDFGT
GHSALIYLER FTLDYLKIDR GFINAIGTET ITSPVLDAVL TLAKRLNMLT VAEGVETPEQ
ARWLSERGVN FMQGYWISRP LPLDDFVRWL KKPYTPQW