Gene EcSMS35_3646 details


Gene Information

Locus tag: EcSMS35_3646
Symbol: tsgA
ID: 6146691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3704697
End bp: 3705878
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 52%
IMG OID: 641618473
Product: hypothetical protein
Protein accession: YP_001745613
Protein GI: 170682755
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID:
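
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3704697
END_BP = 3705878
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span covers 1182 bp, which is 394 codons, one of which is the stop.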


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.865659
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.000149858
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTAACA GCAATCGCAT CAAGCTCACA TGGATTAGCT TTCTCTCCTA CGCACTGACC 
GGTGCGTTGG TTATTGTCAC CGGGATGGTG ATGGGAAATA TCGCCGATTA TTTCAATCTG
CCTGTTTCCA GTATGAGTAA TACCTTCACC TTCCTCAACG CCGGCATTTT AATCTCTATC
TTCCTCAACG CCTGGCTGAT GGAAATCGTC CCGTTGAAAA CGCAGTTACG TTTTGGCTTT
CTCCTGATGG TGCTGGCGGT TGCCGGTTTG ATGTTCAGCC ACAGCCTGGC ACTGTTCTCG
GCGGCGATGT TCATTCTCGG GGTGGTCAGC GGCATCACCA TGTCGATTGG TACATTCCTG
GTAACACAAA TGTATGAAGG ACGTCAGCGC GGTTCACGCC TGTTATTTAC CGACTCCTTC
TTCAGTATGG CCGGGATGAT TTTCCCAATG ATCGCCGCGT TTCTGCTGGC GCGCAGCATT
GAGTGGTACT GGGTTTATGC CTGCATCGGG CTGGTGTACG TCGCTATCTT TATTCTGACC
TTCGGCTGTG AGTTCCCGGC GCTGGGTAAA CATGCGCCAA AAACGGACGC TCCGGTAGCG
AAAGAAAAAT GGGGGATCGG CGTGCTGTTT CTCTCCATTG CGGCACTGTG CTACATCCTC
GGTCAGTTAG GTTTTATCTC CTGGGTGCCT GAGTATGCCA AAGGCCTGGG CATGAGCCTG
AACGACGCGG GCACGCTGGT GAGTAACTTC TGGATGTCAT ACATGGTCGG CATGTGGGCG
TTCAGCTTTA TTCTTCGCTT CTTTGATTTG CAACGCATTC TGACCGTACT GGCTGGTCTG
GCTGCGATTC TGATGTACGT CTTTAACACC GGAACACCGG CACATATGGC GTGGTCAATT
CTCGCCCTGG GCTTCTTCTC CAGCGCGATC TATACCACCA TCATCACTTT GGGTTCACAG
CAGACCAAAG TACCGTCGCC AAAACTGGTT AACTTTGTCC TGACCTGCGG GACCATCGGT
ACTATGTTGA CCTTTGTGGT TACCGGCCCG ATTGTTGAAC ATAGCGGTCC GCAGGCGGCG
CTGCTGACGG CAAACGGTCT GTACGCTGTC GTCTTTGTGA TGTGCTTCCT GTTAGGTTTC
GTCAGCCGTC ACCGTCAGCA TAACACCCTG ACCTCTCATT AA
 
Protein sequence
MTNSNRIKLT WISFLSYALT GALVIVTGMV MGNIADYFNL PVSSMSNTFT FLNAGILISI 
FLNAWLMEIV PLKTQLRFGF LLMVLAVAGL MFSHSLALFS AAMFILGVVS GITMSIGTFL
VTQMYEGRQR GSRLLFTDSF FSMAGMIFPM IAAFLLARSI EWYWVYACIG LVYVAIFILT
FGCEFPALGK HAPKTDAPVA KEKWGIGVLF LSIAALCYIL GQLGFISWVP EYAKGLGMSL
NDAGTLVSNF WMSYMVGMWA FSFILRFFDL QRILTVLAGL AAILMYVFNT GTPAHMAWSI
LALGFFSSAI YTTIITLGSQ QTKVPSPKLV NFVLTCGTIG TMLTFVVTGP IVEHSGPQAA
LLTANGLYAV VFVMCFLLGF VSRHRQHNTL TSH
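
The protein sequence can be checked against the gene sequence by translating it with the genetic code named in the record (translation table 11). The following is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI base ordering, where tables 1 and 11 assign the same amino acids to all 64 codons and differ only in the permitted start codons. The helper name `translate` is illustrative.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
# Tables 1 and 11 share these assignments; table 11 only adds alternative
# start codons, which do not matter here because the gene begins with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used above) is ignored, and any
    trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and the final line of the gene sequence above.
    print(translate("ATGACTAACA GCAATCGCAT CAAGCTCACA"))
    print(translate("GTCAGCCGTC ACCGTCAGCA TAACACCCTG ACCTCTCATT AA"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives an end-to-end consistency check of the two blocks.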