Gene EcSMS35_1286 details

Gene Information

Locus tag: EcSMS35_1286
Symbol: araG
ID: 6145114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1274447
End bp: 1275961
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 52%
IMG OID: 641616164
Product: L-arabinose transporter ATP-binding protein
Protein accession: YP_001743344
Protein GI: 170682274
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
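
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1274447
END_BP = 1275961
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the interval spans 1515 bp, and 1515 / 3 = 505 codons, of which 504 encode amino acids.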


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00939133
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.000000120675
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACATT TCCCGGCGTT 
AAGGCGCTGA CGGATATTAG TTTTGATTGC TATGCCGGTC AGGTTCATGC GTTGATGGGT
GAAAATGGCG CAGGAAAATC AACTCTCTTA AAAATCCTCA GCGGCAACTA TGCGCCAACC
ACGGGTTCTG TAGTGATTAA TGGGCAGGAA ATGTCCTTTT CCGACACGAC CGCAGCACTT
AATGCGGGTG TGGCGATTAT TTACCAGGAA CTGCATCTCG TGCCGGAAAT GACCGTCGCG
GAAAACATCT ATCTCGGCCA GCTGCCGCAT AAAGGCGGCA TTGTGAATCG CTCATTGCTG
AATTATGAGG CGGGTTTACA ACTTAAACAT CTTGGTATGG ATATTGACCC GGACACGCCG
CTGAAATATC TCTCCATTGG TCAGTGGCAG ATGGTTGAAA TCGCCAAGGC GCTGGCGCGT
AACGCCAAAA TTATCGCCTT TGATGAGCCA ACCAGCTCCC TTTCTGCCCG CGAAATCGAC
AATCTTTTCC GCGTTATTCG TGAACTGCGA AAAGAGGGGC GGGTGATCTT ATACGTTTCT
CACCGTATGG AAGAAATATT TGCCCTCAGC GATGCCATCA CCGTCTTTAA AGATGGACGT
TATGTCAAAA CCTTTACCGA TATGCAGCAG GTTGACCACG ACGCGCTGGT GCAGGCGATG
GTCGGGCGCG ACATTGGCGA TATCTACGGC TGGCAACCTC GTAGTTATGG CGAGGAGCGC
CTGCGTCTTG ATGCTGTGAA AGCACCAGGC GTGCGTACGC CAATAAGTCT GGCGGTTCGC
AGTGGTGAAA TTGTCGGTCT GTTTGGTCTG GTAGGAGCGG GGCGTAGCGA ATTAATGAAA
GGCTTGTTTG GCGGGACGCA AATCACCGCC GGTCAGGTTT ATATCGACCA ACAGCCGATC
GATATTCGTA AACCGAGCCA CGCCATTGCC GCAGGCATGA TGCTCTGCCC GGAAGATCGC
AAAGCCGAAG GCATTATTCC CGTGCACTCC GTTCGCGACA ATATCAACAT CAGTGCCAGA
CGTAAACATG TGCTCGGCGG TTGTGTAATC AACAACGGTT GGGAAGAAAA CAATGCCGAT
CACCACATTC GTTCGCTCAA CATCAAAACG CCGGGCGCTG AGCAACTGAT CATGAATCTC
TCAGGCGGAA ATCAGCAAAA AGCCATTCTG GGCCGCTGGT TATCGGAAGA GATGAAGGTC
ATTTTGCTGG ATGAACCTAC GCGCGGCATT GATGTTGGTG CTAAGCATGA AATTTACAAC
GTGATTTATG CGCTGGCGGC GCAGGGTGTG GCGGTGCTGT TTGCCTCCAG CGACTTACCT
GAAGTCCTCG GCGTTGCCGA CCGGATTGTG GTGATGCGGG AAGGTGAAAT CGCCGGTGAA
TTGTTACACG AGCAGGCAGA TGAGCGTCAG GCACTGAGCC TTGCGATGCC TAAAGTCAGC
CAGGCAGTTG CCTGA
 
Protein sequence
MQQSTPYLSF RGIGKTFPGV KALTDISFDC YAGQVHALMG ENGAGKSTLL KILSGNYAPT 
TGSVVINGQE MSFSDTTAAL NAGVAIIYQE LHLVPEMTVA ENIYLGQLPH KGGIVNRSLL
NYEAGLQLKH LGMDIDPDTP LKYLSIGQWQ MVEIAKALAR NAKIIAFDEP TSSLSAREID
NLFRVIRELR KEGRVILYVS HRMEEIFALS DAITVFKDGR YVKTFTDMQQ VDHDALVQAM
VGRDIGDIYG WQPRSYGEER LRLDAVKAPG VRTPISLAVR SGEIVGLFGL VGAGRSELMK
GLFGGTQITA GQVYIDQQPI DIRKPSHAIA AGMMLCPEDR KAEGIIPVHS VRDNINISAR
RKHVLGGCVI NNGWEENNAD HHIRSLNIKT PGAEQLIMNL SGGNQQKAIL GRWLSEEMKV
ILLDEPTRGI DVGAKHEIYN VIYALAAQGV AVLFASSDLP EVLGVADRIV VMREGEIAGE
LLHEQADERQ ALSLAMPKVS QAVA
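
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this in Python. It is a minimal example, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in permitted start codons). Whitespace in the input is ignored, so the 10-base groups shown above can be pasted in directly.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence.
    first = "ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACATT TCCCGGCGTT"
    print(translate(first))  # MQQSTPYLSFRGIGKTFPGV
    # Last 15 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("CAGGCAGTTG CCTGA"))  # QAVA
```

The two fragments reproduce the first 20 and last 4 residues of the protein sequence listed above.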