Gene Spro_2097 details

Gene Information

Locus tag: Spro_2097
Symbol:
ID: 5606474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2291700
End bp: 2293079
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 58%
IMG OID: 640937635
Product: major facilitator transporter
Protein accession: YP_001478328
Protein GI: 157370339
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00895] benzoate transport
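
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the stop codon does not encode a residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Spro_2097.
length_bp = gene_length(2291700, 2293079)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1380 bp) and Protein Length (459 aa) fields.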


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.201063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACC GCAACATTGT CGACGTCAAA GCGTGGATAG ACTCCCGCCC GATTTCCCGC 
TATCAGTGGC TGGTATTAAT TCTGTGTTTC GTGATCATCA TGTTTGATGG TTATGACGCC
GCGGTGATGG GTTTTATCGC CCCCGCACTG ATTGAAGACT GGGGTATTTC GCGCGCCGAA
ATGGGCCCGA TCCTCGGTGC TGCAATGTTT GGCGTCGCTT TGGGAGCGCT GATCGCCGGC
CCCTATTCCG ATCGTTTTGG CCGCAAGAAG GTGCTGCTGC TGTCCATTCT GTGCTTTGCG
TTGTTCAGCC TGCTGAGCAC CTTCGCCCGC ACCCCGCTGG AAATGGCCTT GCTGCGTTTT
CTTACCGGCC TGGGCTTGGG CGCGGTGATG CCCAACTGTG TCACGCTGGT GTCAGAGTAT
ATGCCCGAAC GCCGCCGCGG CCTGATGATC ACCCTGATGT ACAGCGGCTT TAATATCGGT
TCCGGTGCCG GCGGGTTTAT CGCCGCCGCC ATGTTACCGC ACTATGGCTG GAAGGCGGTG
CTGTTCCTGG GCGGCATCAT GCCGCTGCTG ATGTTGCCGC TGCTGATTTG GATCCTGCCG
GAATCGACGC TGTATATGGT GGTGCGCAAC CATCACAAAG AAAAAATTGC CCAAATCCTG
CGCCGCGCCG GTGGCGTGTT CAGCGCCGGT ACCACCTTCA TACTGAAAAC CCCGGTGATC
CCAAAGAAAG CCCGCGTGCT GCAACTGTTC AGTAATGGTT ACGCCAGGGG CACCCTGGTG
CTGTGGCTGA CCTATTTTAT GGGGCTGTTC GTGATTTACC TGCTCAACGG CTGGCTGCCG
ACCATCGTGC GCGGCGCCGG TTTCTCACTG GAGCGCGCCG CGATAATCGC CGGTCTGTTC
CAACTGGGCG GTACCCTGGG AGGCCTGCTG GTGGGTTACC TGATGGACCG CTTCGTCGCC
AAACGGGTGA TCGGACTGTT TTATCTGATG GGCATGTTCT GCCTGCTGTC ACAGGGTATC
TGGGGCTTTG GTCCGACGCT GCTGGCGGTG TTGGTCTTTG TTAGCGGCAT GTGTATCAAC
GGGGCGCAGA CCGGTCTGCA AGCATTCTCA CCGGCATTCT ACCCCACTGA AATGCGCGCG
ACCGGCGTCA GCTGGATGCA CGGCATCGGC CGCAGCGGCG CCATTATCAG TTCGTCAATG
GGTGGCGTAT TGCTGGGTAT TTTCCCCGGC GCCACCAGCA TTTTTATCAT TCTGGCGATC
CCGGCGTTGC TGGCGGCGAT CACTATCATT AATCATCAAA AGGCACATCC GGCGGAAATG
GTTAAGGGCA TCGACATCAC CGATCTGCCC GCGTTGTCAC GCACCATGAA TAACCGGTAA
 
Protein sequence
MSNRNIVDVK AWIDSRPISR YQWLVLILCF VIIMFDGYDA AVMGFIAPAL IEDWGISRAE 
MGPILGAAMF GVALGALIAG PYSDRFGRKK VLLLSILCFA LFSLLSTFAR TPLEMALLRF
LTGLGLGAVM PNCVTLVSEY MPERRRGLMI TLMYSGFNIG SGAGGFIAAA MLPHYGWKAV
LFLGGIMPLL MLPLLIWILP ESTLYMVVRN HHKEKIAQIL RRAGGVFSAG TTFILKTPVI
PKKARVLQLF SNGYARGTLV LWLTYFMGLF VIYLLNGWLP TIVRGAGFSL ERAAIIAGLF
QLGGTLGGLL VGYLMDRFVA KRVIGLFYLM GMFCLLSQGI WGFGPTLLAV LVFVSGMCIN
GAQTGLQAFS PAFYPTEMRA TGVSWMHGIG RSGAIISSSM GGVLLGIFPG ATSIFIILAI
PALLAAITII NHQKAHPAEM VKGIDITDLP ALSRTMNNR