Gene Spro_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3970 
Symbol 
ID5603537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4397790 
End bp4399037 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content59% 
IMG OID640939530 
Productmajor facilitator transporter 
Protein accessionYP_001480193 
Protein GI157372204 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCA TTGTGCAACA ACGTCAACGC TGGTTCGGGG TTGTGGCCCT GCTGTTTTTG 
ATCGTCATCG CCTATGCCGA TCGGGTCAAT ATCGCCGTGA TGCTGGTTAA CCCCGACTTT
CTGCAGCACT TTCAGCTCGG GGGAAACCGT GCGCATCAGG GCATGCTGAT GACGGTGTTT
TTGCTCGGTT ACGGGCTTTC CGCCATGCTG CTGACGCCGT TTCTGGAAAC CCTGATGGGT
TATCGCCGTG CGCTGACGCT GAGCATTGTG CTATGGGCGT TGCTCACCGC CGCCTCGCCG
CTGGCAGGGT CGCTGCTGCT GTTATTTGCG GTGCGCGCAT TGTTGGGTGC CAGTGAAGGC
CCGTTGTTCT CGCTGAAAAC CATGTATATC GGCGATCACT TCGCTGCCGA TGAGCGCGGC
AAGCCCAATG CGGTCAGCAC TCTGGGCGTC TCGCTGGGGC TGGTGATTGG CTTTCCGCTG
GTGAGCTTCT TGATGGCGCA CTTCGGCTGG GCGGTTTCTT TCTATCTGCT GGCGCTGATC
AACCTGTTGC TGGGGCTGGC GTTGGTACGG CTGTTTATCC ATCCCGCCGC ATTGCCGCCG
CGTGCCGTTG ACCCAAGACC GATCCTGCAA CGCGTCTGGG ATACTTTTAC CCTCGCCTGG
CGCACCCCGA TGCTCGGCTG GATCATGCTT ATCGAGATCG CCACCCTCAG CTATCTGTGG
GGATCCAGCT CCTGGCTCCC GGCCTACCTG ACCGACGAGA AAGGCTTTTC CATCAAACAG
ATGGGCTGGA TGGCCTCCTT GCCGTTTATC GTCAGCATTG CCTCTAAATA TCTCGGCGGC
GTACTGCTTG ATCGCATCCG CCCTTATCAG GCACCGCTGA TTTTCGCCTT TGGCGGTGCG
GCAACGGCAC TGTGCATCTA TGGCGTGATG CATAGCGAAC AGCTTGGCTG GATAGCCTTC
TTCCTGCTGG CGGCCAATGC CTGTTGGGGC GCTCAGGGAG CCGCGATCCC GACGCTGTTG
CAGCATTATG CGCAGCCGCA GGCCGTCGGC AGCGCCTATG GGCTGATTAA CGGCATCGGC
AATATGTTCT CGGCGTTTGT ACCTATGATT ATGGGCATGG TGATGGCCAG CCAGGGCAAG
GTGTCTTCGG GGTTTGCGGT GCTGATTGTG TCGCAGGTAG TGACACTGTT GGCGGGCGGC
GTGCTGTTCA GCCGGATGCT GATGACGCGT GAGGCAAGGC GGGCGTAA
 
Protein sequence
MERIVQQRQR WFGVVALLFL IVIAYADRVN IAVMLVNPDF LQHFQLGGNR AHQGMLMTVF 
LLGYGLSAML LTPFLETLMG YRRALTLSIV LWALLTAASP LAGSLLLLFA VRALLGASEG
PLFSLKTMYI GDHFAADERG KPNAVSTLGV SLGLVIGFPL VSFLMAHFGW AVSFYLLALI
NLLLGLALVR LFIHPAALPP RAVDPRPILQ RVWDTFTLAW RTPMLGWIML IEIATLSYLW
GSSSWLPAYL TDEKGFSIKQ MGWMASLPFI VSIASKYLGG VLLDRIRPYQ APLIFAFGGA
ATALCIYGVM HSEQLGWIAF FLLAANACWG AQGAAIPTLL QHYAQPQAVG SAYGLINGIG
NMFSAFVPMI MGMVMASQGK VSSGFAVLIV SQVVTLLAGG VLFSRMLMTR EARRA