Gene Spro_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_0100 
Symbol 
ID5605678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp107431 
End bp108972 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content56% 
IMG OID640935585 
Productxylose transporter ATP-binding subunit 
Protein accessionYP_001476338 
Protein GI157368349 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID[TIGR02633] D-xylose ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGCC TATTAGAAAT GAAAAATATC ACCATGCAAT TCGGTGCGGT CAAGGCGGTA 
GATAATGTCA GCCTGAAGTT AGAGGCAGGC CAGGTATTAT CGCTGTGTGG TGAAAACGGT
TCGGGAAAGT CCACCCTGAT GAAAGTGCTG TGCGGTATAT ATCCGCATGG CAGTTATCAG
GGCGATATTT ATTTCTCTGA TGATTTATTA GCGGCGAAAA ATATTCGCGA CACCGAGCAG
AAAGGCATCG CCATCATTCA TCAGGAGCTG GCGCTGGTGA AGCAGATGAC GGTGCTGGAA
AACATGTTTC TCGGCAATGA ATGGCGCCGG TACGGCGTAA TGGATTATGA CGCCATGTAT
CTGCGCTGCC GACGGATGCT GGCGCAGGTC AAGCTGGCGG TGGATCCCAA CACCCCGATC
GGCGAACTGG GCCTGGGGCA GCAACAGCTG GTGGAGATTG CCAAGGCGCT GAACAAGCAG
GTGCGGCTGC TGGTGCTCGA TGAGCCGACG GCGTCGCTGA CCGAAAGTGA AACCGCTACG
CTGCTGGCGA TCATCGAGGA TCTGCGCGAT CACGGCATCG CCTGCATTTA CATTTCGCAC
AAGCTCAATG AGGTGAAGGC GATCTCGGAT CTGATCTGCG TGATCCGCGA CGGCAAGCAT
ATCGGTACGC GCCCGGCAGC GGAAATGAGT GAGGACGACA TTATCGCCAT GATGGTTGGG
CGCGAATTAA CCGAGCTCTA TCCGCAGCAA CAGCATGATA TCGGCGAGGT GATCCTGCAG
GTGGATAATC TCACCGCCTG GCACCCGGTG AACCGCCACG TTCGTCGGGT GGATGATGTT
TCTTTCACCC TGCGACGTGG CGAGATCCTC GGCGTGGCCG GGTTGGTCGG CTCCGGGCGC
ACCGAAACCA TGCAGTGCCT GTTCGGCGTC TACCCTGGCC GTTGGCAGGG GAGCATTAGC
CTCAATGGCC AGCCGGCCGT CATCAACAAT TGCCGCCAGG CGATGCGATA TGGCATTGCC
ATGGTGCCGG AAGATCGCAA GCGTGATGGC ATCGTGCCGG TGATGGGCGT GGGGGCCAAC
ATGACGTTGG CGGCGCTGAG CGACTTTAGC GGCGTGCTGA CGGTGCTTGA TGATGCCCGG
GAACAGGCGA CCATTCGTCA ATCGCTGGCG CAGTTAAAAG TGAAGACTTC CTCACCCGAA
CTGGCGATAG CCCGACTGAG CGGCGGTAAC CAGCAGAAGG CCATACTCGC CAAATGCCTG
CTACTGAAGC CGAAAATTCT CATTCTGGAT GAGCCGACGC GCGGTATCGA TATCGGTGCC
AAACACGAAA TCTATAAGTT AATCAATCAG CTGGTCCAGC AGGGTATTTC GGTGATCGTG
GTGTCTTCGG AGCTGCCTGA AGTGTTGGGA TTAAGCGATC GGGTATTGGT GATGCACCAG
GGGCGTATCA AGGCTTCGTT GGTCAACCAG GGTCTGACCC AGGAACAGGT GATGGAAGCC
GCATTGAGGA GTGAAACAGA TGTCGAAAAA CACGCAGTAT GA
 
Protein sequence
MPSLLEMKNI TMQFGAVKAV DNVSLKLEAG QVLSLCGENG SGKSTLMKVL CGIYPHGSYQ 
GDIYFSDDLL AAKNIRDTEQ KGIAIIHQEL ALVKQMTVLE NMFLGNEWRR YGVMDYDAMY
LRCRRMLAQV KLAVDPNTPI GELGLGQQQL VEIAKALNKQ VRLLVLDEPT ASLTESETAT
LLAIIEDLRD HGIACIYISH KLNEVKAISD LICVIRDGKH IGTRPAAEMS EDDIIAMMVG
RELTELYPQQ QHDIGEVILQ VDNLTAWHPV NRHVRRVDDV SFTLRRGEIL GVAGLVGSGR
TETMQCLFGV YPGRWQGSIS LNGQPAVINN CRQAMRYGIA MVPEDRKRDG IVPVMGVGAN
MTLAALSDFS GVLTVLDDAR EQATIRQSLA QLKVKTSSPE LAIARLSGGN QQKAILAKCL
LLKPKILILD EPTRGIDIGA KHEIYKLINQ LVQQGISVIV VSSELPEVLG LSDRVLVMHQ
GRIKASLVNQ GLTQEQVMEA ALRSETDVEK HAV