Gene Spro_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2297 
Symbol 
ID5604780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2499917 
End bp2501167 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content58% 
IMG OID640937836 
Productextracellular solute-binding protein 
Protein accessionYP_001478526 
Protein GI157370537 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.33558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGCC ATACCCTTGC CGCTACGGCA GTCTGTACCC TGTCGCTCCT GGCGCTTAGC 
CTGTCATCTG CCTATGCCGC CCCGACGCAA ATTAACGCGT TGTTTATGAC CCAGGCGGCG
TACAGCGAAA ATGATATCCG TGCCATGACC GCCGATTTTA GCAAGCAGCA CCCGGATATC
ACCGTTAACC TGGAGTTCGT TCCTTACGAG GCGCTGCACG ATAAAATCGT CGCGGCGCGC
GGTGCCGGCA GTAACGGCTA CGATGTGGTG CTGTTCGACG CCATCTGGCC GGCAGAATTC
ACCAAGTTCG GCCTGCTGCA GGACGTAACC TCGCGCATCA GCGCCGACGA CAGCGCCAAA
ATCTTTGCCG GCGCCATGAC CACCGTCACC TATAAGGACA AGCGCTGGGG CATGCCGTGG
ATCCTCGACA CCAAATACCT GTATTACAAC AAAGCCATGC TGGCCAAGGC CGGGATTGCC
GCCCCGCCGA AAACCTGGCA GGAACTGGCG CAGCAGGCAG AGATCCTGAA GCAAAAAAAC
GTGGTCAAAT ACCCGCTGGT ATGGAGCTGG TCACAGGCCG AGGCACTGGT TTGCGATTAC
ACCACCCTGG TGTCTGCCTA TAAGGGGCAG TTTATCCAGC AGGGGAAAAT CACCTTCTCC
AGCCCAGGTT CACTGCAGGC CGTCGACTAT ATGAAAGCGT CGCTGGACAA GGGGCTGACC
AATCCGAACT CCCGCGAATA TCTGGAAGAG GACGTGCGCA AAGCGTTTTC CAACGGTGAC
GCGGCCTTCG CCCTTAACTG GACCTACATG TACAACATGG CCAACGATCC CAAGCAAAGC
AAAGTGGCCG GTGACGTCGG CATCGTGCCG GCTCCGGGAT CGGTGGCGGG TCAGGTCTCT
GCGGTTAACG GTTCGATGGG GCTAGGCATC GCCAAGGCCA GCGCCCACCC CGATCAGGCC
TGGCAATACA TCAGCTACAT GACCTCACAG CCGGTGCAGG ACAAATACGC CAAACTAAGC
CTGCCGATCT GGAAGTCGTC TTACGACGAT CCGACGGTGC AGAAGGGTCA GGAGCCGTTA
ATCGCCGCCG CCAAACAGTC GTTGAACGTG ATGCTGTCGC GCCCTGAAAC CGCCGATTAC
TCTCGTTTGT CCAACGGCCT GCAACAGGAC TTGCAGCAAA TTCTGCAGGG CAAGGTAACG
CCGCAGGCCG GGCTGGATGC GGCCACCCAA AGCGCTGCGC GGCTACGTTA A
 
Protein sequence
MKSHTLAATA VCTLSLLALS LSSAYAAPTQ INALFMTQAA YSENDIRAMT ADFSKQHPDI 
TVNLEFVPYE ALHDKIVAAR GAGSNGYDVV LFDAIWPAEF TKFGLLQDVT SRISADDSAK
IFAGAMTTVT YKDKRWGMPW ILDTKYLYYN KAMLAKAGIA APPKTWQELA QQAEILKQKN
VVKYPLVWSW SQAEALVCDY TTLVSAYKGQ FIQQGKITFS SPGSLQAVDY MKASLDKGLT
NPNSREYLEE DVRKAFSNGD AAFALNWTYM YNMANDPKQS KVAGDVGIVP APGSVAGQVS
AVNGSMGLGI AKASAHPDQA WQYISYMTSQ PVQDKYAKLS LPIWKSSYDD PTVQKGQEPL
IAAAKQSLNV MLSRPETADY SRLSNGLQQD LQQILQGKVT PQAGLDAATQ SAARLR