Gene Spro_4472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4472 
SymbolmalE 
ID5603064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4945782 
End bp4946975 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content56% 
IMG OID640940034 
Productmaltose ABC transporter periplasmic protein 
Protein accessionYP_001480694 
Protein GI157372705 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCA GCTTTACCAC CGCCCGCACA CTGGTGCTTT CGGCTCTGAC CACGTTGGTG 
CTTTCTTCTT CCGCCTTCGC CAAGATCGAG GAAGGGAAAT TGGTTATCTG GATCAACGGC
GATAAGGGCT ATAACGGCCT GGCCGAGGTC GGCAAAAAAT TTGAGAAAGA CACCGGTATC
AAAGTCACCA TCGAGCACCC GGATAAGCTG GAAGAGAAAT ACCCGCAGGT TGCCGCCACC
GGCGACGGCC CGGATATTAT CTTCTGGGCC CATGACCGCT TTGGCGGCTA CGCGCAGTCG
GGCCTGCTGG CCGAAATCCA TCCTTCCAAA GCCTTCCAGG ACAAACTGTT CCCATTCACC
TGGGACGCGG TGCGCTACGA CGGCAAGCTG ATCGGCTACC CGATCGCGGT GGAAGCGCTG
TCGCTGATTT ATAACAAAGA CCTGATCAAA CAGGCGCCAA AAACCTGGGA AGAGATCCCG
GCGCTGGACA AAGAGCTGCG CGCCAAAGGC AAGAGCGCCA TCATGTGGAA CCTGCAAGAA
CCCTACTTCA CCTGGCCGAT TATCGCTGCC GACGGCGGTT ATGCCTTCAA GTATGAGAAC
GGCAAGTACA ACATCAAGGA CGTAGGCGTG GCCAATACCG GTTCACAGGC AGGCCTGCAG
TTCATCGTCG ATTTGGTGAA AAACAAACAC ATCAACGCCG ATACCGATTA CTCGATTGCC
GAAGCCGCGT TCAATAAAGG CCAGACCGCG ATGACCATCA ATGGCCCATG GGCCTGGTCA
AACATCGAAC AAAGCAAAAT CAACTACGGC GTGACCCTGC TGCCAACCTT TAAAGGCAAG
CCTTCCAAAC CTTTCGTCGG CGTACTGACC GCCGGGATCA ACGCCGCCAG CCCGAACAAA
GAGCTGGCGA CCGAATTCCT GGAAAACTAC CTGCTGACCA ACGAAGGGCT GGCGGACGTC
AACAAGGACA AACCGCTGGG CGCTGTAGCG CTGAAGTCCT ATCAGGAACA ACTGGCGAAA
GATCCGAAGA TTGCCGCCAC CATGGAGAAC TCACAGAACG GCGAAATCAT GCCGAATATC
CCGCAGATGA GCGCCTTCTG GTATGCCGAG CGCAGTGCGG TGATCAACGC CGTCAGCGGC
CGCCAGACGG TGAAAGCCGC GCTGGATGAC GTACAAACCC GTATCACCAA GTAA
 
Protein sequence
MTRSFTTART LVLSALTTLV LSSSAFAKIE EGKLVIWING DKGYNGLAEV GKKFEKDTGI 
KVTIEHPDKL EEKYPQVAAT GDGPDIIFWA HDRFGGYAQS GLLAEIHPSK AFQDKLFPFT
WDAVRYDGKL IGYPIAVEAL SLIYNKDLIK QAPKTWEEIP ALDKELRAKG KSAIMWNLQE
PYFTWPIIAA DGGYAFKYEN GKYNIKDVGV ANTGSQAGLQ FIVDLVKNKH INADTDYSIA
EAAFNKGQTA MTINGPWAWS NIEQSKINYG VTLLPTFKGK PSKPFVGVLT AGINAASPNK
ELATEFLENY LLTNEGLADV NKDKPLGAVA LKSYQEQLAK DPKIAATMEN SQNGEIMPNI
PQMSAFWYAE RSAVINAVSG RQTVKAALDD VQTRITK