Gene Spro_1421 details


Gene Information

Locus tag: Spro_1421
Symbol:
ID: 5606609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1552436
End bp: 1553704
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 58%
IMG OID: 640936953
Product: putative substrate-binding periplasmic transport protein
Protein accession: YP_001477653
Protein GI: 157369664
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03407] urea ABC transporter, urea binding protein
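
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1552436
end_bp = 1553704
gene_length_bp = 1269
protein_length_aa = 422


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1553704 - 1552436 + 1 gives 1269 bp, and 1269 / 3 - 1 gives 422 aa, which agrees with the listed values.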


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGTC GTCACTTTAT AAAAGCTTTC GCGCTGTCTG CCAGCATGGT CGGTATGGGG 
ATCGCCTGGA GCGTGCAGGC TGCCGATACC ATCAAGGTCG GCATCCTGAG TTCGCTGTCC
GGCACCATGG CCATTTCTGA AACGCCGCTC AAGGACGTGG CACTGATGAC CATTGATGAC
ATCAATGCCA AGGGCGGCGT ATTGGGTAAA AAACTCGAAC CGGTGGTGGT GGATCCGGCC
TCCAACTGGC CGCTGTTCGC CGAAAAGGCG CGTCAGTTGC TGAGCCAGGA CAAGGTGGCG
GCGGTGTTTG GCTGCTGGAC TTCGGTATCG CGTAAATCCG TGTTGCCGGT GTTTGAAGAG
TTGAACGGTT TGCTGTTCTA CCCGGTGCAA TACGAAGGGG AAGAGATGTC GCCCAATGTG
TTCTATACCG GTGCGGCCCC TAATCAGCAG GCGATCCCGG CGGTGGAATA CCTGCTGAGC
GAAGACGGCG GATCGGCGAA ACGCTTCTTC CTGCTGGGCA CTGACTACGT TTATCCGCGT
ACCACCAACA AGATCCTGCG CGCCTTCCTG CACTCGAAAG GCATTCAGGA TAAAGACATC
GAAGAGGTCT ATACGCCGTT TGGTTACAGC GACTACCAGA CCATTGTCGC CAACATCAAG
AAATTCTCTG CCGGTGGCAA AACGGCGGTG ATCTCCACCA TCAACGGTGA TTCCAACGTC
CCCTTCTACA AAGAGCTGGC CAATCAGGGC ATCAAGGCCA CCGACGTGCC AGTGATCGCC
TTCTCGGTAG GGGAAGAAGA GCTGCGCGGC ATCGACACCA AACCGCTGGT GGGTAACCTG
GCGGCCTGGA ACTACTTCGA ATCGGTGGAT AACCCGACCA ACAAGCAGTT CGTCAGCGAA
TGGCGCGCTT ACGCCAAGGC GCATAACCTG CCGAACTATG CCACCGCCGT GACCAATGAC
CCGATGGAAA CCACCTATGT CGGCATCCAC ATGTGGGCGC AGGCGGTCGA GAAGGCCGGA
ACCACGGACG TGGATAAGGT TCGTGCGGCG ATGGCCGGGC AGACCTTCGC CGCGCCGTCG
GGCTTTACCC TGACTATGGA TGCTACCAAC CATCACCTGC ACAAACCGGT GATGATTGGC
GAGATTGAAG GCAACGGCCA GTTCAACGTG GTGTGGCAAA CCGATGCTCC GGTACGCGCC
CAGCCGTGGA GCCCGTACAT TGCCGGCAAC GACAAAAAGT CGGAAAGCCC GGTAAAAGGC
GGCAAGTAA
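
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used as a short sample rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGCAACGTC GTCACTTTAT AAAAGCTTTC GCGCTGTCTG CCAGCATGGT CGGTATGGGG"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 1269 bp sequence, the same functions can be compared against the listed gene length and GC content.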
 
Protein sequence
MQRRHFIKAF ALSASMVGMG IAWSVQAADT IKVGILSSLS GTMAISETPL KDVALMTIDD 
INAKGGVLGK KLEPVVVDPA SNWPLFAEKA RQLLSQDKVA AVFGCWTSVS RKSVLPVFEE
LNGLLFYPVQ YEGEEMSPNV FYTGAAPNQQ AIPAVEYLLS EDGGSAKRFF LLGTDYVYPR
TTNKILRAFL HSKGIQDKDI EEVYTPFGYS DYQTIVANIK KFSAGGKTAV ISTINGDSNV
PFYKELANQG IKATDVPVIA FSVGEEELRG IDTKPLVGNL AAWNYFESVD NPTNKQFVSE
WRAYAKAHNL PNYATAVTND PMETTYVGIH MWAQAVEKAG TTDVDKVRAA MAGQTFAAPS
GFTLTMDATN HHLHKPVMIG EIEGNGQFNV VWQTDAPVRA QPWSPYIAGN DKKSESPVKG
GK
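
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG). The function and variable names are illustrative.

```python
# Build the codon table from the NCBI-style amino acid string (base order TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 30 bases and the last 9 bases of the gene sequence above.
print(translate("ATGCAACGTC GTCACTTTAT AAAAGCTTTC"))
print(translate("GGCAAGTAA"))
```

The two sample calls correspond to the first ten residues (MQRRHFIKAF) and the last two residues (GK) of the protein sequence listed above.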