Gene Spro_1970 details

Gene Information

Locus tag: Spro_1970
Symbol:
ID: 5603421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2153496
End bp: 2155868
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 57%
IMG OID: 640937508
Product: putative glycosyl hydrolase
Protein accession: YP_001478201
Protein GI: 157370212
COG category:
COG ID:
TIGRFAM ID:
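
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2153496
end_bp = 2155868

# Assumption: coordinates are 1-based and inclusive,
# so both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is
# not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2373 bp) and Protein Length (790 aa).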


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.434752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0141312
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATGA TTAATAAACT GCCGCCTTTG GCCATAGCTA TCACGCTGGC AATATCAGGC 
TGCTCTTCTC ACCCGGCGAA TGAGAACGCC TTGCGGGCAG CGGCCTATAA AAACGTCATC
AATCGCAGCG GCGCGCCACA CTACATGCTC GACTACGATT TTGATGAACA CCAACGTTTT
AACCCGTTCT TTGATCTGGG GGCCTGGCAT GGCCACTTGC TGCCGGACGG CCCGGAAGGT
ATCGGCGGTT TCCCCGGCCC GGCGCTGCTT ACCGAAGAAT ACATCAACTT TATGGCCGAC
AACTTCGATC GGCTGAGCCT GTATAAAAAT GGGGAGAAGG TAAACTTCAG CATGACCGCC
TACAGCATCC CGGGGGCGCT TGTCCAACAA CTCAGCGCGC CGGGGATCCA GGTCAATCTG
ACGCTGCGTT TCGCTACCGC ACGCACCTCG CTGCTGGAAA CCCAAATCAT CACCGATACG
CCGCTGGAGC TGGTGTGGGA CGGTGCGCTG TTGGAAAAAT ACTACGCCAA GGAGCAGAAG
CCACAGTCAC CTGAGACCAT CGAGCAGGCA TTTCCGGGCT ACACGCGGCA CATTCTACCG
ACGGCGGACG GCCTGCGGGT CACCTTTGGC AAAGTGCGAG CCTCTTCGCA ACTGATGACC
TCCGGCCAGT CTGAATATCA GATTCATAAA TCACTGCCGC AGCAAACGAC GGTGAACGGC
CATCAGTTTA TCGCCAAAAC CAATATCAAA GGCTCCACCA CCCTCTACAC CACCTACTCT
CACCTGCTCA CCGCCGCTGA GGTACAGCAG GAACAGGGCA AGATTGCCGC TATCCTCGCC
AACCCGCAGC AGTATATGAA TGCTTCCGCG CAACGCTGGG AAAACTATCT CAGCAAGGGG
CTTACCAACC CACACGCCAC CCAGGCACAG GAGCGCGTGG CGGTCAAAGC GATAGAAACC
TTGAATGGCA ACTGGCGCGG CGCGGCCGGG GCGATGAAGT TTGATTCGGT GACGCCTTCG
GTGACCGGAC GCTGGTTCTC CGGCAACCAG ACCTGGCCCT GGGATACCTG GAAACAGGCC
TACGCCATGG CGCATTTTAA TCCTGACGTC GCCAAGGATA ATATCCGTGC GGTGTTCGCC
TTCCAAATCC AGCCGGATGA CCCACTGCGC CCATGGGATG CCGGCTTTAT CCCGGATCTG
ATCGCCTATA ACCCCAGCCC GGAACGCGGC GGCGACGGCA GCAACTGGAA TGAGCGCAAT
ACCAAACCCA GCCTGGCGGC CTGGGCGGTA ATGGAGGTGT ATAACACCAC CGGCGATAAA
CAGTGGCTGG CCGAGATGTA TCCGAAGCTG GTCGCCTACC ACAACTGGTG GCTGCATAAC
CGCGACCATA ACGGCAACGG CGTGCCGGAA TATGGCGCGA CGCGGGATAA GGCGCACAAT
ACGCCAGATG GTCGGATGCT GTTCACCGTT AAGCGCGGGC AGCAGGAGAA AACCCTCGCC
GGACTGAACA ACTATGACCG GGTTGTTCGC TCCGGGCACT ATGACAACAT CGAAATCCCG
GCCCAGGTTG CCGCCTCCTG GGAGTCTGGA CGTGACGATG CCGCTGCCTT TGGTTTTATC
GACCCGGATC AGCTGGCCCG CTACCTGGCA CAAGGTGGAA AGCGGCAGGA CTGGCAGGTG
AAATTTGCGG AAAACCGTGC GGCAGACGGC ACTCTGCTCG GCTATTCCTT ACTGCAGGAG
TCGGTGGATC AAGCCAGCTA CATGTACAGC GACAACAAAT ACCTGGCCAA AATGGCCGAT
ATTCTGGGCC ACAGCGCCGA TGCCGCCACC TTCCGCAGCA AGGCCGACAA GCTGGCGGAT
TACATCAATA GCTGCATGTT CGATTCGTCC AGTGGCTTCT TCTACGACAT TCGCATTGAA
GATAAACCCC TGCCCAATGG CTGTGCCGGC AAACCCATTG TCGAACGCGG CAAAGGGCCG
GAAGGTTGGT CCCCCTTATT TAATGGCGCA GCCAGCCAGC AACATGCCGA TGCGGTGGTG
AAGGTGATGA AGGACTCGCG TGAGTTCAAT ACCTACGTCC CACTAGGTAC GGCGGCACTC
ACCAACCCGG CGTTTGGCGC AGATATTTAC TGGCGTGGAC GAGTCTGGGT AGATCAACTG
TATTTCGGGC TAAAGGGTAT GGAAAGCTAT GGCTACCGTG CCGACGCCGT CGCCATGGCA
CAGGCCTTCT TCAACCACGC CGATGGACTG ATCACCGACG GGCCAATCCG CGAAAACTAT
AATCCGCTGA CCGGCATGCA ACAGGGCGCA CCGAATTTCT CCTGGAGCGC AGCTCACCTC
TACATGCTGT ATAACGACTT TTTCACCCGA TAA
 
Protein sequence
MNMINKLPPL AIAITLAISG CSSHPANENA LRAAAYKNVI NRSGAPHYML DYDFDEHQRF 
NPFFDLGAWH GHLLPDGPEG IGGFPGPALL TEEYINFMAD NFDRLSLYKN GEKVNFSMTA
YSIPGALVQQ LSAPGIQVNL TLRFATARTS LLETQIITDT PLELVWDGAL LEKYYAKEQK
PQSPETIEQA FPGYTRHILP TADGLRVTFG KVRASSQLMT SGQSEYQIHK SLPQQTTVNG
HQFIAKTNIK GSTTLYTTYS HLLTAAEVQQ EQGKIAAILA NPQQYMNASA QRWENYLSKG
LTNPHATQAQ ERVAVKAIET LNGNWRGAAG AMKFDSVTPS VTGRWFSGNQ TWPWDTWKQA
YAMAHFNPDV AKDNIRAVFA FQIQPDDPLR PWDAGFIPDL IAYNPSPERG GDGSNWNERN
TKPSLAAWAV MEVYNTTGDK QWLAEMYPKL VAYHNWWLHN RDHNGNGVPE YGATRDKAHN
TPDGRMLFTV KRGQQEKTLA GLNNYDRVVR SGHYDNIEIP AQVAASWESG RDDAAAFGFI
DPDQLARYLA QGGKRQDWQV KFAENRAADG TLLGYSLLQE SVDQASYMYS DNKYLAKMAD
ILGHSADAAT FRSKADKLAD YINSCMFDSS SGFFYDIRIE DKPLPNGCAG KPIVERGKGP
EGWSPLFNGA ASQQHADAVV KVMKDSREFN TYVPLGTAAL TNPAFGADIY WRGRVWVDQL
YFGLKGMESY GYRADAVAMA QAFFNHADGL ITDGPIRENY NPLTGMQQGA PNFSWSAAHL
YMLYNDFFTR
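
The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the method used to produce this record: the `translate` function and the constant names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons, which is not handled here).

```python
from itertools import product

# Codon table in the conventional T, C, A, G base ordering.
# Amino-acid assignments are those of the standard code, which
# translation table 11 also uses for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Assumption: the sequence starts in frame. Alternative start codons
    are not converted to methionine in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed in this record.
first_line = "ATGAATATGA TTAATAAACT GCCGCCTTTG GCCATAGCTA TCACGCTGGC AATATCAGGC"
last_line = "TACATGCTGT ATAACGACTT TTTCACCCGA TAA"

print(translate(first_line))
print(translate(last_line))
```

Applied to the first and last lines of the gene sequence, this yields the first 20 and last 10 residues of the listed protein sequence, with the final TAA read as the stop codon.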