Gene Spro_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4239 
Symbol 
ID5602867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4702973 
End bp4704388 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content49% 
IMG OID640939799 
ProductN-acetylglucosamine-binding protein A 
Protein accessionYP_001480461 
Protein GI157372472 
COG category[S] Function unknown 
COG ID[COG3397] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.225029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGA AAAAAATCGC GCTGGCAGTA GCTGCATTAA CTCTCTCCAG TGGTGCTCTG 
GCGCATGGTT ATGTTGAATC GCCGCAAAGC CGGGCCTATA AATGTAATTT GCAGGAAAAT
ACCGATTGCG GCGCCGTGCA ATACGAGCCG CAAAGCGTTG AGAAAGACTC CGGCTTCCCG
ACCGGGCCTT TACCGCGCGA TGGCGAACTG GCCAGCGCCA GCATTCCTCA TTATTCCCCG
CTTGATAAGC AAAGTATGAA TATGTGGGCA AAAAATCCCA TCAAGGCGGG AATGAATACG
TTTACCTGGT TCCATACTGC CATGCACAAA ACCAATAACT GGCGGTATTA CATCACCAAA
CAGAATTGGG ACGCCAATCA GCCGCTGACT CGCGCGGCAT TCGAAGCGAC GCCTTTCTGC
CAAGCTGAGG GCCATGGACA GATGCCGGCC CAGCGGGCAG TTCATGAATG TAACGTGCCG
CAGCGCAGTG GTTATCAGGT GATTTACGGC GTCTGGGAAA TTGCCGATAC CGTAAACAGT
TTCTATCAGG TTATTGACGT CGATTTTGGC GGAGGTGAAG GCATCACGTC ACCGTGGAGT
AAGCAGCTTT CAGGGCAGGT GTCTGGTAAG GATTTGAAAA AAGGCGATAA GGTTATAGCT
CGATTCTTTG ACGATCAGGG AGAGGTAACG TCTCTGCGCA GCGAAATGAC CATTGATAAT
CAGAAGCAGA CGGATAAAAA CCGTTGGTCA CACGATCTGG CGGTATTGAT TAATGCGAAA
CATGATGATG TACGTATTGG CGTAAAGGAC ACACAAGGAA GTGTGAATCC GGTTTACGGC
AACAACAGTG CTTTCGTCAA GGATGACAGC CGTTTGAGCA AGGTGGTCAT GTCTTATGAA
GAACAGGCCC CAGGCATTGA AGAAGAAGTG GAAATCTCCG GCGTTCAGGT GGATAAAATC
CAGGATGGTC ACGCCAACGT GAGTTTCAAC GTTAACGTCA AGGGGGAAGT CACCTTTGAA
GCCCGGGTGT TCGATCACCA TGGCAGTGAG AAAGGCTACC TGAAGCAAAG CGTAACCGAT
GACCGCCTAT CGATGACTCT GCCGCTGAAC GACGTTACTG CCGGGCACCA TATGCTGAAA
TACTTCGCGA CCAATAAAGA AGGTCAGTTG GTTAAACAGG ATGTGATTAA CCTGCAATTA
GAAGATGCTT CTTCCGGCAA CTATCAATAT ATCTTCCCCG AAGGTCTCGA TAGCTACACG
GCAGGGACGG TGGTGTTACA ACCTAAAAAT GGCAAAACCT ATCAGTGCAA ACCTTTCCCT
TACAGCGGCT ATTGCACACA ATGGGCTAGC GGAAAGGCGC AATTTGAACC GGGTATCGGT
GAACACTGGC AACAAGCCTG GATATTAAAA AACTAA
 
Protein sequence
MELKKIALAV AALTLSSGAL AHGYVESPQS RAYKCNLQEN TDCGAVQYEP QSVEKDSGFP 
TGPLPRDGEL ASASIPHYSP LDKQSMNMWA KNPIKAGMNT FTWFHTAMHK TNNWRYYITK
QNWDANQPLT RAAFEATPFC QAEGHGQMPA QRAVHECNVP QRSGYQVIYG VWEIADTVNS
FYQVIDVDFG GGEGITSPWS KQLSGQVSGK DLKKGDKVIA RFFDDQGEVT SLRSEMTIDN
QKQTDKNRWS HDLAVLINAK HDDVRIGVKD TQGSVNPVYG NNSAFVKDDS RLSKVVMSYE
EQAPGIEEEV EISGVQVDKI QDGHANVSFN VNVKGEVTFE ARVFDHHGSE KGYLKQSVTD
DRLSMTLPLN DVTAGHHMLK YFATNKEGQL VKQDVINLQL EDASSGNYQY IFPEGLDSYT
AGTVVLQPKN GKTYQCKPFP YSGYCTQWAS GKAQFEPGIG EHWQQAWILK N