Gene Spro_1604 details


Gene Information

Locus tag: Spro_1604
Symbol:
ID: 5603756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1758001
End bp: 1759089
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 46%
IMG OID: 640937136
Product: glycosyl transferase group 1
Protein accession: YP_001477836
Protein GI: 157369847
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
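The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1758001
end_bp = 1759089
gene_length_bp = 1089
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with each other: 1089 bp is 363 codons, and dropping the stop codon leaves 362 residues.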


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000498756
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0753767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATAG ATTACGTTAT TACCGGTTTA GAAATTGGTG GCGCTGAATA TCAGGTTGTT 
TCTCTACTAG AGAGGCTGGT TCAAAAAGGT CATCAGATGC GATTGATTTC ACTGACGCCA
CCGTCATCGG CTGTTTTTAT TGGAAGGCTC GCCGCTGCTG GCGTTCCATT AATATCCTTG
GAAATGAAAT CGGCCAAGGA CTTACCGTTG GCGTTACTAC GCTTGTGCAA AATATTGCGC
CAATCAAAAC CGGATATTGT GCATGCCCAT ATGGTGCACG CCAACCTTTT AACGCGCCTG
GCGCGGCTAT TCCTACCTTC GTTGAAAATA ATCTCTACCG CACATAATAC ACACGAAGGC
GGCAAACTTA GAGACTGGGC TTATCGTCTG ACAAATCCCC TATGTCAGAT AAACACGACT
ATCAGCGAAG CCGCAACCTA CAGATTCACT AATGAAAATG TGCTACCGCA GCATAATACA
TATACCGTTT TCAACGGAAT TGATACGGAT AGGTTTATGC CGCCGACGAT AAAAAAGGTG
ACAACCTCCC CTTTCTGTTG GCTCGCAGTA GGGCGACTCG TTGAGCAAAA GGATTACCCG
ACACTCATTC AAGCATTCAC GCATATTTCA AGCGGAAAAT TACTGATCGC TGGGCAAGGG
CCACTGGATG CAACCTTGAA ACAGCTGGTA AAACACTACG GTATCGAAGA TCGGGTTGAG
TTTATCGGGT TGAGTGATGA TACTGCACAA TTATACCGGC AGGTCGACGG CTTCGTTTTG
TCATCCGCTT GGGAAGGCTA TGGTTTAGTC GTTGCGGAAG CTATGTCTTC GGAACTTCCG
GTAGTTGTTA CTGATAGCGG CGGGCCGCGT GAGATTGTCG GCAGTGATGG GACGTCAGGA
TTGATAGTGC CAATTAAAGA TCCATTGGCT TTAGCGAATG CCCTGATTTC TATCGAAAAA
ATGCCAGCCC ATGAAAGAGA AAAAATGGGT GCTATCGCGC GGACCAGAAT ACAGGAAAAA
TTCTCATTGA ATGAAATAGT TACTCAGTGG GAGAACATTT ACACAGAGCT GAAAAATAAG
AACAAATAG
 
Protein sequence
MQIDYVITGL EIGGAEYQVV SLLERLVQKG HQMRLISLTP PSSAVFIGRL AAAGVPLISL 
EMKSAKDLPL ALLRLCKILR QSKPDIVHAH MVHANLLTRL ARLFLPSLKI ISTAHNTHEG
GKLRDWAYRL TNPLCQINTT ISEAATYRFT NENVLPQHNT YTVFNGIDTD RFMPPTIKKV
TTSPFCWLAV GRLVEQKDYP TLIQAFTHIS SGKLLIAGQG PLDATLKQLV KHYGIEDRVE
FIGLSDDTAQ LYRQVDGFVL SSAWEGYGLV VAEAMSSELP VVVTDSGGPR EIVGSDGTSG
LIVPIKDPLA LANALISIEK MPAHEREKMG AIARTRIQEK FSLNEIVTQW ENIYTELKNK
NK
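
Both sequences are laid out in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up, together with a simple GC-fraction helper. The function names `join_blocks` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was computed, so the helper illustrates one common definition rather than the database's method.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (one common definition)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAAATAG ATTACGTTAT TACCGGTTTA GAAATTGGTG GCGCTGAATA TCAGGTTGTT"
last_line = "AACAAATAG"

fragment = join_blocks(first_line)
start_codon = fragment[:3]
stop_codon = join_blocks(last_line)[-3:]

print(len(fragment), start_codon, stop_codon)
```

To process the full gene, pass all of the gene sequence lines to `join_blocks` as one string and then call `gc_fraction` on the result.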