Gene Spro_4678 details


Gene Information

Locus tag: Spro_4678
Symbol:
ID: 5606520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 5168690
End bp: 5170324
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 59%
IMG OID: 640940244
Product: O-antigen polymerase
Protein accession: YP_001480899
Protein GI: 157372910
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
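
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5168690
end_bp = 5170324
gene_length_bp = 1635
protein_length_aa = 544

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. Assuming the last
# codon is a stop codon, it does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span matches the listed gene length and the codon count minus one matches the listed protein length.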


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATCCA AATCCAAAAC AGCCTGGTTA TTCGGGCTGG CGGCGTTTTA TTGCCTGATT 
GCCATGCATA TTTACTGGCC CAATCGCGGC GGCAGCGGTT TTTATCTGCC GTGGAATCTG
GTTGGTGGGA TATTTATCGC CCTGACGATC CTCGGCACCC TGCTGTTTTG CCGACCGCCG
CTGGCGGTCT CCGGCTTTTT TAACCGGCTG GCGCTGGGGG GCCTGATTTT ATTTTTACCG
CTGCTGTGGG CGCAACAACC CTGGCTAAGC GAAGCCCTAC CGCGCCTGAT GGGTCTGGCA
CTGGGCATCA TGGCCTACTT CGCACTGCTG CAGATCCCGC TCAGCCGCCA AGGGCGGCGC
AGGCTGCTGA CTCTGTTATT GGCGGCAACG GTGATCGAAG CCCTGTTCGG CCTGGTGCAA
TACAGCCTGC TGCAGCCGGG TAACTGGATC GGCTATAACA GCCTGAAAAA CCGCCCTTAC
GGCATCTTCC AGCAGTGGAA CCTGATGGCC AGCTTTATGG CGACCGGTCT GGCGCTGGCG
CTGTATTTGC TCAGCAGTCG TCGCCCCTTG TCGCGTAGCC TGCAATGGTT GAGCGCCACC
ATGTTGGTAC TGGCACCGCT GTTGCTGGTG GTTATTGCTT CGCGCGTCGG GCTGCTGGCT
GCTCTGCTGC TGTCACCACT GCAACTGTGG ATGCTATATC GCCTTAATCG CCGCCGCGCC
ACCCTCTCCC TGTTGCTACT GCTAGCCGGG GTAGCCGCCG GGGTACTGTT GGTGCTGCTC
AACGGTGCGA CACGGGCGGT CACAGTGACA GAACCGATTT TCTACCGGTT GGCTTACTGG
CAAGAAGCAC TGCGCATGAT CGCCGAACGC CCGTGGTTTG GCTGGGGTTA CGGCCATTTT
CAGCACGATT TCCTGCATCA TTTCTACACC ACCCATAGCA GTGGAATGGA AAGCGTCGCC
ATCAGCCACC CACACAACGA AATTTTGCTG TGGGGCATCG AAGGTGGCCT GCTCGGCCTG
AGCGGCATTG TCATGGTCGG TTGGGGATTA TGGTGTTTGC TGCGGCGCAC TCGAGTACTG
CCACTGCGCC CTGCCCCCTG GATGGCTGCG CTGCCAATCT TGTTGCATAT GATGGTGGAG
TACCCACTTT ATCTTTCCGC CGCCCACGCC GTGCTGCTGC TGGCAATCTT GCGCGCGGGT
GACGTACGCC GCCGCTGGCG GTTACCTCGC TGGCCGCAAC AGACACTGCG TCTGCTTATC
GGTGCCGCTG CCTTACTGAT CCTGCCCTAT CTGTTCAACG GCCTGCACAG CGCACTGATC
GTTACCGCAG TGGAGAAGAG CGGCCTGCGG CAGTTTGGCC CCATGAGCCG GGTGATAACG
CCGACGCCCT GGCAGGTACG TTATGACTAC GACGTCCAGT TGCAGCGGCT GCTGCAATAT
CCGCAAACCC GCGATACCGC CACGCTGTTG AGCTACCGGC AGTGGGCAGA AAACGAAATC
CGCGTGCGGC CGGACGCCAA TATCTACATC AATCTGGTAG CGGTCAGCCG CTTACTGCAA
CAGCCCCAGC GGGCCGCCGA ACTGCGACAT CAGGCTCGTC GACTGTTCCC GCACGATATG
CGTTTTGAGG AGTAA
 
Protein sequence
MISKSKTAWL FGLAAFYCLI AMHIYWPNRG GSGFYLPWNL VGGIFIALTI LGTLLFCRPP 
LAVSGFFNRL ALGGLILFLP LLWAQQPWLS EALPRLMGLA LGIMAYFALL QIPLSRQGRR
RLLTLLLAAT VIEALFGLVQ YSLLQPGNWI GYNSLKNRPY GIFQQWNLMA SFMATGLALA
LYLLSSRRPL SRSLQWLSAT MLVLAPLLLV VIASRVGLLA ALLLSPLQLW MLYRLNRRRA
TLSLLLLLAG VAAGVLLVLL NGATRAVTVT EPIFYRLAYW QEALRMIAER PWFGWGYGHF
QHDFLHHFYT THSSGMESVA ISHPHNEILL WGIEGGLLGL SGIVMVGWGL WCLLRRTRVL
PLRPAPWMAA LPILLHMMVE YPLYLSAAHA VLLLAILRAG DVRRRWRLPR WPQQTLRLLI
GAAALLILPY LFNGLHSALI VTAVEKSGLR QFGPMSRVIT PTPWQVRYDY DVQLQRLLQY
PQTRDTATLL SYRQWAENEI RVRPDANIYI NLVAVSRLLQ QPQRAAELRH QARRLFPHDM
RFEE
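
The sequences above are laid out in blocks of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, with a small GC-percentage helper; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATATCCA AATCCAAAAC AGCCTGGTTA TTCGGGCTGG CGGCGTTTTA TTGCCTGATT"
last_line = "CGTTTTGAGG AGTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The gene should begin with a start codon and end with a stop codon.
print(len(head), head[:3], tail[-3:])
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` would give a value to compare against the GC content field, which is not recomputed here.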