Gene Spro_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3524 
Symbol 
ID5605215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3897830 
End bp3898909 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content58% 
IMG OID640939077 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_001479750 
Protein GI157371761 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00772577 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGTT TTCAAACCGC AAACCGTCGT GGGGATCGCG CAGTGACCGA CAAAACCTCT 
CTCAGCTATA AAGACGCAGG TGTCGATATC GATGCTGGCA ATGCATTGGT AGACCGCATC
AAAGGTGTAG TTAAACAGAC CCGCCGCCCT GAAGTGATGG GTGGTCTGGG CGGTTTTGGC
GCCCTGTGTG CGTTGCCGCA GAAATACCGC GAGCCGATAC TGGTTTCCGG TACCGACGGC
GTAGGCACCA AGCTGCGTCT GGCGATGGAC CTGAAACGAC ACGACACCAT CGGCATCGAT
CTGGTTGCAA TGTGTGTGAA CGATTTGGTG GTACAGGGCG CTGAGCCGCT GTTCTTCCTG
GATTACTTCG CGACCGGCAA GCTGGACGTG GACACCGCGG CCAGCGTGAT CACCGGTATC
GCCGAGGGCT GCAAGCAGTC CGGTTGTGCG CTGGTGGGCG GTGAAACCGC CGAAATGCCA
GGTATGTATC ATGGCGAAGA TTACGACGTG GCCGGCTTTT GCGTCGGCGT GGTCGAGAAA
TCCGAAATCA TCGACGGCAG CAAGGTGCAG TCAGGCGATG CCCTGATCGC CCTCGGCGCT
TCCGGCCCGC ACTCCAACGG CTACTCGCTG GTGCGCAAAA TTCTGGAAGT CAGCAACACC
GACCCAACCA CTACCGATCT GGACGGCCAA CCACTGGCTG ACCATCTGCT GGCACCAACC
AAAATTTATG TGAAATCCGT GCTGGAGCTG ATCGAGAAAA TCGACGTGCA CGCTATCGCT
CACCTGACCG GCGGCGGCTT CTGGGAAAAC ATCCCACGCG TACTGCCGGA AGGCATGCAG
GCGGTGATCG ACGAATCCAG CTGGCAGTGG CCGGCCGTCT TCAACTGGCT GCAGCAAACC
GGTAACGTCA GCCGTCACGA AATGTACCGC ACCTTTAACT GTGGCGTGGG CATGGTGATC
GCTCTGCCGG AAGAATCGGT TGAATCCGCC ATCGCATTGT TGACCGCAGC CGGTGAAAAA
GCGTGGAAGA TCGGTAAACT GACCGCCTCT TCTGACGAAC AACAAGTGGT CATCAACTGA
 
Protein sequence
MHSFQTANRR GDRAVTDKTS LSYKDAGVDI DAGNALVDRI KGVVKQTRRP EVMGGLGGFG 
ALCALPQKYR EPILVSGTDG VGTKLRLAMD LKRHDTIGID LVAMCVNDLV VQGAEPLFFL
DYFATGKLDV DTAASVITGI AEGCKQSGCA LVGGETAEMP GMYHGEDYDV AGFCVGVVEK
SEIIDGSKVQ SGDALIALGA SGPHSNGYSL VRKILEVSNT DPTTTDLDGQ PLADHLLAPT
KIYVKSVLEL IEKIDVHAIA HLTGGGFWEN IPRVLPEGMQ AVIDESSWQW PAVFNWLQQT
GNVSRHEMYR TFNCGVGMVI ALPEESVESA IALLTAAGEK AWKIGKLTAS SDEQQVVIN