Gene Spro_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2079 
Symbol 
ID5606586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2272721 
End bp2274091 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content62% 
IMG OID640937617 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_001478310 
Protein GI157370321 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCTT ATTTTGCCTC ACGCGCTCTG CTTCCTGAAG GCTGGGCGCA TAACGTTCGG 
TTGGACGTTA ATGCGCAAGG TCACCTGACC CAGGTGATTG CCGATGCCGA TCCTGAGGGC
TGTACCCGAC TGCACGGCGA CGTAGTGCCG GGTATGCCGA ACCTTCATTC TCACGCCTTT
CAGCGTGCGA TGGCCGGGTT GGCGGAGGTG GCGGGCAATC CGCAGGACAG TTTCTGGACC
TGGCGCGATC TGATGTACCG CCTGGTACAG CGGTTGACGC CGGAGCAGGT CGAGGTCATT
GCACGCCAAC TGTATATCGA AATGCTGAAA GGCGGATACA CCCAGGTGGC AGAGTTCCAC
TATCTGCATC ATGGTGCCGA CGGCAATCCT TATGCCGATC GTGGCGAAAT GACCGGTCGG
CTCAGTCAGG CGGCAGCGCA GGCGGGGATC GGTATGACGC TGTTGCCGGT GCTGTACAGC
TACGCCGGGT TCGGCGGCCA GCCTGCCCAG CCGGGGCAAA GACGTTTTAT TCAGGATGCG
GACGGCTATC TGGAACAGCA GCAGGTGATC GCACGGCAGT TGGCTGATCA ACCCCTGCAG
AACCAGGGCT TGTGTTTCCA TTCGCTGCGT GCGGTGGAAC TGGGCCAGAT GCAGCAGATC
CTGGCGGCGT CGGACAGCAC GCTGCCGGTG CATATCCATA TCGCCGAGCA GCAAAAAGAG
GTCAACGACT GCCTGGCCTG GAGCGGTCGT CGTCCGGTGG CCTGGCTGTA TGAACACCTG
CCGGTGGATA GCCGCTGGTG TCTGGTGCAC GCTACCCACC TTGATCGCGA CGAGCTCGAG
CAGTTGGCGC GCAGCAAGGC GGTGGCGGGC CTGTGCCTGA CCACCGAAGC CAATCTGGGC
GACGGTATTT TCCCCGGTGA CGCTTATCTG CAACATCAGG GCCGTTGGGG CATAGGTTCC
GACAGCCATG TATCGCTCAA CGTGGTAGAG GAGCTGCGTT GGTTCGAATA CGGCCAGCGC
CTGCGTGACC AGCGGCGCAA CCGCCTGACC ACGCCGGAGC AGCCTGCGGT GGCTGACGTG
CTTTATCAGC AGGCCCTGCA GGGTGGGGCA CAGGCCTGTG GCGCGCCGAT CGGTCGTTTG
CAAAGCGGCT ACCGTGCCGA CTGGCTGGTG CTGGACGGTG ACGATCCTTA CCTGGCCGCC
GCACCGGATG CCTCCATTCT GAATCGCTGG TTGTTTGCCG GGGGTAAAGA ACAAATTCGC
GACGTCTTTG TCGGCGGTCG TCAGGTGATC GACCAGGGGC GTCACGCGTT ACAGCAACAG
AGCAGCACGG AGTTCTTGCA GGTGCTGAAA ACCTTCCAGC AGGAGGCATG A
 
Protein sequence
MPAYFASRAL LPEGWAHNVR LDVNAQGHLT QVIADADPEG CTRLHGDVVP GMPNLHSHAF 
QRAMAGLAEV AGNPQDSFWT WRDLMYRLVQ RLTPEQVEVI ARQLYIEMLK GGYTQVAEFH
YLHHGADGNP YADRGEMTGR LSQAAAQAGI GMTLLPVLYS YAGFGGQPAQ PGQRRFIQDA
DGYLEQQQVI ARQLADQPLQ NQGLCFHSLR AVELGQMQQI LAASDSTLPV HIHIAEQQKE
VNDCLAWSGR RPVAWLYEHL PVDSRWCLVH ATHLDRDELE QLARSKAVAG LCLTTEANLG
DGIFPGDAYL QHQGRWGIGS DSHVSLNVVE ELRWFEYGQR LRDQRRNRLT TPEQPAVADV
LYQQALQGGA QACGAPIGRL QSGYRADWLV LDGDDPYLAA APDASILNRW LFAGGKEQIR
DVFVGGRQVI DQGRHALQQQ SSTEFLQVLK TFQQEA