Gene Spro_4043 details

Gene Information

Locus tag: Spro_4043
Symbol:
ID: 5606034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4479915
End bp: 4481060
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 58%
IMG OID: 640939603
Product: adenine DNA glycosylase
Protein accession: YP_001480266
Protein GI: 157372277
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
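
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 4479915
END_BP = 4481060


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1146 381
```

Both values agree with the Gene Length and Protein Length fields listed in the record.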


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0975151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAAT CGCGCTTCAT TTTCTGCCGG ATATCGAATC TGCTTATGAT GCAAGCACAA 
CAGTTCGCAC AGGTGGTGCT TGACTGGTAC CAGCGTTACG GCCGTAAAAC CCTGCCGTGG
CAGCTTGATA AAACCGCCTA TAAAGTATGG CTCTCTGAGG TCATGTTGCA ACAAACTCAG
GTTGCCACCG TGATCCCTTA CTTTGAACGC TTTATGGCAC GTTTTCCCAA CGTGCGTGCG
CTGGCAGAAG CGCCGCTGGA CGAAGTGCTG CACCTGTGGA CCGGCCTGGG TTACTACGCC
CGTGCTCGCA ACCTGCACAA GGCCGCGCAG ACTATTGTCG CACAGCACGG CGGCGAGTTC
CCGACAACCT TTGAAGAAAT CCACGCCCTG CCCGGCATTG GCCGCTCAAC GGCCGGTGCG
GTACTGTCAT TGGCACTCGG CCAGCATTAC CCGATCCTCG ACGGCAACGT GAAACGCGTG
TTGGCTCGCT GCTATGCAGT CGAAGGCTGG CCGGGCAAAA AAGAGGTCGA AAACCGGCTG
TGGCAGATCA GCGAAGACGT CACCCCGGCG CAGGGCGTCG GCCAGTTCAA TCAGGCGATG
ATGGACCTGG GGGCGATGGT TTGCACCCGC TCCAAACCCA AGTGCGAGCT GTGCCCGCTC
AACCTCGGCT GCATCGCCTA TGCCCATCAC AGTTGGGCGA AATACCCCGG CAAAAAGCCC
AAGCAGACGC TGCCGGAAAA AACCGCCTAC TTTTTATTGC TGCAACACGG CGAACGAGTC
TGGCTGGAAC AGCGCCCGGC CGTCGGCTTA TGGGGCGGCC TGTTCTGCTT CCCGCAGTTC
GGCGAACGCG AAGAGATGGA ACTCTGGCTG CAACAACGCG GTCTGAACAA CAATCGCCAA
CAGCAGTTGA CCGCATTTCG TCATACTTTC AGTCATTTCC ATCTCGATAT CGTGCCGATA
TGGTTGGAAA TGAACGACGC GGCGGCCAGC ATGGATGAGG GCGCCGGTCT CTGGTATAAC
TTGGCGCAGC CGCCATCGGT CGGGCTGGCA GCGCCGGTTG ACCGCCTGTT ACAACAGTTG
GCAAAACAGT CCCCGCGCCA ACAGGGTTTA TTTGGCGATA GAGCCATTGA TGAGGAATTA
GCATGA
 
Protein sequence
MLQSRFIFCR ISNLLMMQAQ QFAQVVLDWY QRYGRKTLPW QLDKTAYKVW LSEVMLQQTQ 
VATVIPYFER FMARFPNVRA LAEAPLDEVL HLWTGLGYYA RARNLHKAAQ TIVAQHGGEF
PTTFEEIHAL PGIGRSTAGA VLSLALGQHY PILDGNVKRV LARCYAVEGW PGKKEVENRL
WQISEDVTPA QGVGQFNQAM MDLGAMVCTR SKPKCELCPL NLGCIAYAHH SWAKYPGKKP
KQTLPEKTAY FLLLQHGERV WLEQRPAVGL WGGLFCFPQF GEREEMELWL QQRGLNNNRQ
QQLTAFRHTF SHFHLDIVPI WLEMNDAAAS MDEGAGLWYN LAQPPSVGLA APVDRLLQQL
AKQSPRQQGL FGDRAIDEEL A
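
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch of that relationship, the code below strips the ten-residue block spacing and translates the first line of the gene sequence. It assumes the standard NCBI codon ordering (bases in TCAG order), whose amino acid assignments table 11 shares with the standard code. Table 11 also allows alternative start codons, which this sketch ignores because the gene begins with ATG. The names `clean` and `translate` are illustrative helpers, not part of any library.

```python
# Codon table in the standard NCBI ordering (bases in TCAG order).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
gene_excerpt = "ATGCTGCAAT CGCGCTTCAT TTTCTGCCGG ATATCGAATC TGCTTATGAT GCAAGCACAA"
protein_excerpt = "MLQSRFIFCR ISNLLMMQAQ"

translated = translate(clean(gene_excerpt))
print(translated)  # MLQSRFIFCRISNLLMMQAQ
print(translated == clean(protein_excerpt))  # True
```

Applied to the last six bases of the gene (`GCATGA`), the same function returns `A` and stops at the TGA stop codon, which matches the final residue of the protein sequence.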