Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4043 |
Symbol | |
ID | 5606034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 4479915 |
End bp | 4481060 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640939603 |
Product | adenine DNA glycosylase |
Protein accession | YP_001480266 |
Protein GI | 157372277 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0975151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAT CGCGCTTCAT TTTCTGCCGG ATATCGAATC TGCTTATGAT GCAAGCACAA CAGTTCGCAC AGGTGGTGCT TGACTGGTAC CAGCGTTACG GCCGTAAAAC CCTGCCGTGG CAGCTTGATA AAACCGCCTA TAAAGTATGG CTCTCTGAGG TCATGTTGCA ACAAACTCAG GTTGCCACCG TGATCCCTTA CTTTGAACGC TTTATGGCAC GTTTTCCCAA CGTGCGTGCG CTGGCAGAAG CGCCGCTGGA CGAAGTGCTG CACCTGTGGA CCGGCCTGGG TTACTACGCC CGTGCTCGCA ACCTGCACAA GGCCGCGCAG ACTATTGTCG CACAGCACGG CGGCGAGTTC CCGACAACCT TTGAAGAAAT CCACGCCCTG CCCGGCATTG GCCGCTCAAC GGCCGGTGCG GTACTGTCAT TGGCACTCGG CCAGCATTAC CCGATCCTCG ACGGCAACGT GAAACGCGTG TTGGCTCGCT GCTATGCAGT CGAAGGCTGG CCGGGCAAAA AAGAGGTCGA AAACCGGCTG TGGCAGATCA GCGAAGACGT CACCCCGGCG CAGGGCGTCG GCCAGTTCAA TCAGGCGATG ATGGACCTGG GGGCGATGGT TTGCACCCGC TCCAAACCCA AGTGCGAGCT GTGCCCGCTC AACCTCGGCT GCATCGCCTA TGCCCATCAC AGTTGGGCGA AATACCCCGG CAAAAAGCCC AAGCAGACGC TGCCGGAAAA AACCGCCTAC TTTTTATTGC TGCAACACGG CGAACGAGTC TGGCTGGAAC AGCGCCCGGC CGTCGGCTTA TGGGGCGGCC TGTTCTGCTT CCCGCAGTTC GGCGAACGCG AAGAGATGGA ACTCTGGCTG CAACAACGCG GTCTGAACAA CAATCGCCAA CAGCAGTTGA CCGCATTTCG TCATACTTTC AGTCATTTCC ATCTCGATAT CGTGCCGATA TGGTTGGAAA TGAACGACGC GGCGGCCAGC ATGGATGAGG GCGCCGGTCT CTGGTATAAC TTGGCGCAGC CGCCATCGGT CGGGCTGGCA GCGCCGGTTG ACCGCCTGTT ACAACAGTTG GCAAAACAGT CCCCGCGCCA ACAGGGTTTA TTTGGCGATA GAGCCATTGA TGAGGAATTA GCATGA
|
Protein sequence | MLQSRFIFCR ISNLLMMQAQ QFAQVVLDWY QRYGRKTLPW QLDKTAYKVW LSEVMLQQTQ VATVIPYFER FMARFPNVRA LAEAPLDEVL HLWTGLGYYA RARNLHKAAQ TIVAQHGGEF PTTFEEIHAL PGIGRSTAGA VLSLALGQHY PILDGNVKRV LARCYAVEGW PGKKEVENRL WQISEDVTPA QGVGQFNQAM MDLGAMVCTR SKPKCELCPL NLGCIAYAHH SWAKYPGKKP KQTLPEKTAY FLLLQHGERV WLEQRPAVGL WGGLFCFPQF GEREEMELWL QQRGLNNNRQ QQLTAFRHTF SHFHLDIVPI WLEMNDAAAS MDEGAGLWYN LAQPPSVGLA APVDRLLQQL AKQSPRQQGL FGDRAIDEEL A
|
| |