Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2416 |
Symbol | |
ID | 8448027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2668090 |
End bp | 2668956 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041535 |
Product | DNA-formamidopyrimidine glycosylase |
Protein accession | YP_003201779 |
Protein GI | 258652623 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.000221357 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00527912 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCGAAC TGCCCGAGGT CGAGTCCGCG CGCCGGGTGC TGGCCGACGG CGCACTGCAC CGCCGCATCG CCGACGTGGA CGATCACGAC GACTACGTCA CCCGCCCGTT GACGCCGGGG GCGCTGCGCT CGGCCCTGAT CGGCCGGACC TTCACCGCCG CGCACCGCCG CGGCAAATCG ATGTGGTTGA CCGTCTCCGG CGACCGCGAC GACCCCACCG ATCCCGATGC TTCGCCCGGC GACCCCGACC TGGGCATCCA CCTGGGCATG AGCGGCATCG TGGTCGTCAC CGGGCCGAGC GCACCCGAGG CGAGCGGCAC CGACCTGGTC GGCGGTGACT ACCGGCGGGA CCGCGAGCAG TTCGTGGATC GCGGCGCGTA CCAGCGGTTC GCGGTGACCT TCGCCGACGG GGGACGGATG CGGCTGCTGG ACCCGCGCCG GTTGTCCCGG GTGCGGCTGG ATCCGGACAT CCAGGCCCTG GGGCCGGACG CCCTGGGGTT GTCGCCGACG GCATTCCGGA CCGCGATGAC CGCCGGCCGC CGGGTGTCCA CGGCCCCGGT CAAGGCCCGG TTGCTGGATC AATCGGTGCT GGCCGGGGTG GGCAACCTGC TGGCCGACGA GGCACTGTGG CGGGCCAAGA TCAACCCGGG CCGCGGGGTG GACACCCTGT CGACCGCCCA GCTGAACCGG CTCGGGCGGG CGGTGCAATC GGCCCTGACC GATGCCATCG CCCGGGGCGG CGTGCACACC GGCGACGTCA TCGCCGCCCG GAAGTCGGGG GCCCGCTGCC CGCGCTGCGG CGGCGCGATG GTGTCCGGCG TCGTCGGGGG CCGGACGACC TGGTGGTGCA GCAAGGAGCA GCGGTAG
|
Protein sequence | MPELPEVESA RRVLADGALH RRIADVDDHD DYVTRPLTPG ALRSALIGRT FTAAHRRGKS MWLTVSGDRD DPTDPDASPG DPDLGIHLGM SGIVVVTGPS APEASGTDLV GGDYRRDREQ FVDRGAYQRF AVTFADGGRM RLLDPRRLSR VRLDPDIQAL GPDALGLSPT AFRTAMTAGR RVSTAPVKAR LLDQSVLAGV GNLLADEALW RAKINPGRGV DTLSTAQLNR LGRAVQSALT DAIARGGVHT GDVIAARKSG ARCPRCGGAM VSGVVGGRTT WWCSKEQR
|
| |