Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_2983 |
Symbol | |
ID | 5154373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 3110337 |
End bp | 3111827 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640557859 |
Product | putative Uracil-DNA glycosylase |
Protein accession | YP_001239013 |
Protein GI | 148254428 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0229765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGTCG TAACCCTCGA CAGCGACACC GATTTCGACG GCTGGCGGAC GGCGGCGCGG GCGCTGGCGA TGAACGACGT CAGGCCCGCC GACGTCTCCT GGCGTGTCAA AGGCGCCGCG CCCGACCTGT TCGACGATCA GGCCGATACG CAGCTCCCTG AGGTGCCCAA GGGCGCATTC TCGGTGCCGG CGAGCTTCAT GGAGCTGGCC GAAAGCGCGA TCCTGCACCG CGATCCCAAC CGCTTCACCC TGCTCTATCG GCTGTTGTGG CGTCTCAGGA CCAACCACGA TCTGATGGAG ATCGCGACTG ATCCTGACGT GGCGCAGGCA AATGCGCTGG CGCGGGCGGT GCGGCGCGAC GTCCACAAGA TGCACGCCTT CGTCCGCTTC CGCGAAGTCG GACGCGAGCG CGAGGCGCGC TACGTCGCCT GGTTCGAGCC GGACCACCAC ATCGTGGAAC GCGCCGCGCC GTTCTTCGCC CGCCGCTTCG CCGACATGCC CTGGTCGATC CTGACGCCGG AAATGTGCGC CCACTGGGAC GGCAGCCGCG TCTCCTTCAC ACCCGGTGTG AGCAAGGCCG AAGCGCCGTC GGAAGACCGG CTCGAAGAGG TGTGGCTGCG CTATTACGCC AGCATTTTCA ATCCGGCACG GCTGAAAGTG AAGGCGATGG AAAAGGAGAT GCCGAAGAAA TACTGGCAGA ACCTGCCCGA GGCCAAGCTG ATCCAGCCGC TGATCGCGGC CGCCGGCAGG CGCATGTCGG CAATGATCGC CGAGCCCGGC ACCGCGCCGC ACAAGCCGCA GAAACGGATT GAGGAGCCAG CAATGACGCG CAAACCCACG GCACCCGCTG CTGAGACCTC ACCTGATGTC GCCGCGGCCG ATCTTGAGAC CTTGCGTGAA GAAGCCGCTG CCTGCCGCGC CTGCCCGCTG TGGAAGGATG CCACGCAGAC CGTGTTCGGC GAAGGCCCTG ACCATGCGCC GCTGATGCTG GTCGGCGAGC AGCCCGGCGA CAAGGAGGAC CTCGCTGGCC ATCCCTTCGT CGGCCCGGCC GGCCAGATGC TCGACCGGGC GCTGCAGGAG GCCGGCATCG ACCGCAGCAA GGTCTACGTC ACCAATGCGG TGAAGCACTT CAAGTTCGTC CCCCGCGGCA AGATCCGGCT GCACCAGAAG CCTGCGACGC CGGAGATCAA GGCCTGCCGG CAATGGTACG AGCGTGAGCT CGCTACGGTC CGCCCGGCCC TGGTGGTGGC GCTGGGCGCG ACCGCGGCGC AATCGGTGTT CGGCAAGATC ACGCCAGTCA CCAAGAACCG CGGCCACGTC GTCGACCTGC CCGAGGGTCC CAACATGATC GCAACCAAGG CGATGGTCAC GGTGCATCCG TCCTACCTGC TCCGGCTTCC TGATGCCGAC GCCAAGGCGC GGGAGTATGA GCTCTTCGTC AAGGACCTGA AGATCGCCGC GGCCGCGCTC AAGCGCGCGC ATGCGGCCTG A
|
Protein sequence | MHVVTLDSDT DFDGWRTAAR ALAMNDVRPA DVSWRVKGAA PDLFDDQADT QLPEVPKGAF SVPASFMELA ESAILHRDPN RFTLLYRLLW RLRTNHDLME IATDPDVAQA NALARAVRRD VHKMHAFVRF REVGREREAR YVAWFEPDHH IVERAAPFFA RRFADMPWSI LTPEMCAHWD GSRVSFTPGV SKAEAPSEDR LEEVWLRYYA SIFNPARLKV KAMEKEMPKK YWQNLPEAKL IQPLIAAAGR RMSAMIAEPG TAPHKPQKRI EEPAMTRKPT APAAETSPDV AAADLETLRE EAAACRACPL WKDATQTVFG EGPDHAPLML VGEQPGDKED LAGHPFVGPA GQMLDRALQE AGIDRSKVYV TNAVKHFKFV PRGKIRLHQK PATPEIKACR QWYERELATV RPALVVALGA TAAQSVFGKI TPVTKNRGHV VDLPEGPNMI ATKAMVTVHP SYLLRLPDAD AKAREYELFV KDLKIAAAAL KRAHAA
|
| |