Gene Information |
Locus tag | Tpau_3052 |
Symbol | |
ID | 9157223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3161566 |
End bp | 3162777 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003647984 |
Protein GI | 296140741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00448111 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAGG AAAACAGCGA CTCAGGGCCT GGCGGTGATG TGGTCGCCGG GCTGTCCGCG TTGGAGCGGT TGCGTGCGCG GGTGGTGTTC GATCAGTACC GGTTGATCGT CGAGTTGCTG CGCACGCGGG TGTGTGAGCG GATCGCGGCG GGTGTCGCGC AGGAGCGGTG GGAGGCCGGT GTCGCGTCCG AGGTGGCGCT CGCACTGCGG GTCTCGCCGC ATCGGGCCGC GGCGATGCTG TCGCGGGCGA CGGTGTTGGA ACGGGACCTG CCCGCCACGC TGGGTCGGTT GCGTGATGGT GACCTCTCAC CGGAGGCGGT GGAGGTGATC GTGTCCGGGG TCTCGCACCT GGAACCCCGC CTGAAGACCG AAGCCGACAA CGAGTTGTGC GGGGAGAGCT TCGTGGCGTC CGGGCTCGGG CTGAGACGGT TGCAGGATCA GGTCAAAGAG GTGGCTTATC GGCTCGATGC CCGGGCCACG GTCGATCGTG CGGCGCTGGC GGCGAAAGAT CGTCGCGTCA CGATCCGTCC GGCACCGGAC TGCATGGCGC GAGTATCGAT CCTGCTGCCG GTCGCCCAAG CCGTCGGCGT CTATGCGGCG GTGAAGACGG CCGCCGACAG CATCTTCGGC ACCCCAGGCG AGAGCCGCAG TCGCGGACAG ATCATGGCCG ACACGGCTTT CGCCAGGATC ACCGGCCGCG ACATCGCGGA CGGTCTGCCC GTCACGGTGA ACCTCACCAT GCCCGCAGCG GTGCTGCTCG GCGACCAGCC GGGAACCGCG CACCTGGCCG GTGGCGGCAC ACTGCCGGGT GAGATCGCAC GGCACCTCGT CGGGCGAGCC ACTGAGCATG CGGTGGCGTG GGTCAAACGG CTCTACGTGC GACCGGAGTC GGGTGCCGTG GTCGGGCTCG ATTCCCGGTC GCGGCTGTTC CCGGCCGGAT TGGCGGAGCT GATCGCCGCG CGGGATCGGT ACTGCCGCAC CCCGTACTGC GACGCCCCGA TCGCCCACAC CGACCACATC ATTCCGGACG CGCACGGTGG CCGCACCAGC CTGGAGAACG GGCAGGGGCT GTGCGCGGCG TGCAACTACG CCAAAGAAGC AGCGGGCTGG TCCAGCCGCA CCGTCGACGA CAGCAGCGGG CGGCACACCG TCGAAACCCA CACCCCGACA GGACATCTGC ACCGGTCCAC CGCACCACCG CAGGCGGCGT GA
|
Protein sequence | MEEENSDSGP GGDVVAGLSA LERLRARVVF DQYRLIVELL RTRVCERIAA GVAQERWEAG VASEVALALR VSPHRAAAML SRATVLERDL PATLGRLRDG DLSPEAVEVI VSGVSHLEPR LKTEADNELC GESFVASGLG LRRLQDQVKE VAYRLDARAT VDRAALAAKD RRVTIRPAPD CMARVSILLP VAQAVGVYAA VKTAADSIFG TPGESRSRGQ IMADTAFARI TGRDIADGLP VTVNLTMPAA VLLGDQPGTA HLAGGGTLPG EIARHLVGRA TEHAVAWVKR LYVRPESGAV VGLDSRSRLF PAGLAELIAA RDRYCRTPYC DAPIAHTDHI IPDAHGGRTS LENGQGLCAA CNYAKEAAGW SSRTVDDSSG RHTVETHTPT GHLHRSTAPP QAA
|
| |
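The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: the function names and the `record` dictionary are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence above ends in TGA).

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """GC fraction of a sequence; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values copied from the Gene Information table (hypothetical container).
record = {
    "start_bp": 3161566,
    "end_bp": 3162777,
    "gene_length_bp": 1212,
    "protein_length_aa": 403,
}

# Coordinates and gene length agree: 3162777 - 3161566 + 1 = 1212.
assert span_length(record["start_bp"], record["end_bp"]) == record["gene_length_bp"]

# 1212 bp = 404 codons; dropping the stop codon leaves 403 residues.
assert expected_protein_length(record["gene_length_bp"]) == record["protein_length_aa"]

# GC fraction of the first two 10-base blocks only; pass the full gene
# sequence to compare against the reported GC content.
print(gc_fraction("ATGGAAGAGG AAAACAGCGA"))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content of 70%, which is presumably rounded.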