Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0807 |
Symbol | |
ID | 9154947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 820788 |
End bp | 822488 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003645782 |
Protein GI | 296138539 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTGC ACGACAGGAT CACCGAGAGT AATATCGAAC ATATGAGCTA CGATGACGCA TTCGATCTCG GACGGCGGGT CACACGGCCG CTGCCGTCCG AGTTCGCCAG TGCGTCGGGG CTCACTCCGG CTGGTGCCGT CGCGCGGCTG CGGGCGATCG CCGCGGAGGC GAATCGATTG GAGGCGGAAC GGCACGCCGT GCTCTCACAG CTGTACCTGC TGCGTGATGA TGGCCGCGCC GTCCGCGCGG AATCCAACCG CCGGTTCGTG GACGACTGGG ACGAGCTGAT CGCCGAATGC GGTGCAGCGC TGGGCGTCGG GCGTGGCGCG GCGTCGGCTG CCGTGCATCG CGCTATTGAT CTGCGCGAAC GGTTTCCGCG GGTGTTCGAA CTGTTCGCCC GCGGTGCGAT CGGCATGCCG CAGGTCCGGG CGGTTCTCCG CACGGCGATC GCTGTCCTGG ACGACGAGGT GGCCGCCGCG CTCGACGGGC GGATCGCCGG GTGGTTCGAG GCCCGGCTGG ACCAGCCGGG AACGGTTCTC ACCGGCCCCA CGGTCGAGCA GGCCGCCACC ACCATCCTGA CCAGGCTCGA TCGCGAAGCG GTGCCCGAGA AACCGTCGGT CGCACCCGCC GCCCGGCTCG AGTTCCACGC CCGCACGGAC GGCGCGGTCG ATCTCGAGGT GGTGATGAGC AAGGCTGAGG GCATTCGACT GTCGAAGTCG ATCGCCGAGA TGGTGCAGAC GGTGTGTCGC AATGACGGCC GCACCTCGGC GCAGCGCCAG GTGGCCGCTC TGGTCGCGCT GGCCGAGGGC TACGAAACCC TTGGTTGCCA GTGCCCCACC GAGAATTGTC CTAACCGCGA GGCACGCCCG CGCAATGGTG CCGTCGCGCA GCAGCTCAAG GCGCTCGCCG TGATCGTGCT GAACGAGTCG GATGCGGTCG CGTCGGGTTC TCCGGCACCG GAGTCGGCCG AGCCGGGTGG CGCCGTGGTC CTCACGGACG ATCCGGGGAT ATCCGGCCCG GTCACCGCCG CGCAGGCGCG GGCACTCATC GACTCGTGCG CCACCTCGGT GCGGGTGCTC GGCCGCCGCG ATCCGGTCAC CGGGAAGATC CACGTCAGGG GCGCATCCGG GTACCGGCCC ACGCAGTACC AGCTACTCGT GCTCCGACTG ACGTATCCCA CCTGCACCTT CCCCGGCTGC TCGGTGCCCT CGTCGGCGTG CCAGATCGAC CACGTCACCG AGTACGACCA CCACGAGTCC GCGGCGGGCG GCCGCACCGA GATCGGGAAC CTGGTTCCGC TCTGCGGTTT CCACCACCGG ATCAAGACGG AGACGGGCTG GCTTTCTGAT GTCCTGCCCG ACGGCGGTGT CGAATGGCAC CACCCCACCG GCGCCACGTG GGTGGTGCCG CCCGGTTCGG CGCGCGATCT CTTTCCCGGG CTCGGCACGC TGGCCTGGGA CACCTCTGCC CGCGACACGG AGCGCGAGGA CCGTGGGGCC GAGCGCCCGA CATCCGGTGG GTTCGGCGGC CACGCCCGCC GTCGCGCCGC GCAGCGCAGG CGTCTGCGCG CGATGCACCG CCGCCTCCGG CACCTGCGAG CCGCGCGGAA GACGCAGGAA CAGGAAGCCG CGCTCCGCGA GGCCGCGAGT AGTCTGGGCC ACGAACCCGA CAGCTTCCTG CCGGCTGCTC CCCCGTTCTG A
|
Protein sequence | MALHDRITES NIEHMSYDDA FDLGRRVTRP LPSEFASASG LTPAGAVARL RAIAAEANRL EAERHAVLSQ LYLLRDDGRA VRAESNRRFV DDWDELIAEC GAALGVGRGA ASAAVHRAID LRERFPRVFE LFARGAIGMP QVRAVLRTAI AVLDDEVAAA LDGRIAGWFE ARLDQPGTVL TGPTVEQAAT TILTRLDREA VPEKPSVAPA ARLEFHARTD GAVDLEVVMS KAEGIRLSKS IAEMVQTVCR NDGRTSAQRQ VAALVALAEG YETLGCQCPT ENCPNREARP RNGAVAQQLK ALAVIVLNES DAVASGSPAP ESAEPGGAVV LTDDPGISGP VTAAQARALI DSCATSVRVL GRRDPVTGKI HVRGASGYRP TQYQLLVLRL TYPTCTFPGC SVPSSACQID HVTEYDHHES AAGGRTEIGN LVPLCGFHHR IKTETGWLSD VLPDGGVEWH HPTGATWVVP PGSARDLFPG LGTLAWDTSA RDTEREDRGA ERPTSGGFGG HARRRAAQRR RLRAMHRRLR HLRAARKTQE QEAALREAAS SLGHEPDSFL PAAPPF
|
| |