Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2422 |
Symbol | |
ID | 9156582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2516303 |
End bp | 2519161 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003647367 |
Protein GI | 296140124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGATC GCCTGATCGT GCGCGGGGCG CGCGAACACA ACCTGCGGGG AATCGATGTG GATCTCCCGC GCGACAGCCT CATCGTCTTC ACCGGGCTGT CGGGTTCCGG CAAGTCCAGC CTCGCCTTCG ACACCATCTT CGCGGAGGGA CAGCGGCGCT ACGTCGAATC GCTGTCGGCG TACGCCCGCC AGTTCCTCGG CCAGATGGAC AAGCCCGATG TCGATTTCAT CGAGGGCCTC TCGCCCGCGG TGTCGATCGA CCAGAAGTCG ACCAACCGCA ACCCGCGGTC CACCGTGGGC ACCATCACCG AGGTCTACGA CTACCTGCGC CTGCTGTACG CCCGCGCAGG CACCCCGCAC TGCCCCGAAT GCGGCTCGGT GATCGCGCGG CAGACGCCGC AGCAGATCGT CGATCAGGTG CTCGACATGG AACAGGGCAC TCGGTTCCAG GTGCTTGCGC CGGTGGTGCG CACCCGCAAG GGCGAGTTCG TCGACTTGTT CAATCAGTTG CAGACCCAGG GCTACGCCCG CGCCCGGATC GACGGGGTGG TCTACCAGCT CAGTGAGCCG CCGAAGCTCA AGAAGCAGGA GAAACACGAC ATCGAGGTGG TCGTCGACCG GCTCGCCGTG AAGGCCTCGT CGAAGCAGCG GCTCACCGAC TCCATCGAGA CCGCCCTGCG GCTGGCCGAC GGTATCGTCG TGCTCGATTT CGTGGACCGC GAGGAGAACG ACCCCGAGCG TGAACGCCGG TTCTCCGAGA AGATGGCCTG CCCCAACGGG CACCCCATTG CGGTCGACGA TCTGGAACCC CGTTCGTTCT CGTTCAATTC ACCGTACGGT GCCTGCCCCG AGTGCGACGG CCTCGGCGTC CGCAAGGAGG TCGACCCGGA CCTCGTCGTA CCCGATCCCG AGTTGTCGCT CGCCGAGGGC GCCATAGCCC CCTGGGCATC CGGCCAGACC GCCGACTACT TCCTGCGGCT GCTTTCCGGT CTCGCGGACG CGATGGGCTT CGACCTGGAC ACCCCGTGGA AGAGCCTGCC GGCCAAGGCC CGCAAGGCGG TGATCGAGGG CTCCGAACAC CAGGTGCACG TCAAGTACAA GAACCGGTAC GGCCGTACCC GCTCGTACTA CGCCGAATTC GAGGGCGTGA TGCCCTTCCT GCACCGCCGG CTCGAATCCA CCGAGTCCGA GCAGATGAAG GAGCGGTACG AGGGCTACAT GCGCGACGTA CCGTGCCCCG TCTGTCAGGG CGCTCGGCTG CGTCCCGAGA TCCTCGCGGT CACCCTCGAC CACCCGCGAT TCGGCGAGAA GTCCATCGCC GATGTGGCCG CGATGTCGGT GGGCGAGTGC TCCGACTACC TGTCCGATCT CAAGCTCGGC GCCCGGGAGG CCGCGATCGC CGGCCGCGTC CTCAAAGAGG TGCAGGCCCG GATCGACTTC CTGCTGGACG TGGGCCTGGA ATACCTGTCG CTCTCGCGGG CTGCCGGCTC TCTCTCGGGC GGTGAGGCGC AGCGCATCCG GCTCGCCACC CAGATCGGCT CCGGTCTCGC GGGCGTGCTG TACGTGCTCG ACGAGCCGTC CATCGGCCTG CATCAGCGCG ACAATCGCCG CCTCATCGAC ACCCTGGTGC GGTTGCGCGA CCTGGGCAAC ACGCTGATCG TGGTCGAACA CGACGAAGAC ACCATCCGTA CCGCCGACTG GGTGGTCGAC ATCGGCCCGT ACGCGGGTGA GCACGGTGGC AAGGTGGTGC ACAGTGGCAC CTACAAGGCG CTGCTGAAGA ACAAGGAGTC GCTGACAGGC GCCTATCTGT CCGGACGCAG GGCACTCCCG GTGCCCGCGG TGCGTCGGCC GATCGATAAG AAGCGCCAGC TCAAGGTGGT CGGCGCCCGC GAGCACAACC TGCAGGGGAT CGACGTTTCG TTCCCGCTGG GCGTGCTCAC CGCGGTCACG GGCGTCTCCG GCTCGGGCAA GTCGACCCTG GTCAACGACA TCCTGGCCAC TGTGCTGGCC AACAAGCTCA ACGGTGCACG GCAGGTACCG GGACGGCACA GCCGGGTCAC CGGCCTGGAC GATCTGGACA AGTTGGTCCA GGTGGATCAG TCGCCGATCG GCCGCACGCC CCGGTCGAAC CCGGCCACCT ACACCGGCGT CTTCGACAAG ATCCGCACGC TGTTCGCCGC TACCACCGAA GCCAAGGTGC GTGGTTATCA GCCGGGCCGG TTCTCGTTCA ACGTCAAGGG CGGGCGGTGC GAGGCCTGCT CGGGCGACGG CACGATCAAG ATCGAGATGA ACTTCCTGCC GGACGTGTAC GTGCCGTGCG AGGTGTGTCA CGGCGCCCGG TACAACCGCG AGACGCTGGA GGTGCATTAC AAGGGCAAGA ACATCGCCGA GGTACTCGAT ATGCCGATCG AGGAGGCCGC CGACTTCTTC GAGGCGGTCA CCTCGATCCA CCGCTACCTC AAGACACTGG TGGAAGTGGG TCTCGGTTAC GTGCGGCTCG GACAGCCGGC CACCACGCTC TCGGGTGGCG AGGCGCAGCG CGTGAAGCTG GCCGCTGAGT TGCAGAAGCG GTCGAACGGG CGCACCGTGT ACATCCTCGA CGAGCCCACC ACGGGCCTGC ACTTCGAGGA CATCGCGAAG TTGCTTCAGG TGATCGACGG GCTGGTGGAC AAGGGCAACT CGGTCATCGT GATCGAGCAC AACCTGGACG TGATCAAGAC CGCCGACTGG ATCGTGGACA TGGGGCCGGA AGGTGGCAGC GGGGGAGGCA CCGTGGTCGC GCAGGGGACA CCGGAGGATG TGGCCGCGGT GCCCGAGTCG TACACAGGGA AGTTTCTCGC GGAGGTTCTG GCCACGTAG
|
Protein sequence | MADRLIVRGA REHNLRGIDV DLPRDSLIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TNRNPRSTVG TITEVYDYLR LLYARAGTPH CPECGSVIAR QTPQQIVDQV LDMEQGTRFQ VLAPVVRTRK GEFVDLFNQL QTQGYARARI DGVVYQLSEP PKLKKQEKHD IEVVVDRLAV KASSKQRLTD SIETALRLAD GIVVLDFVDR EENDPERERR FSEKMACPNG HPIAVDDLEP RSFSFNSPYG ACPECDGLGV RKEVDPDLVV PDPELSLAEG AIAPWASGQT ADYFLRLLSG LADAMGFDLD TPWKSLPAKA RKAVIEGSEH QVHVKYKNRY GRTRSYYAEF EGVMPFLHRR LESTESEQMK ERYEGYMRDV PCPVCQGARL RPEILAVTLD HPRFGEKSIA DVAAMSVGEC SDYLSDLKLG AREAAIAGRV LKEVQARIDF LLDVGLEYLS LSRAAGSLSG GEAQRIRLAT QIGSGLAGVL YVLDEPSIGL HQRDNRRLID TLVRLRDLGN TLIVVEHDED TIRTADWVVD IGPYAGEHGG KVVHSGTYKA LLKNKESLTG AYLSGRRALP VPAVRRPIDK KRQLKVVGAR EHNLQGIDVS FPLGVLTAVT GVSGSGKSTL VNDILATVLA NKLNGARQVP GRHSRVTGLD DLDKLVQVDQ SPIGRTPRSN PATYTGVFDK IRTLFAATTE AKVRGYQPGR FSFNVKGGRC EACSGDGTIK IEMNFLPDVY VPCEVCHGAR YNRETLEVHY KGKNIAEVLD MPIEEAADFF EAVTSIHRYL KTLVEVGLGY VRLGQPATTL SGGEAQRVKL AAELQKRSNG RTVYILDEPT TGLHFEDIAK LLQVIDGLVD KGNSVIVIEH NLDVIKTADW IVDMGPEGGS GGGTVVAQGT PEDVAAVPES YTGKFLAEVL AT
|
| |