Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1132 |
Symbol | |
ID | 9155272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1159051 |
End bp | 1162386 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | UvrD/REP helicase |
Protein accession | YP_003646103 |
Protein GI | 296138860 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.378694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGC CGATGCCCCA GGACTTCCCC CACGCGCCGC TGATCTCGGC CGAGGACCTC GCCTACGAAC TGGGCCAGGC GTTCGCTCCC ACGCAGGAGC AGCGTGCGGT GATCGAGGCG CCGCTCGGCC CGTGCCTGGT GGTGGCGGGT GCGGGGGCGG GGAAGACCGA GACCATGGCG GGACGGGTGG TGTGGCTGAT CGCGAACCGG TACGTCACGC CCGACCAGGT ACTGGGCCTC ACGTTCACCC GCAAGGCCGC ACAGCAGTTG ATGATCCGGG TGCGCAAGCG GCTCTCCCGG CTCGGCGCGG CGCCCGCGCT GGAGCGGATC GACCCGAGCG GTGAGCTCCG GGTGCTGCTG CGCACCGTCG AACCCGAGAT CAGCACGTAC CACGCGTACG CGGGCCGGCT GCATGGCGAT TACGGCATGC TGCTCCCGGT GGAGCCCACC GTGCGCCTGG TGTCGGAGAC CGAGCGCTGG CAGATCGCTT TCGACGTGGT CACCGGCTGG GACGAGGCAC TCGAGACCGA CAAGAATCCG GCCACGCTCA CCGAACAGGT GCTGGGTCTG TCGGGCGCGC TCGCCGATCA CCTGGTGACC CCCGACCAAC TGGAAGCGAG CGACGACGAG CTGGAACGGC TGATCGGCCT GCTCCCGCCC GGCCCGCGCC AGCGCGCCGC CCCGAATGCG GCGTTGCTCA AGGCCGCCGA CGTGCAGGAG CAGCGGCGTG CGCTGATCCC GCTCGTCCGC GCGGTGGCCG CCGAGATGCG GGTGCGGGAG GTGCTCGACT TCGGTAGCCA GATGGGTCTG GCCGCGCAGC TGGCGCTGGC CAATCCTGAT GTGGCGGCGC TGGAACGCTC GCGGTTCGGC GCCGTCCTGC TGGACGAGTA CCAGGACACC GGCCATGCGC AGCGCATGCT GCTGGCGTCC TTGTTCGGCG GGCCCGGCGG CGCCGCGGCG GTGACCGCGG TGGGTGACCC GATCCAATCG ATCTACGGCT GGCGCGGCGC ATCGGCGGCC AACCTGCCCC GGTTCGCCAC CGACTTCCCG CAGGCCGACG GGATGCCGGC GCCCCGCCGC GAGCTGCTCA CCAGCTGGCG CAACCCCACC GGCGCGCTCA GCCTGGCGAA CGCCGTATCC GAGGATCTGC GACGTCGAGG CGTCCCGGTC TCCGAGCTCC GTGCCCGGCC CGACGCACCG TCGGGCGATC TGCGGATCGC CTTGCACTCC ACGGTGATCG ACGAGCGGAC CTGGGTGGCC GATGCGATCA CCGCGCTGTG GCGCGGCCGC CTCGACGCCG GCGATCCGCC GCCGACCGTC GCGGTCCTGG TGCGCCGCAA TGCCGATTCG GCGGGGCTGG CCGCCGCACT GGGCGAGCGC GGCGTGCCCG TGGAGGTGGT GGGCCTCGGC GGTCTGCTGC ACACACCCGA GGTGCAGGAC CTGGTGGCGC TGCTGCGGCT GGCCGTTGAA CCGCTCGCCG GCACTGCGGC GATGCGCCTG CTCACCGGGC CTCGGTGGCA GCTCGGCGCG GCCGACCTGC GGGCGCTGTG GAATCGGGCA AGGCGGATCG CGCACGGCAC CGGCAGAGCG GCCACGGGGC TGGTGACCAC GGCGGACGAG CTCGATCAGG CATTGGACGC CACTCTGCCC GCAGAACTGC TCGACGCGGC CGGACTGGGG GATGCGATCG CCGATCCCGG ACCGGACTCC GATTACTCGG CGGCCGGTCT GGCGAAGATC CGCTCGCTCG ACCGGGAGAT CCGCAACGTC CGTGAGCGGC TCGGTCACCC GCTGCCGGAG GTGGTGGCCG AGGCCGAGCG CGTTCTGGGG GTGAGCATCG AGACCCGGAT CCGCGCGGCC CGTCAGCTCG GTGGGCGCGC CACCGGACGC GAGCATCTCG ACGCGTTCGC CGATGTGGTG GTGTCCTACG CCGAGCGGCC GACGGCCACG CTGCCCGGCC TGCTGTCCTT CCTGGCCGCG GCCGAGGCGG TGGAAGGCGG ACTTACCCCG GGAGACGTCG AGGTGGCCAC CGACCGGGTG CAGGTGCTCA CCGTGCACTC GGCGAAGGGC CTCGAATGGG ACGTGGTCGC GGTGCCGCAC CTCTCGGAGG GCATCTTCCC GTCGAACCGG GCGATGCCCA CTTGGCTGTC TACTCCCGCC GAGCTGCCGC CCGAGTTGCG GGGCGATGTC GCGGAACCCG GTGAGTCCGA CGGGGTGCCG CGGCTCGACC TGAGCGAGTG CAACAACCGC AAAGACCTCG AGGATGCGCT CGATACGCAC CGCGCGGCGC TCAAAGCGAT GAACATGCAC GAGGATGAGC GGCTGTTCTA CGTCGCGGTC ACTCGTACAC AGGGCACGCT GCTGCTGTCG GGGCACTACT GGAGCGAGGA CGTGAAGACG GCGAAGGCGC CGTCCCGCTT CCTCGATCGT GCCCGCGAGC ACGCGCCCGA GGCGGTCGGT CACTGGGCCG CGACGCCCGT CGACGATGCG GAGAATCCGC TGGAGGCCGA GCCGGTGCAG GCCCCGTGGC CGCGAGACTT TCTCGCCGCT CATCGGGCCG ATGCGGATGC GGGGGCGGCG CTGGTGCTGG CGGCCCTCGC CGATCCGGAC GGGACCGAGG CGGATGCGCG GGCGCCCGGG GAGGATCCGC ACGGCTGGGC GGCGGATGTG ACCGCTCTGC TCGCCGAGCG TGCGCGGCAG GCGGCGATGG ATCTCGAAGT GGCCGTGCCG CGCGAGGTTT CGGTGAGTCA GCTGGTGGAA TTACGGCGCA GCCCGGAGAC TTTCGCGCGC AGGTTGCGGC GGCCCGTACC GTACCGGCCG AACCCGTACG CCCGGCGCGG CACGGCCTTC CACGCCTGGC TGGAGCGCCG GTACGGTGCA TCGCGCCTGT TGGACTTCGA CGAGTTGCCC GGCGCCGCCG ACGGTGATGC CGGTGCCGAC GAGAATCTGG CGTTGCTGCA GCGGCGTTTC GAAGAGAGCG AGTGGGCGGC GCGCACCCCC GTCGACATCG AGGTACCGTT CGAGATCGCC GCGGCCGGCA CGGTGGTCCG GGGCCGGATG GACGCCGTCT TCCGCGATCC GGGAGGTGGT TTCACCGTGG TCGACTGGAA GACCGGGGTG CGGCCGTCCG AGCCCGCCGA TGAGCGCGCC GCCGCAGTGC AGCTGGCCGC GTACCGTCTG GCGTGGGCGA GGCTCCGCGA CGTTCCCGTC GACGAGGTGC GGGCGGCCTT CTTCTACGTC CGGTCCGGCG AGACCGTCTC ACCGTCCGAC CTGCTCGACC ACGCCGGACT GGAACTGCTG ATCACGAGCG CGGGGCGCAG CGATCTCGGC GAATAA
|
Protein sequence | MTEPMPQDFP HAPLISAEDL AYELGQAFAP TQEQRAVIEA PLGPCLVVAG AGAGKTETMA GRVVWLIANR YVTPDQVLGL TFTRKAAQQL MIRVRKRLSR LGAAPALERI DPSGELRVLL RTVEPEISTY HAYAGRLHGD YGMLLPVEPT VRLVSETERW QIAFDVVTGW DEALETDKNP ATLTEQVLGL SGALADHLVT PDQLEASDDE LERLIGLLPP GPRQRAAPNA ALLKAADVQE QRRALIPLVR AVAAEMRVRE VLDFGSQMGL AAQLALANPD VAALERSRFG AVLLDEYQDT GHAQRMLLAS LFGGPGGAAA VTAVGDPIQS IYGWRGASAA NLPRFATDFP QADGMPAPRR ELLTSWRNPT GALSLANAVS EDLRRRGVPV SELRARPDAP SGDLRIALHS TVIDERTWVA DAITALWRGR LDAGDPPPTV AVLVRRNADS AGLAAALGER GVPVEVVGLG GLLHTPEVQD LVALLRLAVE PLAGTAAMRL LTGPRWQLGA ADLRALWNRA RRIAHGTGRA ATGLVTTADE LDQALDATLP AELLDAAGLG DAIADPGPDS DYSAAGLAKI RSLDREIRNV RERLGHPLPE VVAEAERVLG VSIETRIRAA RQLGGRATGR EHLDAFADVV VSYAERPTAT LPGLLSFLAA AEAVEGGLTP GDVEVATDRV QVLTVHSAKG LEWDVVAVPH LSEGIFPSNR AMPTWLSTPA ELPPELRGDV AEPGESDGVP RLDLSECNNR KDLEDALDTH RAALKAMNMH EDERLFYVAV TRTQGTLLLS GHYWSEDVKT AKAPSRFLDR AREHAPEAVG HWAATPVDDA ENPLEAEPVQ APWPRDFLAA HRADADAGAA LVLAALADPD GTEADARAPG EDPHGWAADV TALLAERARQ AAMDLEVAVP REVSVSQLVE LRRSPETFAR RLRRPVPYRP NPYARRGTAF HAWLERRYGA SRLLDFDELP GAADGDAGAD ENLALLQRRF EESEWAARTP VDIEVPFEIA AAGTVVRGRM DAVFRDPGGG FTVVDWKTGV RPSEPADERA AAVQLAAYRL AWARLRDVPV DEVRAAFFYV RSGETVSPSD LLDHAGLELL ITSAGRSDLG E
|
| |