Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1839 |
Symbol | |
ID | 9155989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1922434 |
End bp | 1924125 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003646796 |
Protein GI | 296139553 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.036346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGTT CACTTCGTGT CTGGCAGCGC CGTGCGCTCA CGAAGTACCT GACCGCGAAG CCGCAGGACT TCCTGGCCGT GGCTACTCCG GGCGCCGGAA AGACCACGTT CGCCCTGCGT GTGGCAGCCG AGCTGTTGGC CGATCGCACC GTGGAGCGGG TCACCGTGGT CGCCCCCACT GAGCACCTGA AGTACCAGTG GGCCGAAGCG GCCGCGCGGA ACGGCATCAA CCTCGACCCG AACTTCACCA ATTCGGGCGG CAGCACTTCG TCCGACTTCG ACGGTGTCGT GATCACCTAC GCACAGGTGG GGATGCACCC GTACAAGCAC CACGCGCGCA CCACCGCCTA CAAGACCCTG GTGATCCTTG ATGAGATCCA CCACGCCGGT GACGCGAAGA GCTGGGGCGA GGGCGTGCGC GAGGCCTTCG AGGGTGCGAC CCGCCGTCTG GCGCTCACCG GCACCCCGTT CCGCAGCGAC GACAACCCGA TCCCGTTCGT CACCTACGAG CCGGAGTTCG GCGGGGGACA GCGGTCGAAG GCCGACCACG TCTACGGCTA TTCCGACGCA CTCGCCGACG GCGTGGTGCG GCCTGTGGTC TTCCTCGCCT ACTCGGGCCA GGCGAGCTGG CGCACCAGCG CCGGTGAGGA ATTCACCGCC CGCCTCGGCG AACCGCTGAG TAAGGAGCAG ACCGCCCGGG CGTGGCGCAC GGCGCTCGAC CCGCACGGCG ATTGGATCCC CGCCGTGCTG CACGCCGCGA ACACCCGTCT CGATCAGCTG CGCCGGACGA TGCCCGATGC CGGCGGCCTG GTGATCGCGA CCGATCAGAG CACCGCCCGC GACTACGCCG AGCTGCTGCA CGACATCACC GGCGAGAAGG TCACGGTGGT GCTGTCCGAC GATCCGACCG CCTCGAAACG GATCAGCGAG TTCTCGTCGA GCCGGGACAA GTGGATGGTC GCGGTGCGGA TGGTGTCCGA GGGCGTCGAT GTTCCGCGTC TCGCGGTGGG TGTGTACGCC ACCAGCGCCT CGACGCCGCT GTTCTTCGCG CAGGCGATCG GCCGCTTCGT CCGGTCGCGC GCGCAGGGCG AGACCGCCAG CGTGTTCCTG CCCTCGGTCC CGGTTCTGCT CGACCTCGCG TCGAAGCTGG AGGAACAGCG TGACCACGTC CTGGGCAAGC CGCATCGCGA GTCCGACGGG CTCGACGACG CGCTGCTGAT CGACGCGAAC AAGCAGAAGG ACGAGCCCGG CGAGGAGGAG AAGGCCTTCG TCTCACTGCA CGCCGACGCC GAACTCGACC AACTCATCTA CGACGGATCG TCCTACGGGA CCGCAACCTT CGCGGGCAGC GACGAGGAGG CCGACTACCT GGGGCTGCCC GGCCTGCTCG ATGCGGAGCA GATGCGGGCG TTGCTCAAAC AGCGGCAGAA GGAACAGGTG CAGACCCGCA CCGTCGAGGC CGAGCGGCCG GCTCCGCCGG TCGAGGTCGA ACAGCGGGCC ACCGCCGGTG AGCAGTTGTC GGCGTTGCGT CGCGAGCTGA ACTCGCTGGT CGCCATGCAT CATCACCGCA CCGGCAAGCC ACACGGTGTG GTCCACAACG AGCTGCGCAG CCGCCTGGGC GGCCCGGTCA CTGCGATGGC GTCCGCCGAG CAACTGCGCG AGCGGATCGC CGCTCTGCGC ACCTGGCGCT AG
|
Protein sequence | MAGSLRVWQR RALTKYLTAK PQDFLAVATP GAGKTTFALR VAAELLADRT VERVTVVAPT EHLKYQWAEA AARNGINLDP NFTNSGGSTS SDFDGVVITY AQVGMHPYKH HARTTAYKTL VILDEIHHAG DAKSWGEGVR EAFEGATRRL ALTGTPFRSD DNPIPFVTYE PEFGGGQRSK ADHVYGYSDA LADGVVRPVV FLAYSGQASW RTSAGEEFTA RLGEPLSKEQ TARAWRTALD PHGDWIPAVL HAANTRLDQL RRTMPDAGGL VIATDQSTAR DYAELLHDIT GEKVTVVLSD DPTASKRISE FSSSRDKWMV AVRMVSEGVD VPRLAVGVYA TSASTPLFFA QAIGRFVRSR AQGETASVFL PSVPVLLDLA SKLEEQRDHV LGKPHRESDG LDDALLIDAN KQKDEPGEEE KAFVSLHADA ELDQLIYDGS SYGTATFAGS DEEADYLGLP GLLDAEQMRA LLKQRQKEQV QTRTVEAERP APPVEVEQRA TAGEQLSALR RELNSLVAMH HHRTGKPHGV VHNELRSRLG GPVTAMASAE QLRERIAALR TWR
|
| |