Gene Information |
Locus tag | Tpau_1595 |
Symbol | |
ID | 9155745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1668810 |
End bp | 1670474 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | allophanate hydrolase |
Protein accession | YP_003646555 |
Protein GI | 296139312 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
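The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the final codon of this non-spliced CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1668810
end_bp = 1670474
gene_length_bp = 1665
protein_length_aa = 554

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons;
# the last codon is assumed to be the stop and codes for no residue.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span matches the listed Gene Length, divides evenly into codons, and yields the listed Protein Length once the stop codon is excluded.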
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGTA TCGCCGATAT CTATCGCCGC ATCGCCGCCG ACGACCGACC CGAGGTCTTC GTCACTCTCC GGCCCGAGAC CGATGTCCAG GCCGACTACG ACGCCGCCGT CGCCGCGGGT GGCCCACTCG CGGGAATCAT CCTGGCGGTC AAGGACAACG TCGATGTCGC GGGCCTCCCG ACCACCGCGG CCTGCCCCGG ATACGCGTAC ACCGCGGAAC GGGACGCCGC AGCCGTCGCG GCCCTGCGTG CCGCCGGAGC CGTGGTGATC GGAAAGACCA ATCTCGATCA GTTCGCCACC GGCCTCGTGG GGACGCGGAG CCCGTACGGC GCCGTCCGCA ACGCCCACCG TCCCGATTAC GTCTCGGGCG GATCCAGTTC CGGATCCGCC GTCGCCGTCG CCCTCGACTA CGCAGACATC GCGATCGGTA CCGACACCGC CGGCTCGGGC CGGGTGCCCG CCGCTTTCCA GGGCGTCGTG GGCATCAAGC CCACCATCGG CGCCGTGAGC ACCGACGGCG TGATCCCCGC CTGCGCGTCG TACGACTGCG TCTCCGTCTT CGCGCGCGAT ACCGATGCCG CGAACCGTGC CATGGCGATC ATGGGCGCCA CCGGGCCTCG TGCATGGCCG GCCGACGCGC CGCTGGCGGC CGCCCCCGAT GCGACGGTGG CCGTACCTGC GGAACTCCCG GGCTTGAATG ACGCATGGCG CCAGGCCTTC GAGCGAGTCG TCGCGCAGGC CGAACAGGCA GGGCTGCGCG TGACCCGCAT CGACGTCGCC GACTTCCTGG CCGCGGGCCG GCTGCTCTAC GGCGGTGCAC TCGTCGCCGA ACGGTACAGC GCCGTCGGCG AATACCTCGC GACCGCCGGT CCGGATGCCG GCGTGGATCC CATCGTGGCA GGCATCATCA CCGCGGCCGG TGAGCTCCCC GCGTACCGTC TGGCAACCGA CCAGCAACGC CTGCGCGAGC TGGCCGAGCG CACCCGCTCC ATCCTCGACG GTTGCGCCGG GTTGCTCGTA CCGACCGCTC CGCGGCACCC CACCATCGCG GATGTGGCGG CCGACCCGGT GGGCGTGAAC TCCGAACTGG GCGCCTACGC CACCTTCTGC AACCTGCTCG ACCTGTGCGC GGTTGCGGTG CCTGCGGGGG AAACCGACGA CGGTGCGCCG TTCGGCATCA GCGTGCTCGC TCCGGCTGGA CACGATGCGG TGGCACTCGA CCTCGCCGCC AGGATCACCG GGGCACCGTC GCCCGAACCG TGGAACACCG AGTTCGTCGG TGCGCTGGAT CTCGCCGTCT TCGGCGCGCA CCTGCGCGGC CAGCCGCTGG AGCGCGAGCT GACCGCCCTG GGGGCACGGT GGGCCGGTCC GGTCGCGACC GCACCCGAGT ACCGCCTCGT CGCACTCGAC ACCGTGCCGC CGAAACCCGG CCTCGTGCAC GACCCTATCG ACGGTCGCTC GATCCGCGGC GAGCTGTGGC GGATTTCGCC CGCGGCACTC GGCACGTTCC TGTCCCGGTT GCCCGCGCCG ATGACGCTCG GTGCCGTCCG ATTGAACGAC GAGCGGAGCG TCGTCGGGTT CAGCTGCCAG GCCTCCGCTC TGGCCACCGC ACGCCTGCTG CACGTGGATC ACTGGCTCGA TCGTGATGCA GCCCCGCGGT CCTAG
|
Protein sequence | MSRIADIYRR IAADDRPEVF VTLRPETDVQ ADYDAAVAAG GPLAGIILAV KDNVDVAGLP TTAACPGYAY TAERDAAAVA ALRAAGAVVI GKTNLDQFAT GLVGTRSPYG AVRNAHRPDY VSGGSSSGSA VAVALDYADI AIGTDTAGSG RVPAAFQGVV GIKPTIGAVS TDGVIPACAS YDCVSVFARD TDAANRAMAI MGATGPRAWP ADAPLAAAPD ATVAVPAELP GLNDAWRQAF ERVVAQAEQA GLRVTRIDVA DFLAAGRLLY GGALVAERYS AVGEYLATAG PDAGVDPIVA GIITAAGELP AYRLATDQQR LRELAERTRS ILDGCAGLLV PTAPRHPTIA DVAADPVGVN SELGAYATFC NLLDLCAVAV PAGETDDGAP FGISVLAPAG HDAVALDLAA RITGAPSPEP WNTEFVGALD LAVFGAHLRG QPLERELTAL GARWAGPVAT APEYRLVALD TVPPKPGLVH DPIDGRSIRG ELWRISPAAL GTFLSRLPAP MTLGAVRLND ERSVVGFSCQ ASALATARLL HVDHWLDRDA APRS
|
| |
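The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustration, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is built from the standard NCBI amino acid string, which translation table 11 shares with table 1 for internal codons (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG). Only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, copied from the Sequence table.
first_blocks = "ATGAGCCGTA TCGCCGATAT CTATCGCCGC"
prefix = translate(first_blocks)
print(prefix)
```

The translated prefix should agree with the first block of the protein sequence (`MSRIADIYRR`). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.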