Gene Information |
Locus tag | Tpau_1377 |
Symbol | |
ID | 9155525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1441481 |
End bp | 1442617 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Uroporphyrinogen III synthase HEM4 |
Protein accession | YP_003646344 |
Protein GI | 296139101 |
COG category | |
COG ID | |
TIGRFAM ID | |
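The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1441481
end_bp = 1442617
gene_length_bp = 1137
protein_length_aa = 378


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value stated in the record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa

print(coords_consistent, protein_consistent)
```

Under these assumptions both checks agree with the record: 1442617 - 1441481 + 1 = 1137 bp, and 1137 / 3 - 1 = 378 aa.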
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.855752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGTC CGACGACCGA GGCCTCGGCC ACCCCACTTC TCGGCTTCAC CGTGGCCGTG ACCGCCGCCC GCCGCGCCGA CGAGCTGTCC ACTCTGCTCA TCCGGCGCGG CGCCGCCGTG GTCGCGGCCC CCGCGATCAG CATGGTGCCG CTGTCCGATG ATGTGCGGTT GCGCGCCGCC ACGGACGAGT TGCTCACGGC ACCACCGGAC CTGCTCGTCG CCACCACCGG CATCGGTTTC CGCGGCTGGC TGGAGGCCGC CGAGGGCTGG GGCGTCGCGG AGGGCCTCGT GGCCGCCCTT GGCGGCGGCC GGGTGATCTC CCGCGGACCC AAGGCCACCG GCGCTCTGCG CGCCGCCGGT TTGCGCGAGG AGTGGTCGCC CAAATCGGAA TCGTCGGCGG AGGTGCTCAG TCACCTCGCG GGGGAGGATC TGCACGGGCA GCGGGTCGCC GTCCAGCTGC ACGGCGCGAC CGACGACTGG GACCCCAACC CGGGTTTGCT CGACGGACTG CGGGACCTGG GCGCGGAGAT CGTGCCGGTG CCCGTGTACC GGTGGGAGCA GCCGGAGGAC TTATCGGGCC TCGACCGCAT CGTCGAGGCG ATCGCTCTCG CCGAGGTCGA CGCCGTCACC TTCACTTCTG CTCCTGCCGC GGCCTCGCTG TTGGAGCGTG CCGTCGAGCT CGGCGTCGCC GACTCGGTGC GCGCCGCCCT CACCGGCCCG GTGGTGGCGT ACTGCGTGGG GCCGGTCACC GCGACCCCGC TGCAGCGTGA GGGGATCGAA TCGGTGACCC CCGAGCGGAT GCGGCTCGGC GCCCTGGCAC GGCTGGTCGA ACAGGACCTT CCGGGCCGCA GGCCGGACCT GCGGGTGGCC GGGCACAGCC TCGGACTACG CGCGCGCGGC GCCGTCGTCG ACGGTGCCGA ACGCGACCTC ACGCCCACCT CGATCACCCT GCTCCGCCTG CTCGCGCGCG AGCCCGGCGA TGTGGTCTCC CGCGACGAGT TGCTCGCCGC GCTCGGCGGA GACGATCCGC ACGCCGTCGA GGCGGCCGTC GCCCGGTTGC GAACCGGGCT GGGCCACAAG GAGATCATCG CGACGGTGGT CAAGCGGGGA TACCGGCTTG CGCTCGACGA GCACTGA
|
Protein sequence | MTGPTTEASA TPLLGFTVAV TAARRADELS TLLIRRGAAV VAAPAISMVP LSDDVRLRAA TDELLTAPPD LLVATTGIGF RGWLEAAEGW GVAEGLVAAL GGGRVISRGP KATGALRAAG LREEWSPKSE SSAEVLSHLA GEDLHGQRVA VQLHGATDDW DPNPGLLDGL RDLGAEIVPV PVYRWEQPED LSGLDRIVEA IALAEVDAVT FTSAPAAASL LERAVELGVA DSVRAALTGP VVAYCVGPVT ATPLQREGIE SVTPERMRLG ALARLVEQDL PGRRPDLRVA GHSLGLRARG AVVDGAERDL TPTSITLLRL LAREPGDVVS RDELLAALGG DDPHAVEAAV ARLRTGLGHK EIIATVVKRG YRLALDEH