Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2002 |
Symbol | |
ID | 9156157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2091005 |
End bp | 2092012 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | biotin synthase |
Protein accession | YP_003646953 |
Protein GI | 296139710 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.678074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAG TGCTGGCCGA CATCATCGAC GTCGCACGCG AGCAGGTGCT CGAACGCGGC GAGGGCTTGA ATCAGGAACA GGTGCTCGCC GTCCTGCAGC TCTCGGACGA CCGGCTGGAC GAGGTGCTGC AGTTGGCGCA CGAGGTGCGC ATGAAGTGGT GCGGTCCCGA GGTCGAGGTC GAGGGCATCA TCAGCCTCAA GACCGGCGGC TGCCCCGAGG ATTGCCACTT CTGCAGCCAG TCCGGCCTGT TCGAATCGCC GGTTCGGTCC GCCTGGATCG ATATCCCGTC GCTCGTCGAG GCCGCTAAGC AGACCGCGAA GACGGGTGCC ACCGAGTTCT GCATCGTCGC CGCCGTGCGG GGCCCGGACA AGCGGCTGCT GAGCCAGGTC GCGGCGGGGA TCGAGGCCAT CCGCAACGAG GTCGACATCC AGATCGCGTG CAGCCTCGGC ATGCTCACCC AGGAGCAGGT CGATCAGCTC AAAGATATGG GTGTGCACCG CTACAACCAC AACCTCGAGA CGTCCCGGTC GCATTTCCCC AACGTGGTCA CCACGCACAC CTGGGAGGAG CGCTGGGACA CGCTGCGCAT GGTCCGTGAG GCCGGTATGG AGGTGTGCTG CGGAGGCATC CTGGGGATGG GGGAGTCCTT GGAACAGCGC GCCGAGTTCG CCGCGAACCT CGCCGAACTC GAGCCCGACG AGGTTCCCCT GAACTTCCTC AACCCGCGGC CCGGAACCCC GTTCGGCGAT CTCGACGTGC TGCCTGCCGC GGAGGCGCTC AAGTCGGTCG CCGCGTTCCG GCTCGCCTTG CCGCGCACCA TCCTCCGGTT CGCCGGTGGT CGCGAGATCA CCCTGGGAGA TCTGGGTGCG GAGAAGGGCA TTCTGGGTGG TATCAACGCC GTCATCGTGG GCAACTACCT CACCACCCTG GGACGCCCGG CGGAGACGGA CCTCGATCTG CTCAACGAGC TGAAGATGCC GATCAAGGCT CTCAACGATT CGCTGTGA
|
Protein sequence | MTGVLADIID VAREQVLERG EGLNQEQVLA VLQLSDDRLD EVLQLAHEVR MKWCGPEVEV EGIISLKTGG CPEDCHFCSQ SGLFESPVRS AWIDIPSLVE AAKQTAKTGA TEFCIVAAVR GPDKRLLSQV AAGIEAIRNE VDIQIACSLG MLTQEQVDQL KDMGVHRYNH NLETSRSHFP NVVTTHTWEE RWDTLRMVRE AGMEVCCGGI LGMGESLEQR AEFAANLAEL EPDEVPLNFL NPRPGTPFGD LDVLPAAEAL KSVAAFRLAL PRTILRFAGG REITLGDLGA EKGILGGINA VIVGNYLTTL GRPAETDLDL LNELKMPIKA LNDSL
|
| |