Gene Information |
Locus tag | TBFG_13599 |
Symbol | |
ID | 5224288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 4020112 |
End bp | 4021500 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640608368 |
Product | arylamine N-acetyltransferase nat (arylamine acetylase) |
Protein accession | YP_001289526 |
Protein GI | 148824772 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2162] Arylamine N-acetyltransferase |
TIGRFAM ID | |
|
|
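The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4020112
END_BP = 4021500


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon
    that is included in length_bp but not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1389, matching "Gene Length"
    print(protein_length(n_bp))  # 462, matching "Protein Length"
```

Under these assumptions, 4021500 - 4020112 + 1 = 1389 bp, and 1389 / 3 - 1 = 462 aa, in agreement with the record.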
Plasmid Coverage information |
Num covering plasmid clones | 257 |
Plasmid unclonability p-value | 0.00110183 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 195 |
Fosmid unclonability p-value | 0.274456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGAG CAGACGCAAA ATCGCCCATT TTGCGGGCGT GTTGGGCGAT TTTGCGTCTG CTCGCCGTCG CCGGTAAGCA GTCTGGTGAG CCACGTCCGG TATGTCGCGT GGTTGGGGCA GCTGTCCTCG CGGATGGCGA ATATGCAGAC GCTTACGTCA AGGAGCAGCA TTGTCGCCGA TCAGTTCGGC GAGGGCTTCC TTGTCGTCCA AGTCGACACC GGCCTGCAAG CCTCCGGTGC CGTGGGTCGG CAATAGCATC AGTAACAGGC ATCATGATGC GTTGTTGTGT CCGGCGCCGA TCCTCCGACC CGCCGGGCAT TCGGCCAGAT GGCGCGAGCA GCCACCGGAT GGGTGAGTGT CAGCGGGCAG TTTGCCGTAG CCGCCGACAC GTGTCGCTGC GAGGGCACTT TGTTTGCCGT CGACCCAGAA ACTCATGTGG CAAACCACAA TCGGTGCGAC ATAGTTGGCC GGCTGCGCGA CGAACGCCCG AATACCCTGC GGTCGGTCCG ACGCGGCGAC GAGGTCAGAA TGGCAACATG GCACTGGATC TGACCGCGTA CTTCGATCGC ATCAACTATC GCGGCGCTAC CGATCCAACC CTGGATGTTC TGCAGGATCT GGTGACCGTG CACAGTCGAA CGATTCCGTT CGAGAACCTC GACCCGCTGC TGGGGGTGCC GGTCGACGAC CTCAGTCCAC AGGCGCTGGC CGACAAGCTG GTACTTCGGC GCCGAGGCGG GTACTGCTTT GAGCACAACG GGCTGATGGG TTATGTGCTG GCCGAACTCG GCTATCGGGT GCGCCGATTC GCCGCCCGCG TCGTCTGGAA GCTCGCGCCG GACGCGCCCC TGCCGCCGCA GACGCACACC CTGCTGGGGG TCACGTTCCC CGGCTCGGGC GGATGCTATC TCGTCGACGT CGGATTCGGC GGCCAAACAC CGACCTCACC GCTTCGCCTC GAAACCGGCG CCGTCCAGCC GACAACGCAC GAACCTTATC GGCTCGAGGA CCGCGTCGAC GGCTTTGTCT TGCAGGCGAT GGTCCGGGAC ACATGGCAGA CACTGTACGA ATTCACCACC CAGACCCGCC CGCAGATCGA TCTGAAAGTG GCCAGCTGGT ACGCCTCAAC ACACCCGGCA TCGAAGTTCG TCACGGGACT GACCGCCGCG GTGATCACCG ACGACGCCCG GTGGAACCTA TCTGGCCGCG ACCTTGCCGT TCACCGTGCC GGTGGTACCG AGAAGATCCG CCTTGCCGAT GCGGCAGCGG TTGTCGACAC CCTGAGCGAA CGGTTCGGGA TCAACGTGGC AGATATCGGC GAGCGCGGCG CGCTCGAGAC GCGCATCGAC GAGCTATTGG CTCGGCAGCC AGGAGCCGAT GCGCCGTAA
|
Protein sequence | MARADAKSPI LRACWAILRL LAVAGKQSGE PRPVCRVVGA AVLADGEYAD AYVKEQHCRR SVRRGLPCRP SRHRPASLRC RGSAIASVTG IMMRCCVRRR SSDPPGIRPD GASSHRMGEC QRAVCRSRRH VSLRGHFVCR RPRNSCGKPQ SVRHSWPAAR RTPEYPAVGP TRRRGQNGNM ALDLTAYFDR INYRGATDPT LDVLQDLVTV HSRTIPFENL DPLLGVPVDD LSPQALADKL VLRRRGGYCF EHNGLMGYVL AELGYRVRRF AARVVWKLAP DAPLPPQTHT LLGVTFPGSG GCYLVDVGFG GQTPTSPLRL ETGAVQPTTH EPYRLEDRVD GFVLQAMVRD TWQTLYEFTT QTRPQIDLKV ASWYASTHPA SKFVTGLTAA VITDDARWNL SGRDLAVHRA GGTEKIRLAD AAAVVDTLSE RFGINVADIG ERGALETRID ELLARQPGAD AP
|
| |
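The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The gene sequence as listed starts with GTG and ends with TAA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand. Translation table 11 uses the same codon assignments as the standard genetic code; an alternative start codon such as GTG is read as methionine at the first position. The helper names `translate` and `gc_percent` are hypothetical, and the set of start codons used here (ATG, GTG, TTG) is a simplifying assumption rather than the full table 11 list.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("GTGGCGCGAG CAGACGCAAA ATCGCCCATT"))  # MARADAKSPI
```

The output for the first 30 bases matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.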