Gene TBFG_13599 details


Gene Information

Locus tag: TBFG_13599
Symbol:
ID: 5224288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 4020112
End bp: 4021500
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 64%
IMG OID: 640608368
Product: arylamine n-acetyltransferase nat (arylamine acetylase)
Protein accession: YP_001289526
Protein GI: 148824772
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2162] Arylamine N-acetyltransferase
TIGRFAM ID:
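
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4020112
END_BP = 4021500
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (1389 bp) and protein length (462 aa) agree with the reported coordinates.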


Plasmid Coverage information

Num covering plasmid clones: 257
Plasmid unclonability p-value: 0.00110183
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 195
Fosmid unclonability p-value: 0.274456
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCGAG CAGACGCAAA ATCGCCCATT TTGCGGGCGT GTTGGGCGAT TTTGCGTCTG 
CTCGCCGTCG CCGGTAAGCA GTCTGGTGAG CCACGTCCGG TATGTCGCGT GGTTGGGGCA
GCTGTCCTCG CGGATGGCGA ATATGCAGAC GCTTACGTCA AGGAGCAGCA TTGTCGCCGA
TCAGTTCGGC GAGGGCTTCC TTGTCGTCCA AGTCGACACC GGCCTGCAAG CCTCCGGTGC
CGTGGGTCGG CAATAGCATC AGTAACAGGC ATCATGATGC GTTGTTGTGT CCGGCGCCGA
TCCTCCGACC CGCCGGGCAT TCGGCCAGAT GGCGCGAGCA GCCACCGGAT GGGTGAGTGT
CAGCGGGCAG TTTGCCGTAG CCGCCGACAC GTGTCGCTGC GAGGGCACTT TGTTTGCCGT
CGACCCAGAA ACTCATGTGG CAAACCACAA TCGGTGCGAC ATAGTTGGCC GGCTGCGCGA
CGAACGCCCG AATACCCTGC GGTCGGTCCG ACGCGGCGAC GAGGTCAGAA TGGCAACATG
GCACTGGATC TGACCGCGTA CTTCGATCGC ATCAACTATC GCGGCGCTAC CGATCCAACC
CTGGATGTTC TGCAGGATCT GGTGACCGTG CACAGTCGAA CGATTCCGTT CGAGAACCTC
GACCCGCTGC TGGGGGTGCC GGTCGACGAC CTCAGTCCAC AGGCGCTGGC CGACAAGCTG
GTACTTCGGC GCCGAGGCGG GTACTGCTTT GAGCACAACG GGCTGATGGG TTATGTGCTG
GCCGAACTCG GCTATCGGGT GCGCCGATTC GCCGCCCGCG TCGTCTGGAA GCTCGCGCCG
GACGCGCCCC TGCCGCCGCA GACGCACACC CTGCTGGGGG TCACGTTCCC CGGCTCGGGC
GGATGCTATC TCGTCGACGT CGGATTCGGC GGCCAAACAC CGACCTCACC GCTTCGCCTC
GAAACCGGCG CCGTCCAGCC GACAACGCAC GAACCTTATC GGCTCGAGGA CCGCGTCGAC
GGCTTTGTCT TGCAGGCGAT GGTCCGGGAC ACATGGCAGA CACTGTACGA ATTCACCACC
CAGACCCGCC CGCAGATCGA TCTGAAAGTG GCCAGCTGGT ACGCCTCAAC ACACCCGGCA
TCGAAGTTCG TCACGGGACT GACCGCCGCG GTGATCACCG ACGACGCCCG GTGGAACCTA
TCTGGCCGCG ACCTTGCCGT TCACCGTGCC GGTGGTACCG AGAAGATCCG CCTTGCCGAT
GCGGCAGCGG TTGTCGACAC CCTGAGCGAA CGGTTCGGGA TCAACGTGGC AGATATCGGC
GAGCGCGGCG CGCTCGAGAC GCGCATCGAC GAGCTATTGG CTCGGCAGCC AGGAGCCGAT
GCGCCGTAA
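
The gene sequence above is laid out in space-separated blocks of 10 bases, so it must be flattened before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence in this record, used as sample input.
first_line = "GTGGCGCGAG CAGACGCAAA ATCGCCCATT TTGCGGGCGT GTTGGGCGAT TTTGCGTCTG"
seq = clean_sequence(first_line)

print(len(seq))                      # 60 bases on this line
print(round(gc_fraction(seq), 2))    # GC fraction of this line only
```

Applying the same two functions to the full sequence gives values that can be compared with the reported gene length (1389 bp) and GC content (64%).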
 
Protein sequence
MARADAKSPI LRACWAILRL LAVAGKQSGE PRPVCRVVGA AVLADGEYAD AYVKEQHCRR 
SVRRGLPCRP SRHRPASLRC RGSAIASVTG IMMRCCVRRR SSDPPGIRPD GASSHRMGEC
QRAVCRSRRH VSLRGHFVCR RPRNSCGKPQ SVRHSWPAAR RTPEYPAVGP TRRRGQNGNM
ALDLTAYFDR INYRGATDPT LDVLQDLVTV HSRTIPFENL DPLLGVPVDD LSPQALADKL
VLRRRGGYCF EHNGLMGYVL AELGYRVRRF AARVVWKLAP DAPLPPQTHT LLGVTFPGSG
GCYLVDVGFG GQTPTSPLRL ETGAVQPTTH EPYRLEDRVD GFVLQAMVRD TWQTLYEFTT
QTRPQIDLKV ASWYASTHPA SKFVTGLTAA VITDDARWNL SGRDLAVHRA GGTEKIRLAD
AAAVVDTLSE RFGINVADIG ERGALETRID ELLARQPGAD AP