Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_12661 |
Symbol | |
ID | 5223343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 2980227 |
End bp | 2981723 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640607423 |
Product | arsenic-transport integral membrane protein arsC |
Protein accession | YP_001288590 |
Protein GI | 148823836 |
COG category | [P] Inorganic ion transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0394] Protein-tyrosine-phosphatase [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 219 |
Plasmid unclonability p-value | 0.000000400645 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 223 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGA CGGTCACCCG CACCGCCGCC CCGGCGGTGG TGGGCAAACT CTCGACGCTG GACCGCTTCT TGCCGGTGTG GATCGGGTCG GCAATGGCCG CCGGGCTACT ACTGGGCCGG TGGATTCCCG GCCTGCACAC CGCCCTAGAA GGGGTTCAGC TCGACGGGAT TTCGCTGCCG ATCGCGCTAG GCCTGCTGAT CATGATGTAT CCGGTGCTGG CCAAGGTGCG CTACGACCGC CTCGACACCG TCACCGGTGA CCGCAAGCTG CTACTCAGCT CGCTGCTGCT GAACTGGGTA CTGGGCCCGG CGTTGATGTT CGCGCTGGCT TGGCTGCTAC TGGCGGATCT GCCCGAGTAC CGCACCGGGC TGATCATCGT GGGCCTGGCT CGCTGCATCG CCATGGTGAT CATCTGGAAC GACCTGGCCT GCGGGGATCG CGAAGCCGCC GCCGTGCTCG TCGCGTTGAA CTCGATCTTT CAGGTGGCCA TGTTCGCCGC GCTCGGCTGG TTCTACCTGT CGGTGCTACC GGGTTGGCTG GGCCTCGAGC AGACCACCAT CGCCACATCC CCGTGGCAGA TCGCCAAGTC GGTGCTGATC TTCCTCGGCA TCCCGCTGCT GGCCGGCTAC CTGTCGCGGC GGATCGGCGA AAAGACCAAG GGCCGCAACT GGTATGAATC CCGCTTCCTG CCCAAGGTGG GACCGTGGGC GCTCTACGGT TTGCTGTTCA CCATCGTGAT TCTCTTTGCG CTGCAAGGAG ATCAGATCAC CGGCCGACCG CTGGACGTCG CACGCATTGC GCTGCCGCTG CTGGCCTACT TCGCCATCAT GTGGGTAGGC GGCTACCTAC TGGGGGCGGC GCTGCGGCTA GGGTATCGGC GCACCACCAC GCTGGCGTTC ACCGCCGCGA GCAACAACTT CGAGCTGGCC ATCGCGGTGG CCATCGCCAC CTACGGCGCC ACCTCCGGGC AAGCCCTGGC CGGAGTCGTC GGGCCCCTGA TCGAGGTACC CGTCCTGGTG GGGTTGGTCT ATGTGTCCCT GGCGCTGCGC AACCGCCTCG CCGGTCCCAA CGCGACCCAC GATGCCGACA AACCCAGCGT CCTATTCGTC TGTGTGCACA ACGCCGGACG TTCCCAGATG GCCGCCGGGC TATTGACCCA CTTGGCCGGT GACCGCATCG AAGTCCGTTC GGCCGGAACC GAGCCCGCCG GTCAGGTCAA TCCGACGGCT GTGGCCGCGA TGGCCGAAAT GGGCATCGAT ATCACCGCCA ATGCCCCCAC ATTGCTCACC GGCGGGCAGG TCCAGTCCAG CGACGTCGTC ATCACGATGG GCTGCGGCGA TGCCTGCCCT TACTTCCCGG GTGTCTCCTA CCGCAACTGG AAACTACCCG ATCCCGCCGG CCAGCCCCTC GACGTTGTGC GCATGATCCG CGACGACATC GCAGACCGCG TCCAAGCCCT GATCGCCGAG CTGCTGGCCA CCGCCAAGAC CAGATAG
|
Protein sequence | MTETVTRTAA PAVVGKLSTL DRFLPVWIGS AMAAGLLLGR WIPGLHTALE GVQLDGISLP IALGLLIMMY PVLAKVRYDR LDTVTGDRKL LLSSLLLNWV LGPALMFALA WLLLADLPEY RTGLIIVGLA RCIAMVIIWN DLACGDREAA AVLVALNSIF QVAMFAALGW FYLSVLPGWL GLEQTTIATS PWQIAKSVLI FLGIPLLAGY LSRRIGEKTK GRNWYESRFL PKVGPWALYG LLFTIVILFA LQGDQITGRP LDVARIALPL LAYFAIMWVG GYLLGAALRL GYRRTTTLAF TAASNNFELA IAVAIATYGA TSGQALAGVV GPLIEVPVLV GLVYVSLALR NRLAGPNATH DADKPSVLFV CVHNAGRSQM AAGLLTHLAG DRIEVRSAGT EPAGQVNPTA VAAMAEMGID ITANAPTLLT GGQVQSSDVV ITMGCGDACP YFPGVSYRNW KLPDPAGQPL DVVRMIRDDI ADRVQALIAE LLATAKTR
|
| |