Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_1133 |
Symbol | |
ID | 4648568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1202906 |
End bp | 1206277 |
Gene Length | 3372 bp |
Protein Length | 1123 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639804632 |
Product | hypothetical protein |
Protein accession | YP_951975 |
Protein GI | 120402146 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.802022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAT CACGCTGCGA TGCAACCCAA CCCGGGGCCG GGTACCGACT CCAGTACGCC GAGGTCTACA ACTGGGGAAC GTTCGACGAC CATTCCTGGC GGTTCACCCC CGGCACCGAC ACCGCGCTGC TGACCGGCGA CATCGGATCA GGGAAATCGA CAATCGTCGA CGCGCTCACC ACATTGCTGG TGCCCGCGCA CAAGGCCGCC TACAACAAGG CCGCCGGCGC CGACGCCAAG GAGCGCACGC TGCGCTCCTA TGTCGAGGGC CACTACAAAT CCGAGCGCAA CGAATCCACC GGTAGATCCC GTCCGAAAGG CCTACGAGAG AACAAGCGCA CCTACTCGGT GATTCTCGGT GTGTTCCGCA ACCACGGTCA CGACGAGACG GTCACCCTGG CTCAGGTGTT CCAGCAGCGC GAGAGCACCG GGCAGCCCTA CCGGTTCTTC GTGACCGCCA CCAAGGAGCT GTCCATCGCG ACCGACTTCG CCGACTTCGG CACCGATCTG CGCGAACTGC GCAGACGGCT GCGTGGCGCC GGGGCCGAGA TCTTCGATGA GTTCCCGAAG TACTCCACCT CGTTGCGCCG CCTGCTCGGT ATCCGCTCGG AGCAGGCCCT CGAACTGTTC CACCAGACGG TGTCGATGAA GTCGGTCGGG AATCTCAACG ATTTCGTCCG TGATCACATG CTCGAACCCA GCGACTCGAC CGAACGCGTC CGCGAGATCA TCGGACATTT CGAAGATTTG ACCAAAGCGC ACGACGCCGT CAAACGCGCC CGCGAGCAAC TCGAAGCGCT GCAACCGATC GTGGACACCG CAGCGAAATA CGACGCGGCA CTAGCCGAAC GTGCGGGTTT GGAGCTCGAG CGCGCGGCGG TCCGGCTGTT CATCGCCGAG CTGCGCTCAG GGTTGCTCAC CGACGAGATC GCGCGGCTGG AAGCCGACGG AGCGGCGTTG CTGACGCAAC TGGACACCGC CGAAGCCGAA CAGCGCAGGC TCGGCCGCGA GCGCGACTCG CTGATCGAAG AGCGCGCCAA AGCGGGCGGT GACCGCATCG GCGAGCTGGA GCGACTTGCC GCTGACGCCC GCGAGCAGGC GAAAACGCGA AGCCAGACAA AGGCTTTGTT CGACGCCGCA GTGGCAGAGG CCGGACTGGA GCCCGTCGCC GACGGTGACG CCTTCGCCGC GCTCGGCGCC GTGGTCGCCG GCGAACGCCC CCGGCTGACC GGTGAGAAAC GCGACCTCGA CACCGCGACC GTTGACGCGA TCGGTCGCGA GCGGGAGCAC CAACGCAGGT GCGATGTGAT CGCCGAGGAG GTCGCGAGCC TGGAGCAGCG TACCGACAAC CTGCCACAGG AGCAGGTGGT GGTGCGGGCC GAGTTGTGCG CGGCGCTGGG GTTGACGCCG GACGACCTGC CATATGCCGG CGAGTTGCTC GATGTCCATG ACGAGCACGC GCAGTGGCGG GGCGCCGCAG AACGTGTGCT GCGGGGGTTC GCGCTGTCAT TGCTGGTGCC GCAGCGGCAC TACGACGCCG TCACCGCATG GGTGAACGGG CGCAGGCTCA CCGTCGGCGG CCGCGGCGCC AAACTGGTCT ACGAACGCGT CCCCCAACAT CGGGTACGAC TGCAGCCGAC GGCACACGAC GGCTTGTTGC TGGCCGACTG CATCGAAGTC CGAGACGGGC AGTTCGAGGA ATACCTTCGC GCCGAGCTGA TGAAGCGCGC CGACTTCCGC TGCGCAGCAA CACTTGACGA GTTCCGCACC GAGCGTCGCG CCGTCACCCG AGAGGGCCAG GTGCGCTCGG GGGACCGGCA CGAGAAGGAC GACCGGCACC GGGTCGACGA CCCCAGACGG TGGGTACTGG GCTGGGTCAA CGAACGCAAG ATCGCCGCGA TGCGCGCGGA ACTGGCCGAG CTGGAAAGCC AACGTGACGA GGCCGCGGCA CAAGCCGCGC GGCTGGTAAA AGAACGCGAC GCCCTGCAGT GTCGACTGGA TGCGTTCCGG AGTGTCGAGG GGTTCCGCTC CTGGGGCGAA CTCGACGCCG ACGAGGCCGA GTCGCGTGCG AAAGCGCATG ACGCCGAGAG GGTTCGGCTT CAGGCCGGGT CGAATCGACT GGCGGAGATC ACGCAGGCGC TGGAGCGCAA CGCCGAGAAC GCGGTCACAG TAACGGATCT GATCAAGAAG CTCACCGGCA TGCTGGCGAC CGCGCAATCG AGAATGAATC AGGCCAAGCA GGAGCGAAGC CGCGACGACG AGTTCGTCGC TGCGCATGCA CCGGATCAAC GGGAGAAGGC CCGTGCGTCG TACCCGGCCC TGACCGCGCG ACTGGCGGAT AGCCCGCCCG CCCGTGCGGC GGACTGCGCC GACTCCGAGG CGGCATTGTC CGATGACCTG CACCGGCGCA TCGAGCGGTT GTCCGGTCAG CTCAACGGGC ACGCATTGAA TCTGACACAG CACATGACGG CAGTCCTGAA CCGATGGCAA GAGCTGCGGG CGGACATGGA CGTCAATGTC GAATCCCGAG CAGACTTCCT GGCCTTCCGG GAGCGGGTCG CCACCGACGA CCTGCCCCGG TTCGAGAGCG AGTTCAAAGA ACAGCTCAAC AAGAACGCCA TCCAAGAGTT GGCGGGGTTC AATAACTGGT TGGGCCGGCA GGCGTCGGCC ATCGACGAGC GCGTTGACCG AATCAACGAC GCCCTCGGTG CGGTGCCCTA CAACCCGGGC CGATACATCA AGCTGGAAAA GGAACCGACC AGCAATCAGG ATGTCGCGCA GTTCCGCTCC GATCTCCGCA ATCTCACCAA CGACACGCTC ACCGCGGACG GCGACCAGTA TTCCGAGCAG CGCTTTCTGG ACGTGAAGCG GATCATCGAG CGCTTCCGGG GCCGCGACGG CTACGCCGAA TCGGACAAGA ACTGGACGCG TCGCGTTACC GATGTGCGTA ACTGGTTCGT GTTTTCGGCA TCCGAGCGTG ATGTCGACAC CGATCTCGAG TGGGAGCACT ACAGCGACTC CGACGGCAAG TCCGGCGGGC AGAAAGAGAA GCTGGCCTAC ACCATTCTCG CGGCATCGCT GGCCTACCAG TTCGGTCTGG AGTGGGGCGT CGAGCGATCA CGCGACTTCC GGTTCGCGGT CATCGACGAG GCTTTCGGGC GGGGTTCCGA TGTCTCCACA CGGTATGCCC TCGATCTATT CGCGACTCTC GGTCTGCAGC TGCTGATCGT GACACCCTTG CAGAAGGTGC ACGTCATCGA GCCCTATGTG AAATCGATTG GCATCGTCGA CAATCCGACC GGAACCTATT CGCGGTTGCA GACGATGACC ATCGAGGAGT ACCGGGACCG GCGAGACCGG CCACTACGGT GA
|
Protein sequence | MTESRCDATQ PGAGYRLQYA EVYNWGTFDD HSWRFTPGTD TALLTGDIGS GKSTIVDALT TLLVPAHKAA YNKAAGADAK ERTLRSYVEG HYKSERNEST GRSRPKGLRE NKRTYSVILG VFRNHGHDET VTLAQVFQQR ESTGQPYRFF VTATKELSIA TDFADFGTDL RELRRRLRGA GAEIFDEFPK YSTSLRRLLG IRSEQALELF HQTVSMKSVG NLNDFVRDHM LEPSDSTERV REIIGHFEDL TKAHDAVKRA REQLEALQPI VDTAAKYDAA LAERAGLELE RAAVRLFIAE LRSGLLTDEI ARLEADGAAL LTQLDTAEAE QRRLGRERDS LIEERAKAGG DRIGELERLA ADAREQAKTR SQTKALFDAA VAEAGLEPVA DGDAFAALGA VVAGERPRLT GEKRDLDTAT VDAIGREREH QRRCDVIAEE VASLEQRTDN LPQEQVVVRA ELCAALGLTP DDLPYAGELL DVHDEHAQWR GAAERVLRGF ALSLLVPQRH YDAVTAWVNG RRLTVGGRGA KLVYERVPQH RVRLQPTAHD GLLLADCIEV RDGQFEEYLR AELMKRADFR CAATLDEFRT ERRAVTREGQ VRSGDRHEKD DRHRVDDPRR WVLGWVNERK IAAMRAELAE LESQRDEAAA QAARLVKERD ALQCRLDAFR SVEGFRSWGE LDADEAESRA KAHDAERVRL QAGSNRLAEI TQALERNAEN AVTVTDLIKK LTGMLATAQS RMNQAKQERS RDDEFVAAHA PDQREKARAS YPALTARLAD SPPARAADCA DSEAALSDDL HRRIERLSGQ LNGHALNLTQ HMTAVLNRWQ ELRADMDVNV ESRADFLAFR ERVATDDLPR FESEFKEQLN KNAIQELAGF NNWLGRQASA IDERVDRIND ALGAVPYNPG RYIKLEKEPT SNQDVAQFRS DLRNLTNDTL TADGDQYSEQ RFLDVKRIIE RFRGRDGYAE SDKNWTRRVT DVRNWFVFSA SERDVDTDLE WEHYSDSDGK SGGQKEKLAY TILAASLAYQ FGLEWGVERS RDFRFAVIDE AFGRGSDVST RYALDLFATL GLQLLIVTPL QKVHVIEPYV KSIGIVDNPT GTYSRLQTMT IEEYRDRRDR PLR
|
| |