Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_2628 |
Symbol | |
ID | 4643398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 2777073 |
End bp | 2780411 |
Gene Length | 3339 bp |
Protein Length | 1112 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639806110 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_953442 |
Protein GI | 120403613 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0329937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATCA AGGTGGCGCT GGAGCACCGC ACCAGCTACA CGTTCGACCG ACTCGTCGAG GTGTATCCGC ACGTCGTGCG CTTGCGTCCC GCGCCGCATT CCCGGACCCC GATCGAGGCC TATTCCCTGC AGGTCGAACC GGCGGATCAC TTCATCAACT GGCAGCAGGA CGCCTTCGGC AACTTTCTGG CCCGGCTGGT GTTCCCGACA CGCACCCGCA GCCTCACCGT CACCGTGGGA TTGATCGCCG ACATGAAGGT GGTCAACCCG TTCGACTTCT TCATCGAGGA CTACGCCGAG CAGGTGGGCT TCACCTATCC CGAGTCGCTG GCCGAGGATC TGAAGCCGTA TCTGCGTCCG GTCGACGAGG AGGGGCCGGG TTCGGGGCCC GGCGACCTCG TCGAGGCGTG GGTGAAGAAC TTCACCGTCG CCTCCGGCAC CCGCACCATC GACTACCTCG TCGCGCTCAA CCGTGCCGTC AACGCCGACG TCGGCTATTC GGTCCGGATG GAGCCCGGGG TGCAGACACC GGATTTCACG CTGCGCACCG GCATCGGTTC GTGCCGTGAC TCGGCGTGGC TGCTGGTGTC GATCCTGCGG CAGCTCGGGT TGGCGGCCCG GTTCGTCTCG GGCTACCTCG TCCAGCTGAC CTCCGACATC GAGGCGCTCG ACGGCCCGTC GGGGCCCGCC GCCGACTTCA CCGACCTGCA CGCCTGGACC GAGGTCTACA TCCCCGGCGC AGGCTGGATC GGACTCGACC CGACCTCGGG GCTGTTCGCC GGTGAGGGCC ACATTCCACT GTCGGCGACG CCGCATCCCG AGTCGGCGGC GCCGATCACC GGTGCCACCG AACCGTGCGA GACCACCCTG GAGTTCAGCA ACGTCGTCAC CCGGGTGCAC GAGGACCCGA GGGTGACGCT GCCCTACACC GAGGAGTCGT GGGCGGCGAT CAACGCGCTC GGCCGGCGCG TGGACGAACG GCTGACCGCG GGCGACGTGC GGTTGACGGT CGGGGGTGAG CCGACGTTCG TGTCGATCGA CAACCAGGTC GACCCGGAAT GGACCACCGA CGCCGACGGC CCACACAAGC GCGAGCGCGC CTCCGCGCTG GCGGCCAGGT TGAAGAAGGT GTGGGCGCCG CAGGGGCTGG TGCAGCGTAG CCAGGGCAAG TGGTATCCGG GAGAACCGTT GCCGCGCTGG CAGATCGGAC TGTTCTGGCG AACCGACGGC GAACCTTTGT GGGCTGACGA GTCCCTTCTC GCCGACCCCT GGCAGGAGAC GTCGGACACC CGCACGCCGA CTCCGGACGC CGGCCACCGG CTGCTCGCTG TCATCGCCGA CGGGCTGGGC CTGCCCGCCG GTCACGTCCG TCCGGCCTAC GAGGATCCGC TGGCCCGGCT GGTCGGTGCG GTGCGGCAAC CCGCGGGGCC ACCGGTCGAC GCCGACGACG ACCTGGCAGT CGACTCTGCG GACGGCCGCG CGCAGCTGCT GGCCCGTCTG GAAGAGTCCG TCACCGAACC GTCCGCGTTC GTGTTGCCGG TGCACCGCCG CGACGACGAC TCCGGATGGG CCGGCGCCCA GTGGACGCTG CGCCGCGGCC GCGTCGTGCT GTTGGAGGGG GATTCGCCGG CGGGCCTGCG GTTGCCGCTG CACTCCATCA GCTGGCAGCC GCCGCGGCCG ACCTTCGACG CCGACCCGAT CGAACGTCGC CCTCCGTTGC CACGCGCCGG ATCCGTCACG GCGGCCGCCT CGCCGGACAT GGCGACGACG GTCGAGGACG CCGACTGGGT CCCGACGACC GCGCTGGTCG GCGAGATCCG CGACGGCCTG TTGTATGTGT TCCTGCCGCC CACCGAGGAG TTGGAGCACT TCGTCGACCT CGTCGAGCGG ATCGAGGCGG CGGCCGCCGC GATCGACTGC CCGGTCGTGA TCGAGGGATA CGGACCACCC AACGACGCCC GGCTGAGCTC GGTGACGATC ACACCCGACC CCGGCGTCAT CGAGGTCAAC GTCGCGCCGA CGGCAAGCTT CGCCGAGCAG CGCGCGCAAC TCGAAACGCT CTACGCCGAA GCCCGGCTGG CCCGGCTGTC CACCGAGTCG TTCGACGTGG ACGGCTCCCA CGGCGGCACC GGCGGCGGAA ACCACATCAC GCTGGGCGGC ATCACCCCCG CCGATTCGCC GTTGCTGCGA CGGCCCGACC TGCTGGTGTC GTTGCTGACC TACTGGCAGC GGCACCCGTC GCTGTCGTAC CTGTTCGCCG GCCGGTTCAT CGGTACCACG TCGCAGGCGC CGCGGGTCGA CGAGGGTCGG CCCGAGTCGC TCTACGAGCT CGAGATCGCC TTCGCCGAGA TCGCCCGGCT CGCACAGGCG CCGGGCGGTG CCAAGGCCTG GGTCACCGAC CGGGCGCTGC GGCACCTGCT CACCGACATC ACCGGCAACA CCCACCGCGC GGAGTTCTGC ATCGACAAGC TGTACAGCCC CGACAGTGCG CGGGGCCGGC TGGGGCTGCT GGAACTGCGG GGCTTCGAGA TGCCGCCGCA CCACCAGATG GCGATGGTGC AGTCGCTTCT GGTGCGCGCG CTGGTGGCCT GGTTCTGGGA GGAGCCGCTG CGCGCTCCGC TGATCCGCCA CGGCGCCAAT CTGCATGGCC GGTATCTGTT GCCGCACTTC CTGATTCACG ACATCGCCGA GGTTGCCGCC GATCTGCGGG CCCATGGTGT GGAGTTCGAC ACCAGCTGGC TGGATCCGTT CACCGAGTTC CGGTTCCCGC GCATCGGGAC CGCGGTGCTC GGGGGTGTCG AGCTCGAGCT GCGCGATGCC ATCGAGCCGT GGAATGTGCT GGGGGAGGAG GCGACCGCCG GAGGCACCGC CCGCTATGTC GACTCGTCGG TGGAACGCCT GCAGGTGCGG CTGATCGGGG CGGATCGGCA GCGCCACGTC GTGGTGGTCA ACGGATTCCC GGTGCCGCTG CTGGCCACCG ACAACCCCGA CGTCCAGGTC GGTGGCGTGC GGTACCGGGC CTGGCAACCG CCCAGCGCGC TGCATCCGAC GATCACCGTG GACGGGCCGC TGCGCTTCGA ACTCGTCGAC GCCGCGGCGG GGGTCTCCCG CGGGGGCTGC ACCTACCACG TCTCCCATCC CGGCGGCCGC TCCTATGACC GGCCGCCGGT CAACGCGGTC GAGGCCGAGT CGCGCCGGGG CAGGCGCTTC GAGGCGACCG GCTTCACTCC GGGCAGGGTC GATCTGGCGG ATCTCCGCGA GAAGCAGGCG CGTCAGTCCA CCGATGTGGG AGCGCCGGGG ATCTTGGATC TGCGGCGAGT GCGTACCGTT CTGCAGTAA
|
Protein sequence | MGIKVALEHR TSYTFDRLVE VYPHVVRLRP APHSRTPIEA YSLQVEPADH FINWQQDAFG NFLARLVFPT RTRSLTVTVG LIADMKVVNP FDFFIEDYAE QVGFTYPESL AEDLKPYLRP VDEEGPGSGP GDLVEAWVKN FTVASGTRTI DYLVALNRAV NADVGYSVRM EPGVQTPDFT LRTGIGSCRD SAWLLVSILR QLGLAARFVS GYLVQLTSDI EALDGPSGPA ADFTDLHAWT EVYIPGAGWI GLDPTSGLFA GEGHIPLSAT PHPESAAPIT GATEPCETTL EFSNVVTRVH EDPRVTLPYT EESWAAINAL GRRVDERLTA GDVRLTVGGE PTFVSIDNQV DPEWTTDADG PHKRERASAL AARLKKVWAP QGLVQRSQGK WYPGEPLPRW QIGLFWRTDG EPLWADESLL ADPWQETSDT RTPTPDAGHR LLAVIADGLG LPAGHVRPAY EDPLARLVGA VRQPAGPPVD ADDDLAVDSA DGRAQLLARL EESVTEPSAF VLPVHRRDDD SGWAGAQWTL RRGRVVLLEG DSPAGLRLPL HSISWQPPRP TFDADPIERR PPLPRAGSVT AAASPDMATT VEDADWVPTT ALVGEIRDGL LYVFLPPTEE LEHFVDLVER IEAAAAAIDC PVVIEGYGPP NDARLSSVTI TPDPGVIEVN VAPTASFAEQ RAQLETLYAE ARLARLSTES FDVDGSHGGT GGGNHITLGG ITPADSPLLR RPDLLVSLLT YWQRHPSLSY LFAGRFIGTT SQAPRVDEGR PESLYELEIA FAEIARLAQA PGGAKAWVTD RALRHLLTDI TGNTHRAEFC IDKLYSPDSA RGRLGLLELR GFEMPPHHQM AMVQSLLVRA LVAWFWEEPL RAPLIRHGAN LHGRYLLPHF LIHDIAEVAA DLRAHGVEFD TSWLDPFTEF RFPRIGTAVL GGVELELRDA IEPWNVLGEE ATAGGTARYV DSSVERLQVR LIGADRQRHV VVVNGFPVPL LATDNPDVQV GGVRYRAWQP PSALHPTITV DGPLRFELVD AAAGVSRGGC TYHVSHPGGR SYDRPPVNAV EAESRRGRRF EATGFTPGRV DLADLREKQA RQSTDVGAPG ILDLRRVRTV LQ
|
| |