Gene Information |
Locus tag | Mmcs_4091 |
Symbol | |
ID | 4112921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 4358490 |
End bp | 4361435 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638033234 |
Product | hypothetical protein |
Protein accession | YP_641252 |
Protein GI | 108801055 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases; [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
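The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4358490, 4361435)
print(length_bp)                  # 2946
print(protein_length(length_bp))  # 981
```

Both values match the Gene Length (2946 bp) and Protein Length (981 aa) fields above.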
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.565558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACCGCAG GACGTCGCAC CGCCGGGTTC GACTTCCTCG AACAGCCGGC GCTTCCCGCC CCACGGGTCA GCGAAGCCGA AGCGCAGCGC ATCCTCGCCA CCCACTACGG CATCGAGGGC GATGCCGTCT CCCTGGGCAG TCAGCAGGAC AAGAACTTCC TGGTGCGCCG CACCGGCACC GGTGAGGTCG CGGGCGTGCT GAAGGTGGCC AACCCCGCGT TCACCGCCGT CGAACTCGCG GCGCAGGACG CAGCGGCGAC GCTGATCGCC GAGGCCGAAC CGGGCCTGCG GATCGCGGTG CCGCTGCCCA ACGCCGACGG CGCGGAGGTG ACGACGGTCG ACGGGCTGTT GGTCCGGTTG CTGCGGTACC TGCCCGGCGG CACGCTGATC GACGCCGATC ACCTCGGTCC GGCCGCGGTC GCGGGGCTCG GCGAGGTGGC GGCGCGCGTC AGCCGGGCGC TGACGGGCTT CGAACACGCG GGTCTCGACC GGGTGCTGCA GTGGGATCTG CGCTACGGCG CCGACGTCGT CGCCGCTCTG ATCGGCCACG TCGCCGATCC GGTGCAGCGC GAGCGGCTCT CGACCGCCAC CCGCGACGCC GCGGAGCGCA TCGGCCGCGT CGCCGACGCA CTCCCCCGCC AGGCGGTGCA CCTCGACATC ACCGACGCGA ACGTCGTCGT GTCGCGGGCG GCCGACGGCA CCCGCCGACC CGACGGGGTG ATCGACTTCG GCGATCTCAC CGACACCTGG GCGGTGTCGG AGTTGGCGAT CGCCGCCTCG TCGGTGCTGG GGCACAGCGG CACCGAACCG GTCTCGATCC TGCCTGCGGT GCGCGCGTTC CACGGCATCC GGCCGCTGAC GGTCGAGGAG ATCGACGCAC TGTGGCCGAT GGTGGTGCTG CGGACGGCGG TGCTCATCGT CAGCGGAGCC CAGCAGGCCG AACTCGACCC GGACAACGCC TACGTCACCG ACCAGTCCGA CGGCGAGTGG CGGATGTTCG AACAGGCGAC GTCGGTCCCG ATCGACGTGA TGACCGCGGT GATCCGGGCC GATCTCGGAT TCGCCGCACC ACCCGCCGAT GTCACGGCCA CCGTGCCGAT GATCGCCGGT GTGACCGCCG AAGACGTTGT CACACTTGAT CTCTCACCGA CATCCGACGC CTACGACTTC GCGTTCACAC CAGGGGGCTG GCTGCCGCCG GACGTCGACG ACCGGCTCGC CCGACGCGCC GTGGGTGACG GTGCGGCCGT GGTCGTCACA CGATTCGGCG AGCCGCGACT CGGTCTGGCG CCCGCGCTGA GCCAACGCAG CGGCGACGTC GTGCCGACCG GTGTCCGGCT CTGGCCGGCG CAACCGCTGA CCCTGGTGGC GCCGTGGGAC GGTGAGGTCG GCAGCGACGG CGCAGGCGAC ACGGTGACGG TCCGCGGCGA CGCCCACGAG GTCACGCTGA CCGGTGTGCG CCCCGTGGGC GGCGCTTCCG CGGTGCGGGC CGGTGACCCG ATCGCGCAGG CCGACGCCGC GCAGTGGGCG GACGTCGCGG TGCGCCCGGT CGGCGGGGTG ACGGCCCCAC CGCTGGTGCG TCCCGAGGTG GCGACGGGCT GGTTGGCGCA GGCGCGCGAT CCGCGCCCGG TGCTCGGCCT GCCGCCGGAC GCCGTCACCT CCCCCGCCGC CGACCTGGTC GAGCGGCGTG ACCGCAGCTT CGCGCCGGTG CAGGAGCACT ACTACCGCAG GCCGCCGCAG ATCGAGCGCG GCTGGCGGCA CTATTTGATG TCGACGGCGG GGCGCTGCTA CCTCGACATG 
GTCAACAACG TGACCGTGCT CGGGCACGCC CACCCCCGGG TGGCCGACAC CGCGGCCCGC CAACTGCGCA AGCTCAACAC GAATTCGCGG TTCAACTACG CCGCGGTCGT CGAGTACAGC GAGCGGCTCG CCGCCGAACT GCCCGATCCC CTGGACACCG TGTTCCTGGT CAATTCCGGT TCGGAGGCAA GCGATCTGGC GATCCGGCTG GCGCTGGCCG CCACCGGCCG CCGCGACGTC GTCGCGATGT GCGAGGCGTA CCACGGCTGG ACGTACGGCA CCGACGCCGT GTCCACGTCG ACCGCCGACA ATCCGAACGC GCTGGCCACC CGGCCGGACT GGGTGCACAC CGTCGAATCG CCCAACAGCT TCCGCGGCAA GTACCGCGGG GCGGACGTGG GCCGCTACGC GACCGAGGCC GTCGCGCAGA TCGAACGACT CGTCGCCGAC GGTCGCGCCC CGGCCGGGTT CATCTGCGAG TCGGTGTACG GCAACGCCGG CGGGATGGCG CTGCCCGACG GCTATCTGCA GCAGGTGTAC GCGGCGGTGC GCGGCGCCGG CGGGCTGGCC ATCGCCGACG AGGTCCAGGT GGGTTACGGC CGTCTCGGAC ACTGGTTCTG GGGGTTCGAG CAGCAGGGCG TGGTGCCCGA CATCGTCTCG ATGGCGAAGT CGACGGGCAA CGGCTATCCG CTCGGCGCGG TGATCACCAG CCGTGAGGTG GCCGAGGCGT TCCGCTCCCA GGGATACTTC TTCTCCTCGA CCGGCGGCAG CCCGCTGTCG TGTGCGATCG GCCTGACCGT GCTCGATGTA CTGCGCGCAG AAGACCTGCA GGGCAACGCC GTCCGGGTGG GCGGACACCT CAAGGCGCGG TTGGAGGCGC TGGCCGACAG GCATCCGATC ATCGGCACCG TGCACGGGGT CGGCCTGTAT CTCGGTGTCG AGATGGTCCG CGACCGGCAG ACGCTGGAAC CGGCGACCGA GGAGACCGCG GCAATCTGCG AGCGGATGCT CGAACTCGGC GTCGTCATCC AGCCGACGGG CGACCACTCG AACATCCTCA AGACCAAACC GCCGTTGTGC ATCGACACCG AATCCGCCGA CTTCTACGTC GATGCGCTGG ACCGGGTGCT GACCGAGGGG TGGTAG
|
Protein sequence | MTAGRRTAGF DFLEQPALPA PRVSEAEAQR ILATHYGIEG DAVSLGSQQD KNFLVRRTGT GEVAGVLKVA NPAFTAVELA AQDAAATLIA EAEPGLRIAV PLPNADGAEV TTVDGLLVRL LRYLPGGTLI DADHLGPAAV AGLGEVAARV SRALTGFEHA GLDRVLQWDL RYGADVVAAL IGHVADPVQR ERLSTATRDA AERIGRVADA LPRQAVHLDI TDANVVVSRA ADGTRRPDGV IDFGDLTDTW AVSELAIAAS SVLGHSGTEP VSILPAVRAF HGIRPLTVEE IDALWPMVVL RTAVLIVSGA QQAELDPDNA YVTDQSDGEW RMFEQATSVP IDVMTAVIRA DLGFAAPPAD VTATVPMIAG VTAEDVVTLD LSPTSDAYDF AFTPGGWLPP DVDDRLARRA VGDGAAVVVT RFGEPRLGLA PALSQRSGDV VPTGVRLWPA QPLTLVAPWD GEVGSDGAGD TVTVRGDAHE VTLTGVRPVG GASAVRAGDP IAQADAAQWA DVAVRPVGGV TAPPLVRPEV ATGWLAQARD PRPVLGLPPD AVTSPAADLV ERRDRSFAPV QEHYYRRPPQ IERGWRHYLM STAGRCYLDM VNNVTVLGHA HPRVADTAAR QLRKLNTNSR FNYAAVVEYS ERLAAELPDP LDTVFLVNSG SEASDLAIRL ALAATGRRDV VAMCEAYHGW TYGTDAVSTS TADNPNALAT RPDWVHTVES PNSFRGKYRG ADVGRYATEA VAQIERLVAD GRAPAGFICE SVYGNAGGMA LPDGYLQQVY AAVRGAGGLA IADEVQVGYG RLGHWFWGFE QQGVVPDIVS MAKSTGNGYP LGAVITSREV AEAFRSQGYF FSSTGGSPLS CAIGLTVLDV LRAEDLQGNA VRVGGHLKAR LEALADRHPI IGTVHGVGLY LGVEMVRDRQ TLEPATEETA AICERMLELG VVIQPTGDHS NILKTKPPLC IDTESADFYV DALDRVLTEG W
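The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute GC content, and check the start and stop codons of the gene sequence. It rests on stated assumptions: the helper names are hypothetical, the start codon set is only a common subset of the initiators allowed by translation table 11, and the way this database derived its own GC content figure is not known.

```python
# Common bacterial start codons (a subset of those permitted by translation table 11).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_ends(seq: str) -> bool:
    """True if the sequence begins with a start codon and ends with a stop codon."""
    seq = clean_sequence(seq)
    return len(seq) >= 6 and seq[:3] in START_CODONS and seq[-3:] in STOP_CODONS


# The first two and last blocks of the gene sequence, used only as a short demonstration.
fragment = "GTGACCGCAG GACGTCGCAC GACCGAGGGG TGGTAG"
print(len(clean_sequence(fragment)))
print(round(gc_fraction(fragment) * 100))
print(has_valid_ends(fragment))
```

Pasting the full gene sequence in place of `fragment` gives values that can be compared with the Gene Length and GC content fields. The gene begins with GTG while the protein begins with M, which is expected: under translation table 11, GTG is a valid initiation codon and is translated as methionine at the start position.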