Gene Mmcs_4091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4091 
Symbol 
ID4112921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4358490 
End bp4361435 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content72% 
IMG OID638033234 
Producthypothetical protein 
Protein accessionYP_641252 
Protein GI108801055 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.565558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGCAG GACGTCGCAC CGCCGGGTTC GACTTCCTCG AACAGCCGGC GCTTCCCGCC 
CCACGGGTCA GCGAAGCCGA AGCGCAGCGC ATCCTCGCCA CCCACTACGG CATCGAGGGC
GATGCCGTCT CCCTGGGCAG TCAGCAGGAC AAGAACTTCC TGGTGCGCCG CACCGGCACC
GGTGAGGTCG CGGGCGTGCT GAAGGTGGCC AACCCCGCGT TCACCGCCGT CGAACTCGCG
GCGCAGGACG CAGCGGCGAC GCTGATCGCC GAGGCCGAAC CGGGCCTGCG GATCGCGGTG
CCGCTGCCCA ACGCCGACGG CGCGGAGGTG ACGACGGTCG ACGGGCTGTT GGTCCGGTTG
CTGCGGTACC TGCCCGGCGG CACGCTGATC GACGCCGATC ACCTCGGTCC GGCCGCGGTC
GCGGGGCTCG GCGAGGTGGC GGCGCGCGTC AGCCGGGCGC TGACGGGCTT CGAACACGCG
GGTCTCGACC GGGTGCTGCA GTGGGATCTG CGCTACGGCG CCGACGTCGT CGCCGCTCTG
ATCGGCCACG TCGCCGATCC GGTGCAGCGC GAGCGGCTCT CGACCGCCAC CCGCGACGCC
GCGGAGCGCA TCGGCCGCGT CGCCGACGCA CTCCCCCGCC AGGCGGTGCA CCTCGACATC
ACCGACGCGA ACGTCGTCGT GTCGCGGGCG GCCGACGGCA CCCGCCGACC CGACGGGGTG
ATCGACTTCG GCGATCTCAC CGACACCTGG GCGGTGTCGG AGTTGGCGAT CGCCGCCTCG
TCGGTGCTGG GGCACAGCGG CACCGAACCG GTCTCGATCC TGCCTGCGGT GCGCGCGTTC
CACGGCATCC GGCCGCTGAC GGTCGAGGAG ATCGACGCAC TGTGGCCGAT GGTGGTGCTG
CGGACGGCGG TGCTCATCGT CAGCGGAGCC CAGCAGGCCG AACTCGACCC GGACAACGCC
TACGTCACCG ACCAGTCCGA CGGCGAGTGG CGGATGTTCG AACAGGCGAC GTCGGTCCCG
ATCGACGTGA TGACCGCGGT GATCCGGGCC GATCTCGGAT TCGCCGCACC ACCCGCCGAT
GTCACGGCCA CCGTGCCGAT GATCGCCGGT GTGACCGCCG AAGACGTTGT CACACTTGAT
CTCTCACCGA CATCCGACGC CTACGACTTC GCGTTCACAC CAGGGGGCTG GCTGCCGCCG
GACGTCGACG ACCGGCTCGC CCGACGCGCC GTGGGTGACG GTGCGGCCGT GGTCGTCACA
CGATTCGGCG AGCCGCGACT CGGTCTGGCG CCCGCGCTGA GCCAACGCAG CGGCGACGTC
GTGCCGACCG GTGTCCGGCT CTGGCCGGCG CAACCGCTGA CCCTGGTGGC GCCGTGGGAC
GGTGAGGTCG GCAGCGACGG CGCAGGCGAC ACGGTGACGG TCCGCGGCGA CGCCCACGAG
GTCACGCTGA CCGGTGTGCG CCCCGTGGGC GGCGCTTCCG CGGTGCGGGC CGGTGACCCG
ATCGCGCAGG CCGACGCCGC GCAGTGGGCG GACGTCGCGG TGCGCCCGGT CGGCGGGGTG
ACGGCCCCAC CGCTGGTGCG TCCCGAGGTG GCGACGGGCT GGTTGGCGCA GGCGCGCGAT
CCGCGCCCGG TGCTCGGCCT GCCGCCGGAC GCCGTCACCT CCCCCGCCGC CGACCTGGTC
GAGCGGCGTG ACCGCAGCTT CGCGCCGGTG CAGGAGCACT ACTACCGCAG GCCGCCGCAG
ATCGAGCGCG GCTGGCGGCA CTATTTGATG TCGACGGCGG GGCGCTGCTA CCTCGACATG
GTCAACAACG TGACCGTGCT CGGGCACGCC CACCCCCGGG TGGCCGACAC CGCGGCCCGC
CAACTGCGCA AGCTCAACAC GAATTCGCGG TTCAACTACG CCGCGGTCGT CGAGTACAGC
GAGCGGCTCG CCGCCGAACT GCCCGATCCC CTGGACACCG TGTTCCTGGT CAATTCCGGT
TCGGAGGCAA GCGATCTGGC GATCCGGCTG GCGCTGGCCG CCACCGGCCG CCGCGACGTC
GTCGCGATGT GCGAGGCGTA CCACGGCTGG ACGTACGGCA CCGACGCCGT GTCCACGTCG
ACCGCCGACA ATCCGAACGC GCTGGCCACC CGGCCGGACT GGGTGCACAC CGTCGAATCG
CCCAACAGCT TCCGCGGCAA GTACCGCGGG GCGGACGTGG GCCGCTACGC GACCGAGGCC
GTCGCGCAGA TCGAACGACT CGTCGCCGAC GGTCGCGCCC CGGCCGGGTT CATCTGCGAG
TCGGTGTACG GCAACGCCGG CGGGATGGCG CTGCCCGACG GCTATCTGCA GCAGGTGTAC
GCGGCGGTGC GCGGCGCCGG CGGGCTGGCC ATCGCCGACG AGGTCCAGGT GGGTTACGGC
CGTCTCGGAC ACTGGTTCTG GGGGTTCGAG CAGCAGGGCG TGGTGCCCGA CATCGTCTCG
ATGGCGAAGT CGACGGGCAA CGGCTATCCG CTCGGCGCGG TGATCACCAG CCGTGAGGTG
GCCGAGGCGT TCCGCTCCCA GGGATACTTC TTCTCCTCGA CCGGCGGCAG CCCGCTGTCG
TGTGCGATCG GCCTGACCGT GCTCGATGTA CTGCGCGCAG AAGACCTGCA GGGCAACGCC
GTCCGGGTGG GCGGACACCT CAAGGCGCGG TTGGAGGCGC TGGCCGACAG GCATCCGATC
ATCGGCACCG TGCACGGGGT CGGCCTGTAT CTCGGTGTCG AGATGGTCCG CGACCGGCAG
ACGCTGGAAC CGGCGACCGA GGAGACCGCG GCAATCTGCG AGCGGATGCT CGAACTCGGC
GTCGTCATCC AGCCGACGGG CGACCACTCG AACATCCTCA AGACCAAACC GCCGTTGTGC
ATCGACACCG AATCCGCCGA CTTCTACGTC GATGCGCTGG ACCGGGTGCT GACCGAGGGG
TGGTAG
 
Protein sequence
MTAGRRTAGF DFLEQPALPA PRVSEAEAQR ILATHYGIEG DAVSLGSQQD KNFLVRRTGT 
GEVAGVLKVA NPAFTAVELA AQDAAATLIA EAEPGLRIAV PLPNADGAEV TTVDGLLVRL
LRYLPGGTLI DADHLGPAAV AGLGEVAARV SRALTGFEHA GLDRVLQWDL RYGADVVAAL
IGHVADPVQR ERLSTATRDA AERIGRVADA LPRQAVHLDI TDANVVVSRA ADGTRRPDGV
IDFGDLTDTW AVSELAIAAS SVLGHSGTEP VSILPAVRAF HGIRPLTVEE IDALWPMVVL
RTAVLIVSGA QQAELDPDNA YVTDQSDGEW RMFEQATSVP IDVMTAVIRA DLGFAAPPAD
VTATVPMIAG VTAEDVVTLD LSPTSDAYDF AFTPGGWLPP DVDDRLARRA VGDGAAVVVT
RFGEPRLGLA PALSQRSGDV VPTGVRLWPA QPLTLVAPWD GEVGSDGAGD TVTVRGDAHE
VTLTGVRPVG GASAVRAGDP IAQADAAQWA DVAVRPVGGV TAPPLVRPEV ATGWLAQARD
PRPVLGLPPD AVTSPAADLV ERRDRSFAPV QEHYYRRPPQ IERGWRHYLM STAGRCYLDM
VNNVTVLGHA HPRVADTAAR QLRKLNTNSR FNYAAVVEYS ERLAAELPDP LDTVFLVNSG
SEASDLAIRL ALAATGRRDV VAMCEAYHGW TYGTDAVSTS TADNPNALAT RPDWVHTVES
PNSFRGKYRG ADVGRYATEA VAQIERLVAD GRAPAGFICE SVYGNAGGMA LPDGYLQQVY
AAVRGAGGLA IADEVQVGYG RLGHWFWGFE QQGVVPDIVS MAKSTGNGYP LGAVITSREV
AEAFRSQGYF FSSTGGSPLS CAIGLTVLDV LRAEDLQGNA VRVGGHLKAR LEALADRHPI
IGTVHGVGLY LGVEMVRDRQ TLEPATEETA AICERMLELG VVIQPTGDHS NILKTKPPLC
IDTESADFYV DALDRVLTEG W