Gene Mjls_4321 details

Gene Information

Locus tag: Mjls_4321
Symbol:
ID: 4880026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 4559289
End bp: 4562234
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 11
GC content: 71%
IMG OID: 640141629
Product: hypothetical protein
Protein accession: YP_001072583
Protein GI: 126436892
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
        [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID:
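
The listed lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein; both assumptions are consistent with the values in this record but are not stated by it.

```python
# Coordinates copied from the record above.
start_bp = 4559289
end_bp = 4562234

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "2946 981"
```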


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.212999
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGACCGCAG GACGTCGCAC CGCCGGGTTC GACTTCCTCG AACAGCCGGC GCTTCCCGCC 
CCACGGGTCA GCGAAGCCGA AGCGCAGCGC ATCCTCGCCA CCCACTACGG CATCGAGGGC
GATGCCGTCT CCCTGGGCAG TCAGCAGGAC AAGAACTTCC TGGTGCGCCG CACCGGCACC
GGTGAGGTCG CGGGCGTGCT GAAGGCGGCC AACCCCGCGT TCACCGCCGT CGAACTCGCG
GCGCAGGACG CAGCGGCGAC GCTGATCGCC GAGGCCGAAC CGGGCCTGCG GATCGCGGTG
CCGCTGCCCA ACGCCGACGG CGCGAAGGTG ACGACGGTCG ACGGGCTGTT GGTCCGGTTG
CTGCGGTACC TGCCCGGCGG CACGCTGATC GACGCCGATC ACCTCGGTCC GGCCGCGGTC
GCGGGGCTCG GCGAGGTGGC GGCGCGCGTC AGCCGGGCGC TGACGGGCTT CGAACACGCG
GGTCTCGACC GGGTGCTGCA GTGGGATCTG CGCTACGGCG CCGACGTCGT CGCCGCTCTG
ATCGACCACG TCGCCGATCC GGTGCAGCGC GAGCGGCTCT CGACCGCCAC CCGCGACGCC
GCGGAGCGCA TCGGCCGCGT CGCCGACGCA CTCCCCCGCC AGGCGGTGCA CCTCGACATC
ACCGACGCGA ACGTCGTCGT GTCGCGGGCG GCCGACGGCA CCCGCCGACC CGACGGGGTG
ATCGACTTCG GCGATCTCAC CGACACCTGG GCGGTGTCGG AGTTGGCGAT CGCCGCCTCG
TCGGTGCTGG GGCACAGCGG CACCGAACCG GTCTCGATCC TGCCTGCGGT GCGCGCGTTC
CACGGCATCC GGCCGCTGAC GGTCGAGGAG ATCGACGCAC TATGGCCGAT GGTGGTGCTG
CGGACGGCGG TGCTCATCGT CAGCGGAGCC CAGCAGGCCG AACTCGACCC GGACAACGCC
TACGTCACCG ACCAGTCCGA CGGCGAGTGG CGGATGTTCG AACAGGCGAC GTCGGTCCCG
ATCGACGTGA TGACCGCGGC GATCCAGGCC GATCTCGGAT TCGCCGCACC ACCCGCCGAT
GTCACGGCCA CCGTGCCGAT GATCGCCGGT GTGACCGCCG ACGACGTTGT CACACTTGAT
CTCTCACCGA CCTCCGACGC CTACGACTTC GCGTTCACGC CAGGGGGCTG GCTGCCGCCG
GACGTCGACG ACCGGCTCGC CCGCCGCGCC GTGGATGACG GTGCGGCCGT GGTCGTCACG
CGGTTCGGCG AGCCGCGACT CGGTCTGACG CCCGCGCTGA GCCAACGCAG CGGCGACGTC
GTGCCGACCG GTGTCCGGCT CTGGCCGGCG CAACCGCTGA CCCTGTTGGC GCCGTGGGAC
GGTGAGGTCC GCAGCGACGG CGCAGGCGAC ATGGTGACCG TCCGCGGCGA CGCCCACGAG
GTCACGCTGA CCGGTGTGCG CCCCGTGGGC GGCGTTTCCG CCGTGCGGGC CGGTGACCCG
ATCGCACAGG CCGACGCCGC GCAGTGGGCG GACGTCGCGG TGCGCCCGGT CGGCGGGGTG
ACGGCCCCAC CGCTGGTGCG TCCCGAGGTG GCGACGGGCT GGTTGGCGCA GGCACGCGAT
CCGCGCCCGG TGCTCGGCCT GCCGCCGGAC GCCGTCACCT CCCCCGCGGC CGACCTGGTC
GAGCGGCGTG ACCGCAGCTT CGCGCCGGTG CAGGAGCACT ACTACCGCAG GCCGCCGCAG
ATCGAGCGCG GCTGGCGGCA CTACCTGATG TCGACGGCGG GGCGCTGCTA CCTCGACATG
GTCAACAACG TAACCGTGCT GGGGCACGCC CACCCCCGGG TGGCCGACAC CGCGGCCCGC
CAACTGCGCA AGCTCAACAC GAATTCGCGA TTCAACTACG CCGCGGTTGT CGAGTACAGC
GAGCGGCTCG CCGCCGAACT GCCCGATCCC CTGGACACCG TGTTCCTGGT CAATTCCGGT
TCGGAGGCAA GCGATCTGGC GATCCGGCTG GCGCTGGCCG CCACCGGCCG CCGCGACGTC
GTCGCGATGC GCGAGGCGTA CCACGGCTGG ACCTACGGTA CGGATGCGGT GTCGACGTCG
ACCGCCGACA ACCCCAATGC GCTGGCCACC CGGCCGGACT GGGTGCACAC CGTCGAATCG
CCGAACAGCT TCCGCGGCAA GTACCGCGGG GCGGAGGTGG GCCGCTACGC GACCGAGGCC
GTCGCGCAGA TCGAACGACT CGTCGCCGAC GGTCGCGCCC CGGCCGGGTT CATCTGCGAG
TCGGTGTACG GCAACGCCGG CGGGATGGCG CTGCCCGACG GCTATCTGCA GCAGGTGTAC
GCGGCGGTGC GCGGCGCCGG CGGGCTGGCC ATCGCCGACG AGGTCCAGGT CGGTTACGGC
CGTCTCGGAC ACTGGTTCTG GGGGTTCGAG CAGCAGGGCG TGGTGCCCGA CATCGTCTCG
ATGGCGAAGT CGACGGGCAA CGGCTATCCG CTCGGCGCGG TGATCACCAG CCGTGAGGTG
GCCGAGGCGT TCCGCTCCCA GGGATACTTC TTCTCCTCGA CCGGCGGCAG CCCGCTGTCG
TGTGCGATCG GCCTGACCGT GCTCGACGTA CTGCGCGCAG AAGACCTGCA GGGCAACGCC
GTCCGGGTGG GCGGACACCT CAAGGCGCGG TTGGAGGCGC TGGCCGACAG GCATCCGATC
ATCGGCACCG TGCACGGGGT CGGCCTGTAT CTCGGTGTCG AGATGGTCCG CGACCGGCAG
ACGCTGGAAC CGGCGACCGA GGAGACCGCG GCCATCTGCG AGCGGATGCT CGAACTCGGC
GTCGTCATCC AGCCGACGGG TGACCACTCG AACATCCTCA AGACCAAACC GCCGTTGTGC
ATCGACACCG AATCCGCCGA CTTCTACGTC GATGCGCTGG ACCGGGTGCT GACCGAGGGG
TGGTAG
 
Protein sequence
MTAGRRTAGF DFLEQPALPA PRVSEAEAQR ILATHYGIEG DAVSLGSQQD KNFLVRRTGT 
GEVAGVLKAA NPAFTAVELA AQDAAATLIA EAEPGLRIAV PLPNADGAKV TTVDGLLVRL
LRYLPGGTLI DADHLGPAAV AGLGEVAARV SRALTGFEHA GLDRVLQWDL RYGADVVAAL
IDHVADPVQR ERLSTATRDA AERIGRVADA LPRQAVHLDI TDANVVVSRA ADGTRRPDGV
IDFGDLTDTW AVSELAIAAS SVLGHSGTEP VSILPAVRAF HGIRPLTVEE IDALWPMVVL
RTAVLIVSGA QQAELDPDNA YVTDQSDGEW RMFEQATSVP IDVMTAAIQA DLGFAAPPAD
VTATVPMIAG VTADDVVTLD LSPTSDAYDF AFTPGGWLPP DVDDRLARRA VDDGAAVVVT
RFGEPRLGLT PALSQRSGDV VPTGVRLWPA QPLTLLAPWD GEVRSDGAGD MVTVRGDAHE
VTLTGVRPVG GVSAVRAGDP IAQADAAQWA DVAVRPVGGV TAPPLVRPEV ATGWLAQARD
PRPVLGLPPD AVTSPAADLV ERRDRSFAPV QEHYYRRPPQ IERGWRHYLM STAGRCYLDM
VNNVTVLGHA HPRVADTAAR QLRKLNTNSR FNYAAVVEYS ERLAAELPDP LDTVFLVNSG
SEASDLAIRL ALAATGRRDV VAMREAYHGW TYGTDAVSTS TADNPNALAT RPDWVHTVES
PNSFRGKYRG AEVGRYATEA VAQIERLVAD GRAPAGFICE SVYGNAGGMA LPDGYLQQVY
AAVRGAGGLA IADEVQVGYG RLGHWFWGFE QQGVVPDIVS MAKSTGNGYP LGAVITSREV
AEAFRSQGYF FSSTGGSPLS CAIGLTVLDV LRAEDLQGNA VRVGGHLKAR LEALADRHPI
IGTVHGVGLY LGVEMVRDRQ TLEPATEETA AICERMLELG VVIQPTGDHS NILKTKPPLC
IDTESADFYV DALDRVLTEG W
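
The relationship between the two sequences can be illustrated with a minimal sketch that strips the 10-base grouping, translates with the amino acid assignments of translation table 11, and computes a GC fraction. Only the first 60 bases of the gene sequence are used here to keep the example short. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and reading the first codon as methionine is an assumption that holds for an initiator GTG under table 11.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator and is read as Met.
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "GTGACCGCAG GACGTCGCAC CGCCGGGTTC GACTTCCTCG AACAGCCGGC GCTTCCCGCC"
dna = clean(first_line)

print(translate(dna))  # prints "MTAGRRTAGFDFLEQPALPA"
print(round(gc_fraction(dna), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. The GC fraction printed here covers only these 60 bases, so it need not equal the GC content listed for the whole gene.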