Gene Mjls_3101 details

Gene Information

Locus tag: Mjls_3101
Symbol:
ID: 4878814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3242744
End bp: 3244021
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 67%
IMG OID: 640140401
Product: glycosyl transferase family protein
Protein accession: YP_001071371
Protein GI: 126435680
COG category: [G] Carbohydrate transport and metabolism; [C] Energy production and conversion
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
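
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the record does not state either convention explicitly). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3242744, 3244021)
print(length, protein_length(length))  # 1278 425
```

Under these assumptions the computed values agree with the 1278 bp and 425 aa listed in the record.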


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.61797
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.564364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCG CGGTCGCCAT CCATGGCACC CGGGGTGACG TCGAACCGTG TGCAGCCGTA 
GCCCTGGAAT TGCAGCGCCG TGGACACGAA GTACGGGCTG CGGTACCGCC CAACACGGTC
GGATTCGTCG AAGCTGTGGG CCTGTCTGCC GTCAGTTATG GCCCCGATTC CCAACAGCAG
CTGCAGGGCG ATGTATTCGA ACGACCAGAC GCGCTAACGG CGGCCAGTCC ATCGGACTGG
CTGCGGCTGG GAAACCCGCT CAACGCGCTG CGCAGGGCTC GCGTTGCCGC CACCCGCGGC
TGGGATGAGA TGAGCCAGAC ACTGTTGTCG ATGACCGCGC GTGCCGACCT GGTCGTCACC
GGCACGGCCT ATGAGGAGAT CGCGAGCAAC GTCGCTGAAT TCCGCGGCGT CCCGTTGGCG
GAGGTACATT ACTTTCCGGT CCGCGCCAAC ACCCGCGTAC TGCCGGTTCG ACTACCGCCG
ACGGCGGCCC ACGGCGCGTT CGCCGCGGGT GAATGGATGC ATTGGCAGCT GCTTAAACCC
GCCGAGAGCC GGCAGCGGCG CACCCTGGGT CTACCCCCGG CGACCACTCG GCCGGTGGCA
CGCATCGTGG CCGGCGAGGC TCTGGAGCTT CAGGCCTACG ATCCGGTGTT CTTTCCTGCG
CTGGCGCAGG AGTGGGGCGC CCGGCGCCCT CTCATCGGGT CGATGACGAT GCGGCTTTCC
ACCGAGGTCG ACGGCGAGGT GGCGTCGTGG ATCGCCGCAG GTCCCCCGCC CATCTACTTC
GGATTCGGCA GCATGCCTTT GCACAACCCC ACAGACACGG TGCGTCTCAT TCGTGACGTG
TGCGGCACGC TCGGCACGCG AGCACTGATC TGCGCGGGAA GTTCCGCGTT CGACGACATT
GTTACCACCG AGGATGTCAA GGTCGTTGCC GACGTCAACC ACGCCGCGGT CTTTCCGATG
TGCCGCGCCG TCGTGCACCA TGGCGGGGCG GGCACCACGG CTGCCGGACT GCGCGCCGGC
GTTCCCACCT TGGTGTTGTG GGTGGCCGCC GAACAACCGC TGTGGGGCAA GCAGGTCAAA
CGCCTTGGTG TCGGCACGTA CCGGCGTTTT TCCACCATTA CCCGGAATTC GTTGGTCGCC
GATCTGCAGG TGGTGCTGGC CCCAGGTATG TCTGAGCGCG CGCGTTCGCT CGCTGGGCGA
ATGAGCCGAC CTTCCGATAG CGTCACGACG GCCGCAGACT TGCTCGAGGG GGCGGCTCGC
GCCGGCCGTC TCGGGTGA
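
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction that could be compared against the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGAAGTTCG CGGTCGCCAT")
print(len(demo), gc_fraction(demo))
```

Passing the full sequence text to `clean_sequence` should yield a string whose length can be checked against the Gene Length field.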
 
Protein sequence
MKFAVAIHGT RGDVEPCAAV ALELQRRGHE VRAAVPPNTV GFVEAVGLSA VSYGPDSQQQ 
LQGDVFERPD ALTAASPSDW LRLGNPLNAL RRARVAATRG WDEMSQTLLS MTARADLVVT
GTAYEEIASN VAEFRGVPLA EVHYFPVRAN TRVLPVRLPP TAAHGAFAAG EWMHWQLLKP
AESRQRRTLG LPPATTRPVA RIVAGEALEL QAYDPVFFPA LAQEWGARRP LIGSMTMRLS
TEVDGEVASW IAAGPPPIYF GFGSMPLHNP TDTVRLIRDV CGTLGTRALI CAGSSAFDDI
VTTEDVKVVA DVNHAAVFPM CRAVVHHGGA GTTAAGLRAG VPTLVLWVAA EQPLWGKQVK
RLGVGTYRRF STITRNSLVA DLQVVLAPGM SERARSLAGR MSRPSDSVTT AADLLEGAAR
AGRLG
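
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, standard-library-only illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and it assumes the input contains only A, C, G and T. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAAGTTCG CGGTCGCCAT CCATGGCACC"))  # MKFAVAIHGT
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above.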