Gene Mmcs_3836 details

Gene Information

Locus tag: Mmcs_3836
Symbol:
ID: 4112666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 4083384
End bp: 4084721
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 68%
IMG OID: 638032974
Product: general substrate transporter
Protein accession: YP_640997
Protein GI: 108800800
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
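
The length fields can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 4083384
end_bp = 4084721

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported 1338 bp and 445 aa.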


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0813841
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTCTCG CCATGGAGAG TCAGACATCC GCGCCTGCAC CCGCCGAGAC GGCGGCCATC 
CGCAAAGCGG TCGCAGGCGC GTCGATCGGT AACGCCGTCG AATGGTTCGA CTTCGCCATC
TACGGCTTCC TCGCCACTTT CATCGCCGCC AAGTTCTTCC CGGCGGGCAA CGACACGGCC
GCCCTGCTGA ACACGTTCGC GATCTTCGCC GCGGCCTTCT TCATGCGGCC CCTCGGCGGT
TTCGTCTTCG GCCCGCTCGG CGACCGGATC GGACGTCAGC GGGTGCTCGC GGTGGTCATC
CTGCTGATGT CGGCGGCCAC GCTCGGCATC GGCCTGCTGC CGACCTACGC CAGCATCGGC
GTCGCCGCAC CGCTGCTGCT GCTGTTCCTG CGGTGCCTGC AGGGGTTCTC CGCGGGCGGT
GAATACGGTG GCGGCGCAGT CTATCTCGCC GAGTACGCCC GTGACCGCAA CCGCGGGTTG
ACGGTCACGT TCATCGCCTG GTCCGGTGTC GTCGGCTTCC TGCTCGGCTC GGTGACGGTG
ACGGTGCTGC AGGCGCTGTT GCCCACCGCG GCGATGGACA GCTACGGCTG GCGGATCCCG
TTCCTGATCG CCGCACCGCT GGGCCTGGTC GGCCTCTACA TCCGGTTACG CCTCGACGAC
ACGCCCGAGT TCGCGGCGCT GTCGGAGACC GACCGGGTGG CGTCGTCGCC GCTGCGTGAG
GCGGCCACCA CCGCGTGGCG GCCGATCCTG CAGGTGATCG GCCTGTTCAT CGTGTTCAAT
GTCGGCTACT ACGTGGTGTT CACGTTCCTG CCGACCTACT TCATCAAGAC CCTCGAATTC
GGCAAGACGG CGGCGTTCGT CTCGATGTCG CTGGCCAGTC TGGTCGCCCT GGTGCTGATC
CTGCCGCTGG CCGCGCTGTC GGACCGCATC GGCAGGCGTC CGCTGCTGAT CGGCGGGACG
GTGGCGTTCG TGATCCTCGC GTATCCGCTG TTCCTGCTGC TGAATTCCGG TTCGCTCGCC
GCGGCGATCA CCGCGCACTG CGTGCTGGCG GCCATCGAGT CGATCTACGT GTCGACCGCG
GTCACCGCCG GCGTCGAACT GTTCGCCACC CGGGTGCGCT ACAGCGGGTT CTCCATCGGC
TACAACGTGT CGGTCGCGGC GTTCGGCGGG ACGACGCCCT ACGTCGTCAC GTGGCTGACC
GCGACGACCG AGAACAACCT CGCCCCGGCG CTGTACCTGA TCGTCGCGGC CGTCGTATCG
CTGGCGACGC TGCTCACCCT GCGGGAGTCC GCCGGCCGGC CGCTGGCCGC GACAGTGGCC
GCTGACGTGC GACGATGA
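
The GC content field can be recomputed from the gene sequence. This is a rough sketch, assuming GC content means the fraction of G and C bases over the whole CDS; the function name `gc_fraction` is hypothetical, and how the source database rounds the value is not stated in the record.

```python
def gc_fraction(seq):
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Example on the first 10-base block of the gene sequence only.
first_block = gc_fraction("GTGCCTCTCG")
print(f"{first_block:.0%}")
```

Passing the full gene sequence to the same function gives a figure to compare with the GC content reported in the record.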
 
Protein sequence
MPLAMESQTS APAPAETAAI RKAVAGASIG NAVEWFDFAI YGFLATFIAA KFFPAGNDTA 
ALLNTFAIFA AAFFMRPLGG FVFGPLGDRI GRQRVLAVVI LLMSAATLGI GLLPTYASIG
VAAPLLLLFL RCLQGFSAGG EYGGGAVYLA EYARDRNRGL TVTFIAWSGV VGFLLGSVTV
TVLQALLPTA AMDSYGWRIP FLIAAPLGLV GLYIRLRLDD TPEFAALSET DRVASSPLRE
AATTAWRPIL QVIGLFIVFN VGYYVVFTFL PTYFIKTLEF GKTAAFVSMS LASLVALVLI
LPLAALSDRI GRRPLLIGGT VAFVILAYPL FLLLNSGSLA AAITAHCVLA AIESIYVSTA
VTAGVELFAT RVRYSGFSIG YNVSVAAFGG TTPYVVTWLT ATTENNLAPA LYLIVAAVVS
LATLLTLRES AGRPLAATVA ADVRR
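
The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG serving as an initiation codon under translation table 11. The sketch below illustrates this on the first and last codons of the gene sequence. It assumes the standard codon assignments (which table 11 shares for elongation) and a simplified start codon set of ATG, GTG and TTG; `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs mainly in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq):
    """Translate a CDS, reading an initial start codon as M and
    stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
head = translate_cds("GTGCCTCTCG CCATGGAGAG TCAGACATCC")
tail = translate_cds("GCTGACGTGC GACGATGA")
print(head, tail)  # MPLAMESQTS ADVRR
```

Both fragments match the corresponding ends of the listed protein sequence (MPLAMESQTS at the start, ADVRR at the end).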