Gene Mlut_20210 details


Gene Information

Locus tag: Mlut_20210
Symbol:
ID: 7985231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Micrococcus luteus NCTC 2665
Kingdom: Bacteria
Replicon accession: NC_012803
Strand:
Start bp: 2181346
End bp: 2182704
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 69%
IMG OID: 644806961
Product: arabinose efflux permease family protein
Protein accession: YP_002958049
Protein GI: 239918491
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
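
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(2181346, 2182704)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # prints: 1359 452
```

Both values agree with the Gene Length and Protein Length fields of the record.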


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.272116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCACA CCGTCGCATC CCCCGCGGCG ACCGCCCCCG CGCGTGACCG CCGCGAGGAG 
CGCAAGGTCA TCGCCGGCAC CGTCGTCGGC ACCACCATCG AGTGGTACGA CTTCTTCATC
TTCGCCCAGG CCACCGCACT CGTCTTCGCC GCCCTCTTCT TCCAGCCGAT GGGCGAGAAC
GGCTCCCAGA TCGCCGCGTG GGCCACCCTC GGCATCTCCT TCCTCATCCG CCCGCTCGGC
GCGATCATCG CCGGCCACCT CGGCGACCGC TTCGGCCGCA AGTTCGTCCT GTCCCTCACG
CTGATCGGCA TGGGCCTGGC CACCACCCTC ATCGGCCTGC TGCCCACCTA CGCCCAGATC
GGTGTGTGGG CCCCGATCCT GCTCGTCGTG CTGCGCCTGC TGCAGGGCCT GTCCGCCGGC
GGCGAGTGGG GCGGCGCGGC GCTGCTGTCC GTGGAGCACG CCCCCCACGG CAAGCGCGGC
CTGTTCGGCT CCGCTCCGCA GATCGGCGTG CCGCTGGGCA TGATCCTGGC CACCGGCGTG
CTGTTCATCG TGCGCTCCAC CATGTCCGAG GAGCAGTTCC TCGCCTGGGG CTGGCGCATC
CCGTTCCTGA TCTCCGTGGT GCTGATCGTC GTCGGCTACC TGATCCGCAA GGCCGTCGAG
GAGTCCCCGG TCTTCAAGGA GATGCAGCAG CTCAAGGTGG ACGAGTCCGC CCCGCTGGGC
GAGCTCTTCC GGCACCACAC CAAGGAGGTC ATCCTCGCCG CCGTGATCTT CGCCGCGAAC
AACGGCGTCG GCTACCTGCT CATCGCGTGG TTCTCGAAGT ACGGCGGCCC GAAGGGCCTG
GGCATGACCT CCTCCGAGGT GCTCATCGCG AGCCTCATCG GCGGCGTCGG CTGGTTCATC
TTCACCCTGC TCGGCGGCTG GGTCTCGGAC AAGATCGGCC GCAAGCTGAC CTTCGTCCTC
GGCTACGGCT TCCTGATCGT CTGGGCCTTC CCGCTGTTCG GGCTGCTCAA CACCGCGTCT
CTGCCGCTGT TCTCGCTGGG CCTGTTCGTT CTGACCCTCG GCCTGGGCCC GTCCTACGGC
CCGCAGTCGG CGATGTACGC CGAGATGTTC CCGGCCCGCG TCCGCTTCTC CGGCGTCTCC
ATCGGCTACG CGCTCGGCAC CATCATCGGC GGCGCCTTCG CCCCGCTGAT CGCCGACCAG
CTCGTGAAGA CCGGCTGGGA GAACGTGGCC TGGTACATCA TCGCGATCTC CGCGGTCTCG
CTCATCGCCG TCCTGTTCGT CCCCAAGGGC ATCCAGGACC GCGAGCTGCA CGATGAGCAG
GTCGTGGCCA CGCGCTCGAA CCCGGTGGTG CCGGCCTGA
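
The GC content reported for this gene can be recomputed from the block-formatted sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks separating the 10-base groups. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full sequence to check the whole gene.
first_line = "ATGAGCCACA CCGTCGCATC CCCCGCGGCG ACCGCCCCCG CGCGTGACCG CCGCGAGGAG "
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}%")
```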
 
Protein sequence
MSHTVASPAA TAPARDRREE RKVIAGTVVG TTIEWYDFFI FAQATALVFA ALFFQPMGEN 
GSQIAAWATL GISFLIRPLG AIIAGHLGDR FGRKFVLSLT LIGMGLATTL IGLLPTYAQI
GVWAPILLVV LRLLQGLSAG GEWGGAALLS VEHAPHGKRG LFGSAPQIGV PLGMILATGV
LFIVRSTMSE EQFLAWGWRI PFLISVVLIV VGYLIRKAVE ESPVFKEMQQ LKVDESAPLG
ELFRHHTKEV ILAAVIFAAN NGVGYLLIAW FSKYGGPKGL GMTSSEVLIA SLIGGVGWFI
FTLLGGWVSD KIGRKLTFVL GYGFLIVWAF PLFGLLNTAS LPLFSLGLFV LTLGLGPSYG
PQSAMYAEMF PARVRFSGVS IGYALGTIIG GAFAPLIADQ LVKTGWENVA WYIIAISAVS
LIAVLFVPKG IQDRELHDEQ VVATRSNPVV PA
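
The protein sequence can be checked against the gene sequence by translating it. The following is a minimal sketch, assuming NCBI translation table 11, whose amino-acid assignments are the same as the standard code (the tables differ in which codons may act as start codons). `translate` is an illustrative helper, not a database function, and it does not special-case alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering used by the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a whitespace-formatted DNA sequence, dropping one trailing stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # the stop codon is not part of the protein
    return "".join(protein)


# First 30 bases and the last line of the gene sequence above.
print(translate("ATGAGCCACA CCGTCGCATC CCCCGCGGCG"))             # prints: MSHTVASPAA
print(translate("GTCGTGGCCA CGCGCTCGAA CCCGGTGGTG CCGGCCTGA"))   # prints: VVATRSNPVVPA
```

Both fragments match the start and end of the protein sequence listed above.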