Gene Namu_2212 details

Gene Information

Locus tag: Namu_2212
Symbol:
ID: 8447823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2440743
End bp: 2442203
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 72%
IMG OID: 645041334
Product: major facilitator superfamily MFS_1
Protein accession: YP_003201578
Protein GI: 258652422
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
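
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2440743, 2442203)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.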


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.189935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00984822
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCGCC TGCGGGCCGG CTGGCGTTGG CTGGCCCAGC GCCCCCTGAT CATCTGGTTC 
GCGGCCGTCG GCTGCTACCT GCTCGCCGTC TTCCACCGCA CCTCGCTCGG CGTCGCCGGT
CCGCTGGCCG CCCAACGGCT GGATCTGTCC GCCGGTCAGC TGAGCAGCTT CGTCATGCTG
CAGCTGGGCG TGTACGCGGC CATGCAGGTG CCCGCCGGCA TCCTGATCGA CCGGTTCGGT
CCGCGCCGGA TGCTGCTGAC CGCCACCTTG ATCATGGGCT CGGCCCAGGT GCTGTTCTCG
CTGGTGCACA GCTACCCGCT GGCCCTGCTG GCCCGGGGCC TGCTCGGCGT CGGCGACGCG
ATGACCTACA TCTCGGTGCT GCGGCTGGTC GCCGGCTGGT TCCCGGCTCG CCGCTACCCG
ATGATGGTCG GCCTGACCAG CCTGACCGGG ATGCTGGGCA ACGTGGTGGC CACCGTGCCG
CTGACCCTGA TGCTCACCGA TCTGGGCTGG GGCCCGACCT TCGCCATCGC CGGCGGGCTG
TCCCTGGGCT ACGCCCTGCT GTTGCTGCGC CCGGCGGTGG CCGCCCCCTA CCGCGAGGCG
GAGAGCGCGG CCGGCCCGAT GGGTGGTCGC GAGGTGCTGG CCCAGGTCAA GCACGCGTGG
ACGCTGCCGG CCGGCCGGCT CGGCTTCTGG ATCCACCTGT CGACGATGGC CGGGCCGACC
GTGTTCGCCG TGCTCTGGGG TTACCCGTAC CTGACCCAGG CCCTGGGCTA CTCGCCCGGG
GTGGCCTCCT CGCTGCTGAT GCTGCTGGTC TTCGGCGGCC TGATCGGCAA CCTGATGCTC
GGACCGGTCA TCGCCCGGCG CCCGTTCATC CGCGGCTGGC TCGCGCTGAT CGTGGCCGGG
CTGTGCCTGA CCGGCTGGCT GGTGCTGATC GGCTGGCCCA ACGGGCAGCC GCCGGTGGCC
GTGGTCGTCG CTGTCATCTG CGTCTTCTCC GTCGGCGGAC CGGCCTCGGG CATCGGGTTC
ATGCTGGCCC GGGACTACAA CCCGCGGCAT CGCATCTCCA CCGCGACCGG TCTGGTCAAC
GTCGGCGGCT TCTGCGGCGC CGTGGTCATG GTGTTCGCGG TCGGCCAGAT CCTGGACCTG
GTCGAGCCGG GGGCCGTCAC CCACACCGCG GCCGCCTACC GGTGGGCGTT CGGCGCGATC
GCGCTGCTCA CCGCGTTCGG CATTGCCCGG ATGATCACCT GGATCCTGCG TACCCGGGTC
CACGTGCTGC GGGCCGCGGC CCGCGGGGAA GATGTCCCGG TGGAGATCGT GCCCCATCGC
TGGGACCTGC TGGACACCGG AGAATTCCGC CTGGTCACCG GGACCAAGCC CGGCGAGGTG
CGGTTCGAGG ACCTGAGCCG CCGACCGGTG GTCGACCCCT CGACCGACCA GCGACCCGCG
CCCGTGCGGC GCGAGCCGTG A
 
Protein sequence
MNRLRAGWRW LAQRPLIIWF AAVGCYLLAV FHRTSLGVAG PLAAQRLDLS AGQLSSFVML 
QLGVYAAMQV PAGILIDRFG PRRMLLTATL IMGSAQVLFS LVHSYPLALL ARGLLGVGDA
MTYISVLRLV AGWFPARRYP MMVGLTSLTG MLGNVVATVP LTLMLTDLGW GPTFAIAGGL
SLGYALLLLR PAVAAPYREA ESAAGPMGGR EVLAQVKHAW TLPAGRLGFW IHLSTMAGPT
VFAVLWGYPY LTQALGYSPG VASSLLMLLV FGGLIGNLML GPVIARRPFI RGWLALIVAG
LCLTGWLVLI GWPNGQPPVA VVVAVICVFS VGGPASGIGF MLARDYNPRH RISTATGLVN
VGGFCGAVVM VFAVGQILDL VEPGAVTHTA AAYRWAFGAI ALLTAFGIAR MITWILRTRV
HVLRAAARGE DVPVEIVPHR WDLLDTGEFR LVTGTKPGEV RFEDLSRRPV VDPSTDQRPA
PVRREP
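
The protein sequence corresponds to the gene sequence translated with the translation table listed in the record (table 11). As a rough sketch of that relationship, the following builds a codon table from the standard amino acid assignments, which table 11 shares with the standard code (the two differ only in which start codons are permitted), and translates a sequence up to the first stop codon. The function name translate is hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon assignments in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to display the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
print(translate("ATGAACCGCC TGCGGGCCGG CTGGCGTTGG"))
```

Applied to the first 30 bases this yields the first ten residues of the protein sequence, and the final 21 bases (CCCGTGCGGC GCGAGCCGTG A) yield the last six residues followed by the TGA stop codon.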