Gene Namu_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3852 
Symbol 
ID8449471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4223836 
End bp4225176 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID645042901 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003203137 
Protein GI258653981 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00000421805 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.905597 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGAT CCGCGGTCGT GAACCCGCGC GCCCAGTACT CCGGTTGGGT CTCGCCCCTG 
TGCTGGATTG CCGTCGCCCT CGAAGGATTT GACCTGGTCG TGCTGGGGGT GGTGTTGCCC
GCGCTGCTCA AGTACGACGA CTGGGGGCTC AACCCCAATT CAGCCTCGGT GATCTCGGTC
GTCGGCCTGG TCGGCGTGAT GGTCGGGGCG TTGGCCGCCG GCACGGTCAG CGACCTGATC
GGCCGCCGCC GCACCATGCT GTGGACGGTG ATCAGCTTCT CCGTGCTGAC CCTGGCCTGC
GCCTTCGCCC CCGACCCAGT CACCTTCGCG GTGCTGCGCT TCCTGGCCGG TCTCGGCCTG
GGCGGCGTGC TGCCCACCGC GTTGGCGCTG ATCAACGAGT ACGCCCGGTC GGGTCGCGGC
GGGCGGGCCA CCACCACCAT GATGACCGGC TACCACGTGG GCGCGGTGCT GACCGCGCTG
CTGGGCATCC TGATCATCGA GCCCTGGGGC TGGCATGCGA TGTTCATCGT CGGCGCCCTG
CCGGCCATCG TGCTGGTCCC GTTGATGATC AAGTACCTGC CCGAGTCGAA CGCCTTCCTG
CAGGCCCGAG CCGGGCTCGC GCCGAGCGCC GGCAAGGCCA CCACGACGGA CCGGGCCGAC
CAGGCGGCCA AGCCGGCCAA GCCGGCCAAG TCCAAGAACC CGGTCGGCAT GCTGTTCCAC
CACGGTCTGG GCCGGTCCAC GGTGGCGTTC TGGGTCGCCT CGTTCATGGG CCTGCTGCTG
GTGTACGGGC TGAACACCTG GCTGCCGCAG ATCATGCGCG AGGCCGGCTA CGAGCTGGGC
GCCGCGCTGG CCCTGTTGCT CGTACTCAAC GTCGGCGCGG TGCTCGGCCT GCTGGTCGCC
GGGCAGGTCG CCGACAAGAT CGGCACCCGT CGCTCGTCGA TCAGCTGGTT CGCCGTGGCC
GCCCTGTTCC TGGCCCTGCT GTCGATCAAG CTGCCCGGCA TCGGGGTGTA CATCAGCGTG
CTGCTGGCCG GCATGTTCGT GTTCAGCGCG CAGGTGCTGG TCTACGCCTA CGTCGCCCAT
GTCTACCCGG CCGCCGCCCG CGGCACCGCG CTGGGCTCCG CGGCCGGCGT CGGCCGGCTG
GGCGCCATCA CCGGCCCGCT GATCACCGGC GTCATGCTGA CCGCCGGGGT GGCCTACCCG
TGGGGCTTCT ACCTGTTCGC GGCGGTCGCC GCGATCGGTG CCGCGGCCAT CTTCCTGGTC
GATCGGAACC CGGCCCCGGC CGAGCCGCTG CCGGTCACCG AACAGCAGGC CGACCAGATC
ACCCACATCC ACCCGCACTG A
 
Protein sequence
MNGSAVVNPR AQYSGWVSPL CWIAVALEGF DLVVLGVVLP ALLKYDDWGL NPNSASVISV 
VGLVGVMVGA LAAGTVSDLI GRRRTMLWTV ISFSVLTLAC AFAPDPVTFA VLRFLAGLGL
GGVLPTALAL INEYARSGRG GRATTTMMTG YHVGAVLTAL LGILIIEPWG WHAMFIVGAL
PAIVLVPLMI KYLPESNAFL QARAGLAPSA GKATTTDRAD QAAKPAKPAK SKNPVGMLFH
HGLGRSTVAF WVASFMGLLL VYGLNTWLPQ IMREAGYELG AALALLLVLN VGAVLGLLVA
GQVADKIGTR RSSISWFAVA ALFLALLSIK LPGIGVYISV LLAGMFVFSA QVLVYAYVAH
VYPAAARGTA LGSAAGVGRL GAITGPLITG VMLTAGVAYP WGFYLFAAVA AIGAAAIFLV
DRNPAPAEPL PVTEQQADQI THIHPH