Gene Namu_3367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3367 
Symbol 
ID8448982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3705660 
End bp3706850 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content75% 
IMG OID645042444 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003202684 
Protein GI258653528 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0109833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0295865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTCCCA CGCCCGCGAC CGCGTCCCCG GCGCGCGCGT GGACCATGCT GGCGCTGGGC 
ACCGCGGCGC AGACCGCGGG CACGGTCTTC GTCTCGACCC CGGCCTTCCT CATCCCGCTG
TTGCACGAGC AGCGCTCGCT CTCGCTGGCG CAGGCCGGTC TGGTGGCCTC CGCACCGCTG
GTGGGTCTGG TGCTCTCGTT GATCGCCTGG GGAGCGCTGG CCGATCGTCG GGGCGAGCGC
CTGGTCATCG CCAGCGGTCT CGCGCTGACC GCGGTGGCCA CGGTCGGCGC CATGTTCTCG
ACCGGTTACG TGGCCCTTGG CCTGTTCTTC GTGCTCGGCG GGGTCGGCGC GGCCAGCACG
AACGCGGCCA GCGGGCGCGT CGTGGTGGGC TGGTTCGCCA AGGATCGACG CGGCCTGGCG
ATGGGGGTCC GGCAGATCGC CCAGCCCCTG GGCACCACCC TGGCCGCGGT CATCGTGCCG
ACCGCCGCCG AGTCGGGGAT CGACGCCGCG CTGGCCGTCC CGTTGATCGC GGTGGCCCCG
TTGGCGGTGA TCTGCGCGAT CGGCATCCGG AACCCGCCGC GGCCGGCGTC GGCCCCGGCC
CTGGCCACCG CCAATCCCTA CCGCGGCTCG GGGTTCCTGT GGCGGATCCA TCTGGTCTCG
GTGCTGCTGG TGGTGCCGCA GTACACCCTG GCGTTGTTCG GGCTGCTGTG GTTGATCGCG
GGCCAGGGCT GGGATCCGAT CGGGGCCGGG CTGGTGATCG GCGCGGCGCA GTTCGTCGGC
GCGCTCGGCC GCATCGGAGC CGGCGTGCTC AGCGATCGGA TCGGCAGCCG GGTGCGGCCG
CTGCGGTGGA TCTCGCTGGC CGCCGCCGCG TCGATGCTCG CGCTGGCCGC GGCGGCGGCC
ACCCAGTGGA GCCTGGCGCC GCTGGTGCTG GTCGTGGCCA CCACCATCTC CGTCGCCGAC
AACGGTCTGG CCTTCACCTC GGTGGCCGAG GTGGCCGGGC CGGTCTGGGC CGGCCGGGCC
CTGGGCGTGC AGAACACCGG GCAGTTCGTG GCCGCCGCGG CGGTGGGGCC GGTCGTCGGC
GTCCTGATCA CGGTCCTGGG CTACCCGCTG GCGTTCGCCG CGTCGGCGGT GGCGCCCGTC
CTGGCCACCC CTCTGATCCC GGACGCCCGG GCCGAACGCG ACCGGCTCTA G
 
Protein sequence
MGPTPATASP ARAWTMLALG TAAQTAGTVF VSTPAFLIPL LHEQRSLSLA QAGLVASAPL 
VGLVLSLIAW GALADRRGER LVIASGLALT AVATVGAMFS TGYVALGLFF VLGGVGAAST
NAASGRVVVG WFAKDRRGLA MGVRQIAQPL GTTLAAVIVP TAAESGIDAA LAVPLIAVAP
LAVICAIGIR NPPRPASAPA LATANPYRGS GFLWRIHLVS VLLVVPQYTL ALFGLLWLIA
GQGWDPIGAG LVIGAAQFVG ALGRIGAGVL SDRIGSRVRP LRWISLAAAA SMLALAAAAA
TQWSLAPLVL VVATTISVAD NGLAFTSVAE VAGPVWAGRA LGVQNTGQFV AAAAVGPVVG
VLITVLGYPL AFAASAVAPV LATPLIPDAR AERDRL