Gene Namu_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3676 
Symbol 
ID8449295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4032429 
End bp4033625 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content68% 
IMG OID645042741 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003202977 
Protein GI258653821 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000606692 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0196107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGC AGGGACGCAA CCTCGCGTTG GCGACCTGGA CGTTCGCGAT CAACTTCTGG 
GCCTGGAACC TGATCGGGCC GCTGTCCACG CCGTACGCCG CCGCGATGTC GCTGTCCAGC
ACGCAGACCG CGTTGCTGGT GGCCACGCCG ATCCTGGTGG GGTCGGTCGG GCGAATACCG
GTCGGCGCCC TCACCGATCG TTATGGCGGC CGAATGATGT TCATCATCGT CTCGGTCGCG
TCCATTGTCC CGGTCCTGCT GGTCGCCTTC GCCGGGTCCA TCGACTCCTA TGTTCTGTTG
CTGATTTTCG GCTTCTTCCT GGGTATTGCC GGGACCACCT TCGCCGTGGG TATCCCGTTC
GTCAACGCGT GGTACCCGCC GGCCCGCCGC GGTTTTGCCA CCGGAGTTTT CGGCGCGGGA
ATGGGTGGGA CGGCATTGTC CGCCTTCTTC ACCCCGCGTT TCGTGAATTG GTTCGGCTAC
GTCACCGCGC ACCTGATCAT CGCCGCGGCG CTGGCCGGCA CGGCGGTGCT GGTGATGGTA
GCAATGAAGG ATTCGCCCAG CTGGAAGCCG AACACGGCCC CGGTGATCCC CAAACTCAGG
GCGGCCGGCA AGCTGGCCAT CACCTGGCAG ATGGCGTTCC TGTACGCGGT GTGCTTCGGC
GGCTTCGTCG CCTTCAGCAC CTACCTGCCG ACCTATCTCA AGACCATCTA CGACTTCTCC
GCCACCGACG CCGGGACCCG TACCGCGGGG TTCGCGCTGG CGGCGGTCGT CGCCCGGCCG
ATCGGCGGGA TCCTGTCCGA CCGGGTCGGT CCCAAGGCGA TCGTGTTGAC CTCGCTGATC
GGGTCCGGGG CGCTGGCCGC CGTCGAGATC ACGCAGCCGC CGGCGGAACT GGCGGCGGGG
GCGTCGTTCG TCGCGCTGGC CTTTTTCCTG GGCATCGGCA CCGGCGGGGT CTTCGCCTGG
GTGGCGCGGT CCGCGCCGCC GGAGAAGGTG GGTACGGTCA CCGGGATCGT CGGCGCGGCC
GGGGGTCTGG GCGGGTACTT CCCGCCGTTG GTGATGGGGG CGACCTACGA CTCGACCGGC
AACGACTACA CCGTCGGTCT TGCGCTGCTC GTGGCCACCT GCCTGGTGGC GGCGATCGTC
ACCGCCTGGC GGCTGCATGT GCGGCCGGCC GGCGCGAGCC CCGGAGGGAA GAATTGA
 
Protein sequence
MTGQGRNLAL ATWTFAINFW AWNLIGPLST PYAAAMSLSS TQTALLVATP ILVGSVGRIP 
VGALTDRYGG RMMFIIVSVA SIVPVLLVAF AGSIDSYVLL LIFGFFLGIA GTTFAVGIPF
VNAWYPPARR GFATGVFGAG MGGTALSAFF TPRFVNWFGY VTAHLIIAAA LAGTAVLVMV
AMKDSPSWKP NTAPVIPKLR AAGKLAITWQ MAFLYAVCFG GFVAFSTYLP TYLKTIYDFS
ATDAGTRTAG FALAAVVARP IGGILSDRVG PKAIVLTSLI GSGALAAVEI TQPPAELAAG
ASFVALAFFL GIGTGGVFAW VARSAPPEKV GTVTGIVGAA GGLGGYFPPL VMGATYDSTG
NDYTVGLALL VATCLVAAIV TAWRLHVRPA GASPGGKN