Gene Namu_3403 details

Gene Information

Locus tag: Namu_3403
Symbol:
ID: 8449018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3740711
End bp: 3742252
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 75%
IMG OID: 645042480
Product: major facilitator superfamily MFS_1
Protein accession: YP_003202720
Protein GI: 258653564
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
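
The Start bp, End bp, Gene Length, and Protein Length values in this record are consistent with one another. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3740711
end_bp = 3742252

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed 1542 bp and 513 aa.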


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00000797372
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000013818
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGCTGA ACCCGCGGGC CGATCCGGCC GGGCCGCGGG CCGGAGCCGG CGGCGCCGGG 
ATCGCGCTGC TGGGGGTGAT GGCGGCGGTC CAGGGCTCGG ATCCCAACAT CGCCAGCACC
GCCCTGGTCG GCGCCAGCCG CGGCCTGCAG ATGACCGGTG GCCTGCTGGC CCTGGCCGCC
AGCGTGTCCA CCCTGGCGCT GGCCGCCACG GTCATCTCCA CCGGGTTGCT GGCCGACCGG
TTCGGCCGGC GCGGCGTGCT GATGGCCGCG TTGGCGCTCT CGGCCGCGGG CGACCTGATC
GCCGCCCTGG CCCCGACCGC AGGGCTGTTC CTGATCGGCC GGGCGGTGGC CGGGATCGGC
CTGGGCGCGG TGTACGGGGC GGCCTTCGCC TACATCCGGG CGATCACCCC GCCCGATCGG
ATCGCCGGTG CCATCGGCAT TTTCGGCGCC GTCTCGGGCG GCGTCACGGT GCTGATGACG
TTCCTGGGCG GGGCCCTGGC CTCGGTGCAC TGGCGGCTGG CCTTCCTGGT GGTGCCGGTG
GCCGCCCTGC TGTGCGTGCC GGCGGTGCGG GCGGTGCTGC CGGCCCAGCC GCGGGTCGCC
GACGGCCCGC GGGACTATCC GGGCCAGGTG CTACTGGCCC TGGGCGTGGT CGGCGTGCTC
TACGGGTTCA GCCACGCCGC CGACGGGTTG ACCTCGCCGC TGACCTTCGG CCCGCTGCTG
GGCGGGCTGG TGCTGCTGGC CTTGTTCGTG GTCCGCGAGC GGCACACGTC GGCCCGCTTC
TTCCCGGTGG AGCTGCTGAC CAAGCCGCTG TTCCTGGCCG CCATCTGCGC CGGCTTCGTC
TACAACTTCG GCACCGCCGT CGGCTTCCTG CAGCTCACCG ACCTGTGGCA GTACGTGGTC
GGCCTGTCCA CGCTGCGGGT CTCGCTGTGG CAGATGCCGT TCCTGCTGGC CGGCATCGCC
GCCGCGGTGC TGTTCGGCCG GCTGATGACC AGGGGGCTGA CCGCGGCCAG CACGGTGGCC
ATCGGGTCGC TGGCCGCGGC GGCCGGGTTC GTCCTGCTCG CCGTGCTGCA TTCGTCGACC
TCGCTGTGGG GCTTTCTGCC CGGCTCGATC CTGCTCGGCG CGGGCGTCAT CATCGCCTCG
CTGCCGTACG GGACGCTGAT CATCGCGCAG GCCCCGGCCC GCTACTTCGG GCCGGTCACC
TCGTCGCGGA CGACCATCGG CCAGTTCTTC TACGCGGCCG GGCTGGCCCT GTCCACGGTG
CTGGTGAACA CGATGACCAC CGGCGGGGTG GTCCGCCGAC TGGAGCAGGC CGGCGTGCCG
CCGACCGACA CCGGGCAGGG GCTGGACGCG GTCACCGCCT TCGCCGCCGA CGGCACCCGC
CCCAGCACCG CCCTGGGCCA GCAGGCGCTG GCCGAGGCGG CCCAGTCCTA CGGCCAGGCG
TTCGCGCTGA CGACCCTGCT GGCTGCCCTG GTCACCCTGA TCGTCGGCGG CCTGGGCTGG
TGGCTGCTGC GCCGGCACGA GGCGCGACCG GCCACCGGCT GA
 
Protein sequence
MLLNPRADPA GPRAGAGGAG IALLGVMAAV QGSDPNIAST ALVGASRGLQ MTGGLLALAA 
SVSTLALAAT VISTGLLADR FGRRGVLMAA LALSAAGDLI AALAPTAGLF LIGRAVAGIG
LGAVYGAAFA YIRAITPPDR IAGAIGIFGA VSGGVTVLMT FLGGALASVH WRLAFLVVPV
AALLCVPAVR AVLPAQPRVA DGPRDYPGQV LLALGVVGVL YGFSHAADGL TSPLTFGPLL
GGLVLLALFV VRERHTSARF FPVELLTKPL FLAAICAGFV YNFGTAVGFL QLTDLWQYVV
GLSTLRVSLW QMPFLLAGIA AAVLFGRLMT RGLTAASTVA IGSLAAAAGF VLLAVLHSST
SLWGFLPGSI LLGAGVIIAS LPYGTLIIAQ APARYFGPVT SSRTTIGQFF YAAGLALSTV
LVNTMTTGGV VRRLEQAGVP PTDTGQGLDA VTAFAADGTR PSTALGQQAL AEAAQSYGQA
FALTTLLAAL VTLIVGGLGW WLLRRHEARP ATG
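
The sequences above are listed in blocks of ten residues separated by spaces, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed figure describes that fragment rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGCTGCTGA ACCCGCGGGC CGATCCGGCC GGGCCGCGGG CCGGAGCCGG CGGCGCCGGG"
seq = clean_sequence(sample)

# Length of the fragment, its first codon, and its GC fraction.
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare with the listed GC content.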