Gene Namu_3523 details


Gene Information

Locus tag: Namu_3523
Symbol:
ID: 8449142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3869463
End bp: 3870563
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID: 645042601
Product: C4-dicarboxylate transporter/malic acid transport protein
Protein accession: YP_003202837
Protein GI: 258653681
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1275] Tellurite resistance protein and related permeases
TIGRFAM ID:
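
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is reported, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3869463
END_BP = 3870563
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Codons that encode residues, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3870563 - 3869463 + 1 gives 1101 bp, and 1101 / 3 - 1 gives 366 residues, matching the listed gene and protein lengths.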


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00371301
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.143874
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACCGC ACGGGCAGCC GCCGGGACCG GTCCGCCCGC CCCGGTCGCT GGACGATCCG 
GCCTGGTTCG GCGCGGTGAT GGGCCGTGCG GCCACGGCCA CCGTCGCCTC CCTGCACCCC
GGCCGGATCG GCCCGCTCAG CCGCTTCGCC GACGTCACCG CCGCGATCCT GCTCGTGGCC
AGCATCCTCG CCTTCGCCGG GCTGTTCGTG CGCGACTTCC TGGTCCGCCG CCTGGGCGCC
GACCTGGCCG GCAAGCTCCG CTCGCCCCGC ACCGGACCGG CCTACGCCAC CATCCCCGGC
GCGATCAACG TGCTCGCCGT CGCCGTCCTG CACGTGTGGC CGGCCAGTGC CAACTCGGCG
GTCGGCTGGT GGCTGCTCAT CGGGCTGGCC GGCCTGGGCA CCACGCTGGG CCTGATGCTG
ACGGTGGTCT TCTTCGTCAG CGCGTTCGAG CACGAACAGT TTCCGGCGCA GGACATCTCG
GGCATCTGGT TCATTCCGGA GACCGTGGTC CTGCTGGGTT CGTTGCTGTT CGCCGAGCTG
GCGCCGGCCG GACCGGAGGC CGCTCAGCGC GGGCTGGCCG TGCTGGCGGT CGCCCTGCTC
GGGGCCGGCG GGTTGCTGTT CGGGATCACC GCGGTGATCT TCGTGAACCG GCTGGTGCTG
CACGCCGGGG TGCACCGCAC CGGCGCCCCG GCCATGTGGA TCATGATCAG CCCGCTGGCC
GTCACCTCGC TCGCACTCCA GTCGGTGGCC GGCGGCGACC CGATGCTTGG CGGGACCTGG
ACGCCGGCCG TGGCCGAGGT CGCCACCTTC GCGGCCGGCG CGCTCTGGGG GTTCGCCCTC
TGGTGGATCG CCGCTGCCGC CGTGGTCACC CGGCACGCCG GGCGGGCCGC GTTCACCCGG
ACCGCGGCGG ACTGGGGCTT CGTCTTCCCG TCCGCGGCGA TGGTCATCGC CACCCTGACC
CTGGCCCGGC GATGGCAGTC CGGCCTGGTC GAGGCGGCCG GCCTGGCTCT GGGCGTGCTG
CTGGCCCTGG TCTGGGTGGC CGTGCTGTCC GGCGCCGTCG TCGGGTACCG GCGCGAGCAA
CGCACCCGCC GCGGCCGGTG A
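
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The following is a minimal sketch of how a GC fraction such as the listed "GC content" value could be recomputed from that text. The helper names are hypothetical, and the rounding the source database applies is not known.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence to check the whole gene.
    first_line = "ATGGCACCGC ACGGGCAGCC GCCGGGACCG GTCCGCCCGC CCCGGTCGCT GGACGATCCG"
    print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete sequence text (all lines, blocks and line breaks included) to `gc_fraction` gives the value for the whole gene.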
 
Protein sequence
MAPHGQPPGP VRPPRSLDDP AWFGAVMGRA ATATVASLHP GRIGPLSRFA DVTAAILLVA 
SILAFAGLFV RDFLVRRLGA DLAGKLRSPR TGPAYATIPG AINVLAVAVL HVWPASANSA
VGWWLLIGLA GLGTTLGLML TVVFFVSAFE HEQFPAQDIS GIWFIPETVV LLGSLLFAEL
APAGPEAAQR GLAVLAVALL GAGGLLFGIT AVIFVNRLVL HAGVHRTGAP AMWIMISPLA
VTSLALQSVA GGDPMLGGTW TPAVAEVATF AAGALWGFAL WWIAAAAVVT RHAGRAAFTR
TAADWGFVFP SAAMVIATLT LARRWQSGLV EAAGLALGVL LALVWVAVLS GAVVGYRREQ
RTRRGR
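
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which start codons are permitted), so a standard codon table is enough for this sketch. The function name is illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate("ATGGCACCGC ACGGGCAGCC G"))  # MAPHGQP (start of the protein above)
    print(translate("CGCACCCGCC GCGGCCGGTG A"))  # RTRRGR (end of the protein above)
```

The two fragments are the first 21 and last 21 bases of the gene sequence; the final TGA codon is read as a stop and is not emitted.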