Gene Namu_4320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4320 
Symbol 
ID8449946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4804920 
End bp4805933 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content69% 
IMG OID645043368 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003203597 
Protein GI258654441 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGTTCT TCGAGAAGTT CCGCGTACGT CCTATGCTCG CCGGCGGTGC CGCGCTGGCA 
ATCGTGCTGG CCACGGCGGC CTGCGGAGCC AGTACCACGC CCCCCGGAAG CACCACCGCG
GCCTCGTCGG CCGGGTCGGC CGCGTCGTCC GGAGCCGCCA CGTCCGGAGC TGAAGCCGAG
AGCGTCCCCA CCCCGCTGAA CCTGAACCTG GCCAAGGTTG ACTCGCTCAA CGCGATGCTG
CCGGACAAGT TCAAGCAGTC CGGCACGATC ATCGTGGCGA CCGACCCGAC ATACCCGCCC
AACGAATTCC TGCCCGAGGG CTCGAACACC CCCGTGGGCA TGGACATCGA CCTGATCAAC
GCGGTGGGCC AGGTGCTGGG GCTGACCGTC ACCGTCGAGC CGGCCAGCTT CGACGGCATC
ATCGCGGGCC TGAACGCGGG CCGCTACGAC GTCGGCATCT CCTCGTTCAC CGACACCAAG
GAACGCGAGC AGTCGGTCAA CTTCGTCACC TACTTCACCG CGGGCACCTC GATCATGGTG
CCGGCCGGCA ACCCGAAGAA CATCCAGTCG GCGACCGACC TGTGCGGGCT GCCGGTCGGC
GCGCAGAACG GCACCACCCA GCTGGACCAG CTGACCGACG CGACCGTCGA GGGCTCCGTG
GTCAAGGCGT GCCAGGACGC GGGTAAGGAG CCGCCGGTGG CGCAGGGCTT CCCGAAGCAG
ACCGACGTGA ATGCGGCACT GGTGGCCGGC CGGATCGACG CCTACATGGC CGACTCGCCG
GTCGTCGACT ACGCCGTGAA GGTCACCGGG GACCGGTTCC AGAAGGTCGG CGGCGACGAG
GGCGCCGCGC CCTACGGCAT CGCCATCCCG AAGGAGCCGG CGGAGCTGAC CCCGGCCATC
CAGGCCGCGA TGCAGCACCT GATCGACACC GGTGCCTACA CCAAGATCCT GGACAACTGG
GGTCTGACGG GCGGGGCCGT CACCACCTCG CAGATCAACG GCGCGATCTA CTGA
 
Protein sequence
MSFFEKFRVR PMLAGGAALA IVLATAACGA STTPPGSTTA ASSAGSAASS GAATSGAEAE 
SVPTPLNLNL AKVDSLNAML PDKFKQSGTI IVATDPTYPP NEFLPEGSNT PVGMDIDLIN
AVGQVLGLTV TVEPASFDGI IAGLNAGRYD VGISSFTDTK EREQSVNFVT YFTAGTSIMV
PAGNPKNIQS ATDLCGLPVG AQNGTTQLDQ LTDATVEGSV VKACQDAGKE PPVAQGFPKQ
TDVNAALVAG RIDAYMADSP VVDYAVKVTG DRFQKVGGDE GAAPYGIAIP KEPAELTPAI
QAAMQHLIDT GAYTKILDNW GLTGGAVTTS QINGAIY