Gene Namu_4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4344 
Symbol 
ID8449970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4831274 
End bp4832293 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content67% 
IMG OID645043391 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003203620 
Protein GI258654464 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCC GTCTGCGACG CGGCTTCGTC CTGACTTCGG CGCTGATCGC CGCGGCCCTG 
GCGGTCAGCG CCTGCGGCGG CTCCACCGGC GACACCGCAA GCACCACGAG CGGTGCCGCC
GCGCCCGCGA GTGCCGGTTC GGTGTCGGCG GCCGCAGCCG ACGCGGTCGA GTTCAATCTC
GGCCCGGACC AGAACCGGGT CACCACCGCC AAGGTCGATG CGATTGCGGC CAAGGTTCCG
CAGGAGATCC GGGACCGCGG CACCATCAAG GTCACCGGCT CGGCCGGCAC CGCGCCGCCG
CTACGTTTCT ACGCCACCGA CGACACCACC CTGATCGGTT CCGAGGTCGA CTTCGCGTAC
CTGTTCGCCG ACGTGCTCGG GCTGCGGGTG GACCTGTCCG CAGCGGACTG GTCGCAGAAC
TTCGTCCGGG TCGATTCGGG TGAGGTGGAC GCCTTCATCT CCAACGTGAC GGTGACCGAG
GAACGCAAGG AGAAGTACGA CTTCGCCACC TACCGGCTGG ACACCCTCGC CCTGGAGACC
CGGATCGACG ATCCCTGGAC GGTCACCGAC CGCAAGGATC TGGCCGGCAA GGTGATCGCC
GTCGGGTCGG GCACCAACCA GGAAAAGATC CTGGTCGACT GGAACGAGCA GAACATCGCC
GAGGGACTGG CGCCCATCGA CATCAAGTAC TTCCAGAACT CCACCGACTA CTACCTGGCC
CTGTCCTCGG GACGGATCGA GGGCTACTTC GGGCCCAACC CGACCGCGCA GTACCACGCG
GCCAGCACCG GCGAGACCAA GGTGATCGGC ACCTTCTCGG GGGCCGGCGA GGCCCTGCAG
GGTGAGATCG CGGTGCTGAC GCTCAAGGGC AACGGGTTGG CCGAAGCCTT CACCGAGGCG
ATCAACCACA CCATCGAGAA CGGGACCTAC CAGCAGGTTC TCGATCGGTG GAATCTGGCC
AGCGAGGGCG TCGCCCGCTC CGAGCTCGAC CCGCGCGGGT TGCCCAAGCC GACCAGCTGA
 
Protein sequence
MSSRLRRGFV LTSALIAAAL AVSACGGSTG DTASTTSGAA APASAGSVSA AAADAVEFNL 
GPDQNRVTTA KVDAIAAKVP QEIRDRGTIK VTGSAGTAPP LRFYATDDTT LIGSEVDFAY
LFADVLGLRV DLSAADWSQN FVRVDSGEVD AFISNVTVTE ERKEKYDFAT YRLDTLALET
RIDDPWTVTD RKDLAGKVIA VGSGTNQEKI LVDWNEQNIA EGLAPIDIKY FQNSTDYYLA
LSSGRIEGYF GPNPTAQYHA ASTGETKVIG TFSGAGEALQ GEIAVLTLKG NGLAEAFTEA
INHTIENGTY QQVLDRWNLA SEGVARSELD PRGLPKPTS