Gene Namu_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3988 
Symbol 
ID8449607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4404458 
End bp4405423 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content69% 
IMG OID645043033 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003203269 
Protein GI258654113 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0701044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCC GCAGAATCGG ATCGGCGATC GCTGGCGCGG CTGCGCTGGC CCTCGTCGTC 
ACCGGCTGCA GCGGTTCGAG CACCTCTTCG GCGACCAGTG CGGCGGGCAG CGCCGCCAGC
GCCGCCCAGA GCGCAGTCGC GTCGGCCACC TCGGCCGCGG GCAGCGCGCT GTCCTCGGCG
CAGAGCGCGG CCAGCAGCGC GATCGCGGGC GCGACCGGAG ACTCGCAGGT GCTGACGAAC
GCCGGCGAGG GCAAGGTGAC CGTCGGGATC AAGTTCGATC AGCCCGGCCT GGGCCTGAAG
AACCCGGACG GCTCGTTCTC CGGCTTCGAC GTCGAGGTGG CCAAGTACGT CGCCGGCAAG
CTGGGCGTCC CCGAGGGCGG CATCACCTTC GTGGAGTCCA AGTCGGCCGA GCGTGAGGGC
CTGATCGACC GCGGCGAGGT CGACTACATC GTCGCCACCT ACTCGATCAC CGACGCGCGC
AAGGAGAAGG TCAACTTCGC CGGGCCGTAC TTCATCGCCG CCCAGGATCT GCTGGTCAAG
TCCGACAACA CCGACATCAC CGGCCCCGAG GCCATGGCCG GCAAGATCCT GTGCTCGGTG
ACCGGTTCGA CCTCCGCCCA GAAGGTCAAG GACAACTACG CGGCGGACGT GGCCCTGCAG
GAGTACGGCA CCTACACCGA ATGCGTCGAG GCCCTGCGGT CCGGCGCCGT CGACGCGGTG
ACCACCGACA ACGTCATCCT GGCCGGCTAC GCCGCGCAGT ACCCGGGTGA GCTCAAAGTC
GTCGGCAAGG GCTTCTCGAC CGAAAACTAC GGCATCGGCC TGAAGAAGGG TGACGCCGCC
GGCACCGCGG CCATCAACGC GGCCATCGCC GCGATGATCG CCGACGGTTC CTGGAAGCAG
GCCCTGGAGG ACACCGTCGG GCCGTCGGGC TTCACCATCC CGTCCCCGCC GACCCCCAGC
AGCTGA
 
Protein sequence
MKLRRIGSAI AGAAALALVV TGCSGSSTSS ATSAAGSAAS AAQSAVASAT SAAGSALSSA 
QSAASSAIAG ATGDSQVLTN AGEGKVTVGI KFDQPGLGLK NPDGSFSGFD VEVAKYVAGK
LGVPEGGITF VESKSAEREG LIDRGEVDYI VATYSITDAR KEKVNFAGPY FIAAQDLLVK
SDNTDITGPE AMAGKILCSV TGSTSAQKVK DNYAADVALQ EYGTYTECVE ALRSGAVDAV
TTDNVILAGY AAQYPGELKV VGKGFSTENY GIGLKKGDAA GTAAINAAIA AMIADGSWKQ
ALEDTVGPSG FTIPSPPTPS S