Gene Namu_4337 details

Gene Information

Locus tag: Namu_4337
Symbol:
ID: 8449963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4824619
End bp: 4825725
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 69%
IMG OID: 645043385
Product: extracellular solute-binding protein family 3
Protein accession: YP_003203614
Protein GI: 258654458
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
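
The start, end, gene length, and protein length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about the record's conventions, not statements made on the page, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4824619, 4825725)
print(length, protein_length(length))  # prints "1107 368"
```

Under these assumptions the computed values match the Gene Length (1107 bp) and Protein Length (368 aa) fields.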


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAC GCCCGCTCCT GCGGCGCCTC GCCACCGCCA CCGCACCGAT CGCCGCCGTC 
GGCGCCGCCC TGCTGCTCGC GGCGTGCTCC GGCTCGACCA CCTCCCCGTC GACCAGCTCC
CCGTCGGCCA CGTCAGCCGG CGCCGTCTCC AGCGGCTCGG TGTCGGCGGC CGCCGACTCC
CTCGACCTGT CCGGCGTCAC CCTGCGGATC GGCGAAACCG GTTACAAGCA GCAACAACTC
CTGCTGGAGA AGGCGGGCCT GGCGGACACG CCCTACACCA CCGACTTCAG CCTGTTCCAG
GGCGGCAACC TGCAACTCGA GGCGCTGGGG GCCGGCGCCA TCGACCTGGC CAGTGCCAGC
GAGATCCCGC CGATCTTCGC CGCCCAGTCG GGTGGCCCCG GCTCCCTGGC GATCGTCGCC
GTCCGCCAGG GCAACACGCT GACCCAGGAG GTCGTGGTTC CCGAAGGGAG TTCGATCACC
GACGCCGCCG GCCTCAAGGG CAAGAAGGTC GCCTACGTCC AGAACACCAC GGCGCACTAC
TTCCTGTACA AGGCCCTGGA ACAGGCCGGC CTGAGCTGGA GTGACATCGA AGCGGTCCCC
CTGTCCACCA GCGACGGGCT GGCCGCCCTG CTGTCCGGCC AGGTGGATGC GCTGGCGTCC
TACGGCAACG CCATCATCTC GGCCCACGCC AAGGGCGCCA GCACCATCGT CGACGCCCGG
GACATCCTGT CCGGCAATTT CGTCTACGTC TCGACGCCGA CGGTGATCGA CGATCCGGCC
AAGCATGCGG CCATCGCGGA CTACTTCTCC CGGCTGCAAC GGGCCTTCAA CTGGGCCCGG
GCCAACCCGG ACACATGGGC CGCGGTCGTC GCCGAGCAGA CCAAGCAACC GGTCGAGCAG
GCGCTGAGCA CCTTCACCGA CGGTGAGGCG CAACGTCCGA GCAAGTTCGT GCCGACCTCG
GCCGAGGCGA TCGCCTCCCA GCAGGACGTG CTCGACACCT TCGTCAAGGC AGGCATTCTC
ACCACCGGCT TCAGCATCGG CGACTACTGG AGCACCTCGT TCGACGCCGA CCTGACCGCG
ATCGAGGGCG AGTATGTCGG CGGCTGA
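
The GC content field in Gene Information can be recomputed from the gene sequence. The sketch below assumes GC content means the fraction of G and C bases among all bases, and it strips the whitespace used to group the sequence into blocks of ten. The name `gc_fraction` is illustrative. For brevity only the first line of the gene sequence is used as input, so the printed figure describes that line, not the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence, copied from the record above.
first_line = (
    "ATGACCAGAC GCCCGCTCCT GCGGCGCCTC GCCACCGCCA CCGCACCGAT CGCCGCCGTC"
)
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1107 bp sequence instead of `first_line` gives the figure to compare with the reported GC content.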
 
Protein sequence
MTRRPLLRRL ATATAPIAAV GAALLLAACS GSTTSPSTSS PSATSAGAVS SGSVSAAADS 
LDLSGVTLRI GETGYKQQQL LLEKAGLADT PYTTDFSLFQ GGNLQLEALG AGAIDLASAS
EIPPIFAAQS GGPGSLAIVA VRQGNTLTQE VVVPEGSSIT DAAGLKGKKV AYVQNTTAHY
FLYKALEQAG LSWSDIEAVP LSTSDGLAAL LSGQVDALAS YGNAIISAHA KGASTIVDAR
DILSGNFVYV STPTVIDDPA KHAAIADYFS RLQRAFNWAR ANPDTWAAVV AEQTKQPVEQ
ALSTFTDGEA QRPSKFVPTS AEAIASQQDV LDTFVKAGIL TTGFSIGDYW STSFDADLTA
IEGEYVGG
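
The protein sequence follows from the gene sequence under the translation table listed in Gene Information (table 11). The sketch below builds the codon table from the usual compact 64-letter amino acid string, with bases ordered T, C, A, G; table 11 shares its amino acid assignments with the standard code and differs in the start codons it permits. The names `CODON_TABLE` and `translate` are illustrative, and the sketch assumes an unambiguous sequence read in frame from its first base.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCAGAC GCCCGCTCCT GCGGCGCCTC"))  # prints "MTRRPLLRRL"
```

The output matches the first ten residues of the protein sequence, and the last three codons of the gene (GGC GGC TGA) translate to the final two glycines followed by a stop.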