Gene Namu_0658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0658 
Symbol 
ID8446244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp725300 
End bp727033 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content68% 
IMG OID645039793 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003200062 
Protein GI258650906 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGTC ATCTGTGGTT GGGGGCCGGT CTGGCCGGCG TCCTGGTGCT CGGGCTCACC 
GCCTGCGCCA GTTCGAGCCG GGACGCGGGC ACCACCACCG CGGCCAGCGG AAGTGCGCAG
GCCTCCGGGT CGGCCGGCGA GGGCTCGGGC GGGCAGCCGG CCAACCCCGA CGGGCAGTTC
GTCTTCGGCG CGGCCGGGGC TCCCAGCATG TTCGACCCGC TCTACGCCAC CGACGGGGAG
ACCTTCCGGG TGGCCCGGCA GATCAACGAG GGCCTGATCC GGTTCAAGCC GGGCACCGCC
GACCCGGAGC CGGCCCTGGC CACCGACTGG GAGCAGAGCA CCGACGGCAA GACCTGGACC
TTCACCATCC GTGAGGGCGT CACGTTCCAC GACGGCACCC CGGTCGATGC GGCCGCGGTG
TGCTTCAACC TCGATCGCAT GTACAACCAG ACCGGGGCCG GCGCCACCCA GGCCCAGTAC
TGGTCGGACG TGATGGGTGG GTTCAAGAAC CAGGTCGACG ACGCCGGGCA ACCGGTCCCG
TCGGTCTACT CCAGTTGCAC CGCCGAGGGC AACAAGGCCG TCATCGCCCT GACCACCTCG
ACCTCGAAGT TCCCGGGGGT GCTCGGCCTG CCGTCCTACT CGATCCAGTC GCCCACCGCG
CTGCAGCAGT ACGACGCGAA CAACGTGGTC GCCCAGGGCG ACTCGTTCGT CTACCCGGCC
TACGCGACCG AGCACCCGAC CGGCGCCGGC CCGTACAAAT TCCAGGCCTA CGACAAGGCC
AACAACACCG TGACCCTCGT CCGCAACGAC GATTACTACG GCGAGAAGGC CAAGACCAAG
ACGCTGATCT TCAAGATCAT CCCGGACGAG ACCGCGCGCA AGCAGGAGCT GCAGGCCGGC
ACCATCGACG GGTACGACTT CCCGAGCCCG GCCGACTGGG ACGGGCTGAC CGGAGCCGGC
TTCAACGTCG AGGTCCGGCC CGCGTTCAAC GTGATGTACC TGGGCATGAC CCAGGGCACC
AACCCCGCGC TGGCCGATCT GAAGGTGCGG CAGGCCATCG CCTACGCGCT CAACCGCGAG
CAGTTCGTGC AGTCCCAGTT GCCCGACGGC GCCAAGGTCG CGGACATCTT CTACCCGGAC
ACCGTCGACG GCTGGACCGA CGACGTCACC AAGTACCCGT ACGACCCGGA GAAGGCCAAG
CAACTGCTGG CCGAGGCCGG CCAGTCGAAC CTGACGGTCA ACTTCTGGTG GCCCACCGAG
GTCAGCCGCC CCTACATGCC GGATCCCAAG AGCGTGTTCA CCGCGTTCAA GGCGGACCTG
GAGGCGGTCG GCATCACCGT CAACGAGATC TCCAAGCCGT GGAACGGCGG GTACCTGGAC
GGTGTCGAGG CGCACGACGC CGACCTGTTC CTGCTCGGCT GGACGGGTGA CTACAACACG
CCGGACAACT TCATCGGCAC CTTCTTCACC CGCACCGACA ACCGGTTCAA CACCGGCACC
CAGCCGTGGG GCGCCACCCT GTCCGAGGCG CTCAAGCAGG CCGACGCGAT CCCGGATCCC
GACCAGCGCA ACGCCGCCTA CGTCAAGATC AACCAGGATC TGATGGGCAC GTACCTGCCG
GCGGTGCCGA TCTCCCACTC GCCGCCGGCG ATCGTGGTGG CCGGCGACGT CGAGGGTCTG
GTGGCCAGCC CGCTGACCGA CGAGCGGTTC AGCACGGTCT ACAAGACGAG CTGA
 
Protein sequence
MKRHLWLGAG LAGVLVLGLT ACASSSRDAG TTTAASGSAQ ASGSAGEGSG GQPANPDGQF 
VFGAAGAPSM FDPLYATDGE TFRVARQINE GLIRFKPGTA DPEPALATDW EQSTDGKTWT
FTIREGVTFH DGTPVDAAAV CFNLDRMYNQ TGAGATQAQY WSDVMGGFKN QVDDAGQPVP
SVYSSCTAEG NKAVIALTTS TSKFPGVLGL PSYSIQSPTA LQQYDANNVV AQGDSFVYPA
YATEHPTGAG PYKFQAYDKA NNTVTLVRND DYYGEKAKTK TLIFKIIPDE TARKQELQAG
TIDGYDFPSP ADWDGLTGAG FNVEVRPAFN VMYLGMTQGT NPALADLKVR QAIAYALNRE
QFVQSQLPDG AKVADIFYPD TVDGWTDDVT KYPYDPEKAK QLLAEAGQSN LTVNFWWPTE
VSRPYMPDPK SVFTAFKADL EAVGITVNEI SKPWNGGYLD GVEAHDADLF LLGWTGDYNT
PDNFIGTFFT RTDNRFNTGT QPWGATLSEA LKQADAIPDP DQRNAAYVKI NQDLMGTYLP
AVPISHSPPA IVVAGDVEGL VASPLTDERF STVYKTS