Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4315 |
Symbol | |
ID | 8449941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4799086 |
End bp | 4800897 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645043363 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003203592 |
Protein GI | 258654436 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0499363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCCGG TGACGACGAC GCGCGGCAAC AGGGTCCGGC TGATCCTGTT CGTGCTGGCG GCGGTGCTGC TGACCGCCTG CACGCCGCCG AACCTGCCCG CGCCGATCGC GCCGACCGGC GGGTCCTCGG TGTCGAGCAC CCCGCCGGCC CCCGGCACGC TGGTCGTCGG CCTGGACGGC ACCGCCGGAT CGATCACCGG GTTCAACCCT TACGCGATCG CCGACTACTC GCCGGCGGCC CAGGCGGTGG CCTCGCTGGT GTTGCCGTCG GCGTTCGTGA TGGCCCCGGA CGGGTCGATC GGCACCGCCG CTGACGTCGT GGACTGGGTC GCGGTCACCT CGTTGGAGCC GTTCACGGTC ACCTACCTGC TCGACCGCAA GGCGTCCTGG TCCGACGGCA CCCCGATCAC CGCCGAGGAC TTCTCCTACC TGCGCGACCA GCTGCTGGCA CAACCGGGCA CGGTCGGCTC GGCCGGCTAC CGGTTGATCA GCGCGATTCG ATCCCGGGAC GCCGGCAAGA CGGTCGAGGT CGAGTTCTCC CAGCCGTTCC CGCAGTGGCG GACCCTGTTC TCCCCGTTGC TGCCCAGCCA CCTGCTCAAG GACTCGCCGG GCGGCTGGTC CTCCGCCCTG GACAGCGATC TGCCGCTGTC GGCCAACCGG TACCGGATGA GCTCCTACGA CGCGGTGACC GGCCAGATCA CCCTGGCCCG CAACGACAAG TACTGGGCCA CGCCGCCGGG GCCGGCCGCG GTGGTGCTGC GGCTGGGCCG GCCGTCGGAC CTGCTCGCCG CGTTCAACCG GGGCGACGTG CAGGCGTTGT GGTTCGCCCC TGACGCGCGT ACCGCCCAGG ACCTGCTCGA CCAGGTGCCC GCGGACCGGC GGACCACGGT GGCCACCCCG TCCTCGATCC AACTGGTGCT CAACACCCAG CGTGGGCCCA CCGCCGACCG GTCGGTGCGG ACGGCGATCG CCGCCGGCCT GAACCTCGAT CAGCTCTCCG CAGAGCTGAC CGCCGGCTGG CCCGACGGGG GTCGCCCGGT CGCCTCCCAG GTCGCGTTGC CGGTCGAGGA CGCCACCGAT TCGAGCCCGC CGATCGTCAC CGACGACGCG GCCGCGGCGG CCGACGACCT GGCGGCCGCG GGATATACGC GCAACGGTCT GTACGTCGCC CGGCAGGGCG AGGTCCTGCG GCTGACCCTG GGCTATCCGA GCGGTCAGCC CCGGATGGCG GCCGCCGCGC GGCAGATCCG CGACCAGCTC GGGGCGGTCG GCATCGAGGT CGACCTGCTG CCCGACGCCG CCCCCGACCT GATCGAGAAC ACCGTCGCCA CCGGCGATCT CGATCTCGCC CTGGTCAACC TGCCACGCGG TCCGCAGGAC GATGCGGCGG CCGCCACGGC GTTCGGCTGC CCCCGGACCG ACCCGCGGGG TCTGGGCAGC GTGGTCACCA CGACCCCCCG GGCGTCGGGG GGGACGACCG GGCCGACGAC GGTGCCGACC ACCGAGCCGA CCACCGAGCC GACCGGCGAG GCCGGCGCAG ACGGCGCCGA CGGCGCGGCC GTGCGGACCG GCAACCTGTC CGGCTACTGC CAGGCCGGGA CCCAGCAGAA GTTGACCGAC GCGCTGACCG GCTCCGGGTC GGCCGCCGCC GCCGACCCGG CGCTGTGGGC GCAGCTGCCG GTGCTGCCGG TGGTCCAGCC GCAGGCGGTG TTCGCGGTGT CGCCGGCCCT GCAGCCGGTG CTCGACACCA CCCACCCGGG CTGGTCCTGG ACCGGCCCGC TGGCCGGTCT GGCGTCGTGG CCGTCGCCCT GA
|
Protein sequence | MGPVTTTRGN RVRLILFVLA AVLLTACTPP NLPAPIAPTG GSSVSSTPPA PGTLVVGLDG TAGSITGFNP YAIADYSPAA QAVASLVLPS AFVMAPDGSI GTAADVVDWV AVTSLEPFTV TYLLDRKASW SDGTPITAED FSYLRDQLLA QPGTVGSAGY RLISAIRSRD AGKTVEVEFS QPFPQWRTLF SPLLPSHLLK DSPGGWSSAL DSDLPLSANR YRMSSYDAVT GQITLARNDK YWATPPGPAA VVLRLGRPSD LLAAFNRGDV QALWFAPDAR TAQDLLDQVP ADRRTTVATP SSIQLVLNTQ RGPTADRSVR TAIAAGLNLD QLSAELTAGW PDGGRPVASQ VALPVEDATD SSPPIVTDDA AAAADDLAAA GYTRNGLYVA RQGEVLRLTL GYPSGQPRMA AAARQIRDQL GAVGIEVDLL PDAAPDLIEN TVATGDLDLA LVNLPRGPQD DAAAATAFGC PRTDPRGLGS VVTTTPRASG GTTGPTTVPT TEPTTEPTGE AGADGADGAA VRTGNLSGYC QAGTQQKLTD ALTGSGSAAA ADPALWAQLP VLPVVQPQAV FAVSPALQPV LDTTHPGWSW TGPLAGLASW PSP
|
| |