Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4314 |
Symbol | |
ID | 8449940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4796892 |
End bp | 4798712 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645043362 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003203591 |
Protein GI | 258654435 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.986199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGAA CTCGGGCGGT GGCCATCACC GCGCTCGCCG CAGCCACCAT GGTGGTGGTC GCAGCGTGTG GGAGCAGCAG CAGTAGCTCC AGCACGACGA GCGCAACGTC GGGCGCCACC AGCGCGTCGG GCAGTTCTGC GGCCGGCGGT GGCACCTCGG GGGACTCCGG GACGATCACC TACGCGCAGG AGCAGGAGTG GTACGACTAC AACTCCGGCT CCAGCTCGGG CAACGCCACG GCCAACCAGG TGGTGCTCAA CCAGGTCCTG CGTGGGTTCT CCTTCGTCGA CAACACCGGC ACCGTGCAGA TGGACGACGA GTACGGGACC ATCGAGAAGC TCTCGGACAG CCCGCTGACC GTCAAGTACA CCTTCAACGA CAAGGCCGTC TGGTCGGACG GTCAGCCCGT CGGCTGTGCG GACTTCCTGC TGGCCTGGGC GGCCAACTCG GGCCGCTACA ACAAGGACGG CACCATCAAC CAGCCGGGGA CGATCCCGGC CGAGCCGGCT TACATCTTCG ATACCGCCTC CACCTCGGGC ATCGACCAGA CCCAGAAGCC GACCTGTGCC GATGGTGACA AGTCGGTCAC GCTGACCTAC GACAAGCCGT TCGTGGACTG GCAGGTCGCG ATCGGTACCT CGGCCGGCTC CAGCATCCTG CCGGCGCACG TGGCGGCCAA GGCGGCCGGC ATCGACACCG CTGGCCTGGT CAAGGCGGTC GAGACCGACG ACGTCGCCAC CCTGACCAAG GTTGCGGACT TCTGGAACAC CGGCTGGACC CTGGACAAGG CCAAGGGCCT GCCGGACGCG ACCACCATTC CCTCGGCCGG CCCGTACTAC GTGGCCGCCT GGGATCCGGG TCAGTCGGTG ACCCTGAAGA AGAACGACAA GTGGTGGGGC ACGCCGGCCA AGACCGACAC GGTCGTGATC CGGTACATCT CGCAGGACCA GCAGGTCTCC GCGCTGGCCT CCGGTGAGGT CGACGTGATC GAGCCGCAGC CGAACCCGGA CGTCAACGCC GCCCTGAACA ACCTGGGCAA CGCGGTGAAG GCCGACTACG GCTCGCAGTT CACCTACGAG CACATGGACC TCTCGGTCAA GAACGGCTTC GCCAACGAGA AGCTGCGCGA GGCCCTGTTC AAGTGCGCCC CCCGGCAGCA GATCGTGGAC AACCTGATCG TGCCGTCGAA CCCGGACGCC AAGTTGCTGA ACTCGCTGAT GCTGATGGAC TTCCAGCCCG GCTACGACCA GATCTCGCAG GCCTCGGGCT TCGCCAACTA CGCCGATGTC GACATCGAAG GCGCCAAGGC GGCGTACGCG GCCTCCGGCG AGGCGCAGGG CAAGACCATC CGGGTCATCC ACATCGATCC GAACCCGCGG CGGACCAACG AGGTCGCGTT GCTCAAGGCC AGCTGCGACC CGGTGGGCTT CAACATCCAG GACGTGCCGC TGTCCAGCGA CAAGTTCGGG CCGACCCTGT CCGCCGGTGA CTACGACATC GCGCTGTTCG CGTGGGCCGG CTCGGGTCTG CTGGGCTCGA TCCCCTCGGA GTACCTGTCC ACCGGTGGCC AGAACTACTC CGGCTGGAAC GACGCGCAGA TGGACCAGGC CCTCAACAGC CTGGCCACGC TGACCGATAC CTCGAAGGCC CTTCCGCTGT TGACCACGGT CGACCAGCGG CTGGCGGCCA ACTACTACTC CTTCCCGATC TTCACCTTCC CGGGAGTGGT GGCCATGAAG TCGAACATCG AAGGTCCGGT GCTCAACGCC ACGCAGACCC AGGCCACCTG GAACATGCAG GACTGGGCGC GCACCGGCTG A
|
Protein sequence | MRRTRAVAIT ALAAATMVVV AACGSSSSSS STTSATSGAT SASGSSAAGG GTSGDSGTIT YAQEQEWYDY NSGSSSGNAT ANQVVLNQVL RGFSFVDNTG TVQMDDEYGT IEKLSDSPLT VKYTFNDKAV WSDGQPVGCA DFLLAWAANS GRYNKDGTIN QPGTIPAEPA YIFDTASTSG IDQTQKPTCA DGDKSVTLTY DKPFVDWQVA IGTSAGSSIL PAHVAAKAAG IDTAGLVKAV ETDDVATLTK VADFWNTGWT LDKAKGLPDA TTIPSAGPYY VAAWDPGQSV TLKKNDKWWG TPAKTDTVVI RYISQDQQVS ALASGEVDVI EPQPNPDVNA ALNNLGNAVK ADYGSQFTYE HMDLSVKNGF ANEKLREALF KCAPRQQIVD NLIVPSNPDA KLLNSLMLMD FQPGYDQISQ ASGFANYADV DIEGAKAAYA ASGEAQGKTI RVIHIDPNPR RTNEVALLKA SCDPVGFNIQ DVPLSSDKFG PTLSAGDYDI ALFAWAGSGL LGSIPSEYLS TGGQNYSGWN DAQMDQALNS LATLTDTSKA LPLLTTVDQR LAANYYSFPI FTFPGVVAMK SNIEGPVLNA TQTQATWNMQ DWARTG
|
| |