Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1664 |
Symbol | |
ID | 8447263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1828886 |
End bp | 1829986 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645040787 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003201043 |
Protein GI | 258651887 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0000133453 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0424658 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCT TGACCCACCG CCCACGAGGA CGCCGGCTGC TGCAGGCGGC CGGCGCGATC GGCCTGGCCG CCCTCCTGGC CGGTTGCGCC TCCAGCTCGA CCGGGGCGGC CGGATCGGGC ACGGCCAGCT CCACCGCCGG GTCGGCGTCC GCCGCACCGA GCGAGACCGG CGGCACGCTG GTGGTCTACT CCGGCCGGAA CAAGGACCTG ATCGGCCCGC TGTTGGATCA ATTCAGCGCG CAGACCGGGG TCGCGGTCGA GTTCCGCGCC GGTGATTCCG GCGAGCTGGC CGCCCAGCTG CTGACCGAGG GCGACGCCTC CCCGGCGGAT GTCTTCTTCT CCCAGGACGC CGGCGCCCTG GGCGCGGTCG CGCAGGCCGG CCTGTTCACC TCCCTGCCGG CGGCGACGGT CGAGGCGGTT CCCGCGGCCT ACGCCGCGAC CGACGGCAGC TGGGTGGGCG TGTCCGGGCG GGCCCGGGTG ATCGTCTACA ACCCGACGCT GGCGCCGAAC CCGCCGGACA CCATCGACGG GTTGCTGGAC CCGCAGTGGA AGGGCCAGAT CGGGTTCGCC CCAACCAACG CGTCCTGGCA GGCGTTCGTC ACCGGGCTGC GGGTGCTGCG CGGGGAGGAC GGGGCCCGGC AGTGGCTGAC CGCGTTCGCC GCCCAGGAGC CCAAGGCCTA CGAGCGCAAC GGGGCGGTGC GCGACGCGGT GAACAGCGGC GAGATCGCCC TGGGCCTGGT CAACCACTAC TACCTGTACG AGAAGATCGC CGCCGACGGG GCGGACGCCG TGGTCGCCCA GAACCAGTAC CTGGCCGCGG GCGACCCGGG TGGGCTGCTG AACGTGGCCG GGGTCGGCGT CCTGGCCTCC GCGCCGCACG CCGAGCAGGC CCAGGTCTTC GTGGACTACC TGCTCTCGCC GGCCGGCCAG GAGTACTTCG CAGCCAAGAC CAAGGAGTTC CCGCTGGTGC CGGGCACCGC CGCGGCCGCC GAGGTGCCGC CGCTGAGCGA GCTGAGCCCG CCGCAGATCG ACCTGTCGCA GCTCAGCTCG CTGGAGCAGA CCCAGCAGCT GCTGTCCGAG GTGGGGTTGC TGACCCGGTG A
|
Protein sequence | MRVLTHRPRG RRLLQAAGAI GLAALLAGCA SSSTGAAGSG TASSTAGSAS AAPSETGGTL VVYSGRNKDL IGPLLDQFSA QTGVAVEFRA GDSGELAAQL LTEGDASPAD VFFSQDAGAL GAVAQAGLFT SLPAATVEAV PAAYAATDGS WVGVSGRARV IVYNPTLAPN PPDTIDGLLD PQWKGQIGFA PTNASWQAFV TGLRVLRGED GARQWLTAFA AQEPKAYERN GAVRDAVNSG EIALGLVNHY YLYEKIAADG ADAVVAQNQY LAAGDPGGLL NVAGVGVLAS APHAEQAQVF VDYLLSPAGQ EYFAAKTKEF PLVPGTAAAA EVPPLSELSP PQIDLSQLSS LEQTQQLLSE VGLLTR
|
| |