Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4400 |
Symbol | |
ID | 8450026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4883724 |
End bp | 4885409 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645043447 |
Product | hypothetical protein |
Protein accession | YP_003203676 |
Protein GI | 258654520 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGGG CTCGAATCGG TGCGTTCGCC CTGGCCGTGG CGGCCACCGG GACCCTGGCG GCGACCGTCG CGTCCCTACC CGCGCAGGCG ACCACCACGG CGTCCTCGGC GCAGGCCGTG GGGACCCCGG TGACCAGCGC GAACGGCGAG GCGACGCTGA TCGTCGACCC GACCAGCGAC CTCGCCGACA CCGGTGCCTC GATCAAGGTC CAGGGCAAGG GTTTCGGGAC CGATCCCGGC GGCATGTACG TCGCGGTCTG CCGGGACGCC GGGGCCACCC CGAACCTGGA CCAGTGCGTC GGCGGGCCCG TCCCGACCAA CCCGACCCCG GGGGCGTGGG CGCACATCGT GGCCAGCGGC ACCGGGGTCA ACGTGGCCAA CTGGAACGGG GGCGGCTTCC AGGTGACCCT GGCCCTGCCC TCGGTGGCCG GCGGGTCCGT CGATTGCGTC AAGTCGGCCT GCGCGCTGTA CACGGTCAGC GACGACGGCA GCGAGCCATC GCTGGACAAC CGGATCCCGC TGGGCTTCAA GGCGCCGACG TCGTCGGCGC CGTCCACGCC GACCAGCGCG ATCGTGCAGC AGGTCGGCTC GCCGACCATT GCCCCCGGGG CGACCCAGTC GGTGATCTTC TCCGGCTTCA AGGGCGGTGA GCAGGTCAAC CTGACCCTGT TCTCCGAGCC GGTCACCCTG TCGCCGGTCA CCGCCGATCC CACCGGGGTG GCCCGGGTCG ACTTCGTGGT CCCGGCCGAC TTCGTGGACG GCGCGCACCG GTTGGAGGCG ATCGGCGCCC AGTCCGGCAC GGTCGGGGTG GCCAGCTTCC AGGTGGTCGT GCCGACCCCC ACCCCGAGCC CCACGCCGAC GCCGAGCCCG ACCCCCTCGC CCAGCCCGTC GCCGACGTCG ACGAGCCAGA CCAGCAGCGC GGCGAGCTCC AGCAGCCTGG CCCCGACGAC CGCGGCGACC ACGACCAGCA CCGACAGCGG GGGCGATTCC GGCGGCTCGA ACTGGTGGAT CTGGCTGATC CTGGCCCTGG TCGTGCTGGC CGGGCTGATC ACCTGGTTCG TCGTCGACCG CCGGAACAAG GAGGCGGCCC GGGCCGAGCA GGAACGGCAG CTGGCGGACG CCGCCAACCG GCAGCAGCCG CCCTACGACC CGATGGCCGA CGCGCCGACG ATGATGATGC CGCCGGCCGA TCCGCGGCCC AGCGGACCGC CGCCGGGGGC CGATCCCTAC GGCCTGCTCT CCGGGCGCAA CCACCCCTAC GGCGTCGACC CGAACGCGCC GACCCGGTAC GACCCGCCGG CCGGTCCGAC CCAGTACATC CCGCCCGATC CGGGGCAGTA CCAAAGCGAT CCGACCCAGG TCATCCCGCC GGGTCAGGCC GGCCGGGGTC AGGGCGGTCC GGGTCAGGGC GGTCCGGGTC AGGGTGGTCC GGGTCGGGGT CCGGCCCCCG ACTGGACGGT CCCGCCCGAG TTCTCGGGCG GCCCGAGCGA GCGTCCGGCC GGCCCGCCGA CGACTCCGCA GCCGCAGTCG CAGCCGCCGG AGCAGGGTCC CCGGACGGCG CAGTTCCGGC CGGACTTCAA CGATCCGAAC GGCCGCGATC CGAACAGCGA CGATCCGAAC GGCCGCGACC CGAACAGCAG CGACGAGTCC GACGGCCCGA CCGACACCGG CCCGCGGTCC CGCTGA
|
Protein sequence | MMRARIGAFA LAVAATGTLA ATVASLPAQA TTTASSAQAV GTPVTSANGE ATLIVDPTSD LADTGASIKV QGKGFGTDPG GMYVAVCRDA GATPNLDQCV GGPVPTNPTP GAWAHIVASG TGVNVANWNG GGFQVTLALP SVAGGSVDCV KSACALYTVS DDGSEPSLDN RIPLGFKAPT SSAPSTPTSA IVQQVGSPTI APGATQSVIF SGFKGGEQVN LTLFSEPVTL SPVTADPTGV ARVDFVVPAD FVDGAHRLEA IGAQSGTVGV ASFQVVVPTP TPSPTPTPSP TPSPSPSPTS TSQTSSAASS SSLAPTTAAT TTSTDSGGDS GGSNWWIWLI LALVVLAGLI TWFVVDRRNK EAARAEQERQ LADAANRQQP PYDPMADAPT MMMPPADPRP SGPPPGADPY GLLSGRNHPY GVDPNAPTRY DPPAGPTQYI PPDPGQYQSD PTQVIPPGQA GRGQGGPGQG GPGQGGPGRG PAPDWTVPPE FSGGPSERPA GPPTTPQPQS QPPEQGPRTA QFRPDFNDPN GRDPNSDDPN GRDPNSSDES DGPTDTGPRS R
|
| |