Gene Information |
Locus tag | Nmul_A0017 |
Symbol | |
ID | 3786455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 16951 |
End bp | 18174 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637810085 |
Product | tetratricopeptide TPR_3 |
Protein accession | YP_410718 |
Protein GI | 82701152 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
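
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the reported protein length excludes the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 16951, End bp 18174
length_bp = gene_length(16951, 18174)
print(length_bp)                  # 1224
print(protein_length(length_bp))  # 407
```

Both results match the Gene Length (1224 bp) and Protein Length (407 aa) fields.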
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCGA TTTCTTCCCT CGCGCTGTTG GTGTTTGCGT CGGCCTTGTT TTTGCCGCTC TTGCCAGGAG CCAGCGCGAA AACTCCGGAG CAGGTTTTTA CGGAGGAAAG GGGGAGCATC GTGTCGATTG ATATGTTGAA TGCACGGGGT GAACTGGCTG TTCAGGGCAG TGGCGTGGCG CTGGGCGCAG ACCAAGTGAT CACTACTTGC GATGTCGCCC GCCAGGCAGA AAGCGCACGA GTGGGCTGGG CGGGAAGGAT ATTCAAGGCC AAGCCGGCAC CATCGCAGAC ACATCTCAAT CTTTGTCGCC TCCGTGTTGC TGGTATGCAC GCTCCGTCGC TTGCTTTAGG TGGCGCAGAA AAGCTCAGGG TGGGACAGCC GGTAAATGCA ATCGGTCTGC CTCATGCAAG GCTCGGCGGC AGGTTGAAAG GAAACAGAAA CGAGGAAAGA AGGAACGGGC GCGACAGGAT CTGCCCGATG TGCGATGGGA GCAAAGTCAC CGGATTGCAG GAGGAACACC GACGTGACCC GATCCTGGCC GAAGGCGTGG TGGCGGTAAT GCGCCCCTAC GCAGGGTCCC GCTACATGAG AATTTCGGCA CCCCTCCTGC CGGGTTTCAG CGGCGGGGGA CTATTCGACG AACAAGGCCA GCTGGTCGGA ATTCTCTCGC CCCAACGTGT AGAAGGTGAA TCGCTTGCTT TCGTATTGCC GTCCGATTGG CTTGACAGTC TCCCGAAAAT GACACAAGCC CCCCGGCCAA TGGACCCAAA GTCAGCGGAC CCAGGGCATG GGCTCGTCTG GCTCAATCAA ACGCTGGCGC TGGAAAAGAA GGCCGACTGG CGCAGGTTGT TGAAGCTCTC GCAGCAGGAG ACCGGGCGCG ACCCGTCGAA TGCTGCTGCC TGGTTTAACG TGGGGATTGC TTCCTGCAAT CTCAAGCAAT ATTCCCAGGC GGTTAACGCA TACCGGGAAG CCATTCGTCA CCATGCGGGC TATGCCGATG CCTGGCATAA ACTGGGGATG GCTTACGCGC ATCTCAAAGA CTACGAGAAC GCAAGCCAGG CCTACGAGGA TGCGGTTCGC CTGGATCCCG ATAATGGCGA GGCCTGGTAC GATCTGGGCA ATACTTACCA TCACCTCAAA AAATATGCGC ACACCATTCA TGCCTACCGG CACGCCCTCC GGATTGATCC GAAAAATTTC AGGGCTTGGT ACAACTGGGA GTGA
|
Protein sequence | MTAISSLALL VFASALFLPL LPGASAKTPE QVFTEERGSI VSIDMLNARG ELAVQGSGVA LGADQVITTC DVARQAESAR VGWAGRIFKA KPAPSQTHLN LCRLRVAGMH APSLALGGAE KLRVGQPVNA IGLPHARLGG RLKGNRNEER RNGRDRICPM CDGSKVTGLQ EEHRRDPILA EGVVAVMRPY AGSRYMRISA PLLPGFSGGG LFDEQGQLVG ILSPQRVEGE SLAFVLPSDW LDSLPKMTQA PRPMDPKSAD PGHGLVWLNQ TLALEKKADW RRLLKLSQQE TGRDPSNAAA WFNVGIASCN LKQYSQAVNA YREAIRHHAG YADAWHKLGM AYAHLKDYEN ASQAYEDAVR LDPDNGEAWY DLGNTYHHLK KYAHTIHAYR HALRIDPKNF RAWYNWE
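
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, translated, and checked for GC content follows. It uses only the Python standard library; the helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard codon table is used here. This sketch does not handle alternative start codons, which is adequate for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record
prefix = "ATGACCGCGA TTTCTTCCCT CGCGCTGTTG"
print(translate(prefix))  # MTAISSLALL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.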