Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2378 |
Symbol | |
ID | 3784969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2705002 |
End bp | 2706348 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637812467 |
Product | hypothetical protein |
Protein accession | YP_413059 |
Protein GI | 82703493 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.654277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCTC CTGAAAGTAG CGACATCGAG CAGCATGAGG AACCAAAAAA AACCGGAGAG CAGGTCGGCC AGGTGCTGCG CGCTGCGCGG CTGGAACGCG GCCTGGATAT TGAGGACGTT GCGCGCCAGT TACGCTTTGC AGCCCGGCAG GTGACGGCAC TCGAAGAGGA TGAATACGAT AAGCTTGCGG GCGGCCCCTT CCTGCGCGGA TTCGTGCGCA ACTATGCCAA ATTGCTGCAA CTGGATGAAG CGCCGTTATT GAAGTTGCTT GAACAGTCGG TTCCTCCTCC GACGACGCAC GTGGGGCGGC CTCCAAGTGA GGAGATTCCC TTTCCCTCCG GGCAGGAGTA TCTGAAACGC AACGTGGTCC TCGGCGGAGG AATTGTCCTG GCCATCACTC TGCTGGGGTA CGCGATCTAT AGTGGTGACA AGGCTTCCGT TGCCAATCAG CCCGACATGG CAATGGAATC GGAGAAAGAC ACCGGGCAAC CGACTCTTTC ATTCCCGTTT CCATCGCAGG CGCCCCCAGC GGAAGTGCCG GAATCTCAGG CTCCCGCACC TTCCGCGCTT GTTCCCGATA TGGCTTCCCA GCGAGAGCCG GGTATTGCCG CCCTCGAGGA GACAGCACCT TCCGCGGATA CAGGCAGAGA GCAGGACACC GTGGCGTCCG CTCCTAAAGA CGCGGCCGCT GGGAGCGGTG CCGAAGCTTC CGCTGGAGAG CCTCTCACCC TGCCTCTTGC GCCGCCACCG GCGGTCCAGA CCGCTCCCAC AGCACCGGCA GTTCCAGGGG GGGAGCCCGC CAATGTCCCA GTGGCTCCGA AGCCGCCCCA GGTTACAGTT GCTCCAAAGC CATCTGAGGG CCCAGTTACT TCGAAGCCAT CTGGGAGCGC AGTTACCCCG AAGCCGCCTC AGGTCGCAGT TGCTCCGAAG CCACTTGAGG GCCCAGTTAC TTCGCAGCCA TCTGAGAGCG CAGTTACTTC GAAGCCGCCT CAGGTCGCAG TTGCTCCGAA GCCATCTGAG GGCGCAGCTA CTTCGAAGCC ATCTGAGAGC GCAGTTACCC CGAAGCCGCC CCAGGTCGCC GCTACTCCGA AGCCACCTCC TGCAAAAGCT GGAATTCGTC TGACGTTTGC CGGCGAGTCA TGGGTGCAAG TCAAGGATGG CAACGGGAAA TTGCTTCTCT CAAGAGTCAA TCCTCCTGGC AGCGAGCAGG TGCTGCGTGG TAAGCCGCCC TATTCTCTCA TGATCGGAAA CCCGGGGCAG GTCAAGCTTG TCTACAATAG CAAACCAGTC GATCTCTCGA TTTTCGCCAA ACTTCCCGGT GGAATGGCAC ATCTCGTGCT CCAATAG
|
Protein sequence | MEAPESSDIE QHEEPKKTGE QVGQVLRAAR LERGLDIEDV ARQLRFAARQ VTALEEDEYD KLAGGPFLRG FVRNYAKLLQ LDEAPLLKLL EQSVPPPTTH VGRPPSEEIP FPSGQEYLKR NVVLGGGIVL AITLLGYAIY SGDKASVANQ PDMAMESEKD TGQPTLSFPF PSQAPPAEVP ESQAPAPSAL VPDMASQREP GIAALEETAP SADTGREQDT VASAPKDAAA GSGAEASAGE PLTLPLAPPP AVQTAPTAPA VPGGEPANVP VAPKPPQVTV APKPSEGPVT SKPSGSAVTP KPPQVAVAPK PLEGPVTSQP SESAVTSKPP QVAVAPKPSE GAATSKPSES AVTPKPPQVA ATPKPPPAKA GIRLTFAGES WVQVKDGNGK LLLSRVNPPG SEQVLRGKPP YSLMIGNPGQ VKLVYNSKPV DLSIFAKLPG GMAHLVLQ
|
| |