Gene Information |
Locus tag | Nmul_A2072 |
Symbol | |
ID | 3786076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2363366 |
End bp | 2364319 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812161 |
Product | cysteine synthase A |
Protein accession | YP_412758 |
Protein GI | 82703192 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00015923 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCACT GGTTCAAAGA TAATTCCCAG ACCAGCGGGG CCACCCCGCT GGTCCGGCTT AACCGCATTA CGGATGGCGC TCCGGCAATG GTACTGGCCA AGATCGAAGG GCGTAATCCC GCTTATTCCG TAAAATGCCG TATCGGCGCC GCCATGATCG AAGATGCGGA ACATCGCGGG CTGCTTTATG CCGGAATAGA GCTGGTGGAA CCTACCAGCG GCAATACCGG TATCGCCCTC GCCTCTGTTG CTGCCGCGCG CGGCATACCC CTGACATTGA CCATGCCCGA AACCATGGGG CTGGAACGCC GCAAGCTGCT TCTCGCCTAC GGGGCAAAAC TGGTTCTGAC CGAGGGCGCG CGGGGCATGA AAGGTGCGGT AGCAAAGGCA GAGGAAATCG TTGCTTCCAA TCCGGGCCGA TACCTCCTGC TCCAGCAATT CTCCAACCCG GCCAACCCTG CTATTCACGA GCGCACCACA GGGCCAGAGA TCTGGAACGA TACCGACGGG GCAGTTGATA TTTTTGTTGC CGGCGTGGGT ACGGGAGGCA CCATTACCGG TGTTTCGCGG TATATAAAGG GGACGAAGAA AAAATCCATT CTCTCGGTCG CCGTTGAGCC TGCTGCCAGC CCAGTGATCA CGCAACACCG GGCTGGCGAA CCCCTGGCAC CCGGACCTCA TCGGATTCCG GGAATTGGCG CAGGATTCAT TCCTGCCAAC CTGGATCTCT CCCTCGTGGA TGAGGTACAG CAAATCAGCA ATGAAGACGC AATTCACTAT GCACGCCGTC TTGCACGCGA AGAAGGCATT ATCTCGGGGA TTTCATGCGG AGCAGCGGTT GCAGCCGCGT TAAATCATGC GAAGCGAACG GAGAATGCCG GAAAAACCAT TGTTGTCGTT CTGCCGGATT CGGGAGAACG CTATCTGAGC TCCAACCTTT TTGAGGAGAT GTAA
|
Protein sequence | MPHWFKDNSQ TSGATPLVRL NRITDGAPAM VLAKIEGRNP AYSVKCRIGA AMIEDAEHRG LLYAGIELVE PTSGNTGIAL ASVAAARGIP LTLTMPETMG LERRKLLLAY GAKLVLTEGA RGMKGAVAKA EEIVASNPGR YLLLQQFSNP ANPAIHERTT GPEIWNDTDG AVDIFVAGVG TGGTITGVSR YIKGTKKKSI LSVAVEPAAS PVITQHRAGE PLAPGPHRIP GIGAGFIPAN LDLSLVDEVQ QISNEDAIHY ARRLAREEGI ISGISCGAAV AAALNHAKRT ENAGKTIVVV LPDSGERYLS SNLFEEM
|
| |
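The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the source database: the helper name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon (the gene sequence above ends in TAA).

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Hypothetical helper for illustration only.
    """
    # Assumption: coordinates are 1-based and inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2363366, 2364319)
print(gene_len, protein_len)  # 954 317
```

With these assumptions the result agrees with the Gene Length (954 bp) and Protein Length (317 aa) fields listed above.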