Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1149 |
Symbol | |
ID | 3784205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1322644 |
End bp | 1323927 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637811234 |
Product | Sulfate adenylyltransferase, large subunit |
Protein accession | YP_411844 |
Protein GI | 82702278 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR02034] sulfate adenylyltransferase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.760256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGCTA TTGAAAATAT TTCTTTCGAT CATTCGGAAT TGTTGCGTTT CATTACCGCA GGTAGTGTGG ATGACGGCAA AAGCACGCTG ATCGGGCGCT TGCTGCATGA CTCAAAATCG ATTTTTGAAG ATCAGTTGAG CGCGATCACC CATACTTCGC GCAAGCGCGG GATGGAGGCT GTCGATCTGT CGCTGCTCAC CGATGGCCTC CAGGCAGAGC GCGAACAAGG CATTACCATC GATGTGGCCT ATCGTTACTT CGCCACTCCC AAGCGCAAAT TCATCATTGC TGACACGCCA GGCCATGAGC AATATACCCG CAACATGGTG ACCGGCGCTT CCACCGCCAA TCTTGCCATT ATCCTGATCG ATGCGCGCAA GGGTGTGCTC ACGCAATCCC GCCGTCATGC CTACCTCGCC AGCCTCGTCG GTATTCCTCA TCTGGTGGTG GCGGTAAACA AGATGGATCT GGTGAATTAC TCCCGGGATG TATTTGAACG AATCTGCCAG GAGTTTCACC GTTTCGTTGC CGGGCTCAAT CTGAAAAACA TCGCCTATAT TCCGATGTCC GCGCTCAACG GCGATATGGT AGTCGAGCGC GGCAACAACC TCGGCTGGTA CGAAGGCATG ACGTTGATGG ATTTACTGGA AAAGGTTCCG GTCGACCATG ACATCAACCT TGAAGATTTT CGCTTTCCCG TGCAATTGGT GTGTCGCCCG CAAACGGAGG AATGGCACGA CTTTCGGGGC TACATGGGCC GTATCGAATC CGGTTCCATC AGTGTGGGTG ACGAGGTGCA GGTTCTGCCC TCCGGCTTGA CTTCGCGCAT CAAGGAGATT GTTACCTATG AAGGAAATGT CGAGGAGGCA GTTGCGCCCC AGTCGGTAAC GCTGACGATT GAAGATCATC TGGACATATC AAGGGGAGAC ATGCTGGTAA AAATTTCCCA GCTTCCCCAG GTTACAAGAG AATTTGATGC GATGCTGTGC TGGTTGTCTG AGCAGAGCCT CGATCCCCGA CGCAAATACC TGATCAAGCA TTCTACACGG CTGGTAAAAG CCGTCATATC CCGCATAGAG TACCGGCTGG ATATCAATAC CCTGAAACAT GAAGGCGCCG ATATTCTCAA AATGAATGAC ATTGCGCGGG TATCGCTCAA GGTTCATCAA CCTCTCGTAT GGGATGCATA TCAGCGTAAC CATGCCACCG GCAGCTTCAT CGTGATTGAT GAGGTTACGA ACAATACCGT GGCCGCAGGG ATGATTTGCC CTTCAAAAGG TTAG
|
Protein sequence | MSAIENISFD HSELLRFITA GSVDDGKSTL IGRLLHDSKS IFEDQLSAIT HTSRKRGMEA VDLSLLTDGL QAEREQGITI DVAYRYFATP KRKFIIADTP GHEQYTRNMV TGASTANLAI ILIDARKGVL TQSRRHAYLA SLVGIPHLVV AVNKMDLVNY SRDVFERICQ EFHRFVAGLN LKNIAYIPMS ALNGDMVVER GNNLGWYEGM TLMDLLEKVP VDHDINLEDF RFPVQLVCRP QTEEWHDFRG YMGRIESGSI SVGDEVQVLP SGLTSRIKEI VTYEGNVEEA VAPQSVTLTI EDHLDISRGD MLVKISQLPQ VTREFDAMLC WLSEQSLDPR RKYLIKHSTR LVKAVISRIE YRLDINTLKH EGADILKMND IARVSLKVHQ PLVWDAYQRN HATGSFIVID EVTNNTVAAG MICPSKG
|
| |