Gene Nmul_A1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1149 
Symbol 
ID3784205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1322644 
End bp1323927 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content52% 
IMG OID637811234 
ProductSulfate adenylyltransferase, large subunit 
Protein accessionYP_411844 
Protein GI82702278 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.760256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCTA TTGAAAATAT TTCTTTCGAT CATTCGGAAT TGTTGCGTTT CATTACCGCA 
GGTAGTGTGG ATGACGGCAA AAGCACGCTG ATCGGGCGCT TGCTGCATGA CTCAAAATCG
ATTTTTGAAG ATCAGTTGAG CGCGATCACC CATACTTCGC GCAAGCGCGG GATGGAGGCT
GTCGATCTGT CGCTGCTCAC CGATGGCCTC CAGGCAGAGC GCGAACAAGG CATTACCATC
GATGTGGCCT ATCGTTACTT CGCCACTCCC AAGCGCAAAT TCATCATTGC TGACACGCCA
GGCCATGAGC AATATACCCG CAACATGGTG ACCGGCGCTT CCACCGCCAA TCTTGCCATT
ATCCTGATCG ATGCGCGCAA GGGTGTGCTC ACGCAATCCC GCCGTCATGC CTACCTCGCC
AGCCTCGTCG GTATTCCTCA TCTGGTGGTG GCGGTAAACA AGATGGATCT GGTGAATTAC
TCCCGGGATG TATTTGAACG AATCTGCCAG GAGTTTCACC GTTTCGTTGC CGGGCTCAAT
CTGAAAAACA TCGCCTATAT TCCGATGTCC GCGCTCAACG GCGATATGGT AGTCGAGCGC
GGCAACAACC TCGGCTGGTA CGAAGGCATG ACGTTGATGG ATTTACTGGA AAAGGTTCCG
GTCGACCATG ACATCAACCT TGAAGATTTT CGCTTTCCCG TGCAATTGGT GTGTCGCCCG
CAAACGGAGG AATGGCACGA CTTTCGGGGC TACATGGGCC GTATCGAATC CGGTTCCATC
AGTGTGGGTG ACGAGGTGCA GGTTCTGCCC TCCGGCTTGA CTTCGCGCAT CAAGGAGATT
GTTACCTATG AAGGAAATGT CGAGGAGGCA GTTGCGCCCC AGTCGGTAAC GCTGACGATT
GAAGATCATC TGGACATATC AAGGGGAGAC ATGCTGGTAA AAATTTCCCA GCTTCCCCAG
GTTACAAGAG AATTTGATGC GATGCTGTGC TGGTTGTCTG AGCAGAGCCT CGATCCCCGA
CGCAAATACC TGATCAAGCA TTCTACACGG CTGGTAAAAG CCGTCATATC CCGCATAGAG
TACCGGCTGG ATATCAATAC CCTGAAACAT GAAGGCGCCG ATATTCTCAA AATGAATGAC
ATTGCGCGGG TATCGCTCAA GGTTCATCAA CCTCTCGTAT GGGATGCATA TCAGCGTAAC
CATGCCACCG GCAGCTTCAT CGTGATTGAT GAGGTTACGA ACAATACCGT GGCCGCAGGG
ATGATTTGCC CTTCAAAAGG TTAG
 
Protein sequence
MSAIENISFD HSELLRFITA GSVDDGKSTL IGRLLHDSKS IFEDQLSAIT HTSRKRGMEA 
VDLSLLTDGL QAEREQGITI DVAYRYFATP KRKFIIADTP GHEQYTRNMV TGASTANLAI
ILIDARKGVL TQSRRHAYLA SLVGIPHLVV AVNKMDLVNY SRDVFERICQ EFHRFVAGLN
LKNIAYIPMS ALNGDMVVER GNNLGWYEGM TLMDLLEKVP VDHDINLEDF RFPVQLVCRP
QTEEWHDFRG YMGRIESGSI SVGDEVQVLP SGLTSRIKEI VTYEGNVEEA VAPQSVTLTI
EDHLDISRGD MLVKISQLPQ VTREFDAMLC WLSEQSLDPR RKYLIKHSTR LVKAVISRIE
YRLDINTLKH EGADILKMND IARVSLKVHQ PLVWDAYQRN HATGSFIVID EVTNNTVAAG
MICPSKG