Gene Nmul_A2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2072 
Symbol 
ID3786076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2363366 
End bp2364319 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content57% 
IMG OID637812161 
Productcysteine synthase A 
Protein accessionYP_412758 
Protein GI82703192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01139] cysteine synthase A
[TIGR01136] cysteine synthases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00015923 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCACT GGTTCAAAGA TAATTCCCAG ACCAGCGGGG CCACCCCGCT GGTCCGGCTT 
AACCGCATTA CGGATGGCGC TCCGGCAATG GTACTGGCCA AGATCGAAGG GCGTAATCCC
GCTTATTCCG TAAAATGCCG TATCGGCGCC GCCATGATCG AAGATGCGGA ACATCGCGGG
CTGCTTTATG CCGGAATAGA GCTGGTGGAA CCTACCAGCG GCAATACCGG TATCGCCCTC
GCCTCTGTTG CTGCCGCGCG CGGCATACCC CTGACATTGA CCATGCCCGA AACCATGGGG
CTGGAACGCC GCAAGCTGCT TCTCGCCTAC GGGGCAAAAC TGGTTCTGAC CGAGGGCGCG
CGGGGCATGA AAGGTGCGGT AGCAAAGGCA GAGGAAATCG TTGCTTCCAA TCCGGGCCGA
TACCTCCTGC TCCAGCAATT CTCCAACCCG GCCAACCCTG CTATTCACGA GCGCACCACA
GGGCCAGAGA TCTGGAACGA TACCGACGGG GCAGTTGATA TTTTTGTTGC CGGCGTGGGT
ACGGGAGGCA CCATTACCGG TGTTTCGCGG TATATAAAGG GGACGAAGAA AAAATCCATT
CTCTCGGTCG CCGTTGAGCC TGCTGCCAGC CCAGTGATCA CGCAACACCG GGCTGGCGAA
CCCCTGGCAC CCGGACCTCA TCGGATTCCG GGAATTGGCG CAGGATTCAT TCCTGCCAAC
CTGGATCTCT CCCTCGTGGA TGAGGTACAG CAAATCAGCA ATGAAGACGC AATTCACTAT
GCACGCCGTC TTGCACGCGA AGAAGGCATT ATCTCGGGGA TTTCATGCGG AGCAGCGGTT
GCAGCCGCGT TAAATCATGC GAAGCGAACG GAGAATGCCG GAAAAACCAT TGTTGTCGTT
CTGCCGGATT CGGGAGAACG CTATCTGAGC TCCAACCTTT TTGAGGAGAT GTAA
 
Protein sequence
MPHWFKDNSQ TSGATPLVRL NRITDGAPAM VLAKIEGRNP AYSVKCRIGA AMIEDAEHRG 
LLYAGIELVE PTSGNTGIAL ASVAAARGIP LTLTMPETMG LERRKLLLAY GAKLVLTEGA
RGMKGAVAKA EEIVASNPGR YLLLQQFSNP ANPAIHERTT GPEIWNDTDG AVDIFVAGVG
TGGTITGVSR YIKGTKKKSI LSVAVEPAAS PVITQHRAGE PLAPGPHRIP GIGAGFIPAN
LDLSLVDEVQ QISNEDAIHY ARRLAREEGI ISGISCGAAV AAALNHAKRT ENAGKTIVVV
LPDSGERYLS SNLFEEM