Gene Nmul_A1561 details


Gene Information

Locus tag: Nmul_A1561
Symbol:
ID: 3785283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1794083
End bp: 1795105
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 56%
IMG OID: 637811649
Product: fructose-bisphosphate aldolase
Protein accession: YP_412256
Protein GI: 82702690
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3588] Fructose-1,6-bisphosphate aldolase
TIGRFAM ID:
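
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1794083, 1795105)
print(length_bp, protein_length(length_bp))  # prints: 1023 340
```

Both values agree with the Gene Length and Protein Length fields in the record.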


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACCG ATGAACTCAA GTCAGTGGCC GCAGCCATCG TCGCCAGGGA GAAAGGAATT 
CTGGCAGCGG ATGAAAGCAG TCCTACCATA AAAAAACGTT TTGCTTCGAT CGGCGTTGAA
TCCACCGAGG AAAACCGGCG CCGCTACCGC GAGTTACTTT TCACCGCAGA AGGTATCGAA
CGCCACATCA GCGGCGTCAT TCTGTATGAT GAAACGATAC GGCAGAGTTC GAAGGAGGGG
GTACCGTTTC CGCAGGTACT GGCGGGGCGG GGAATCATAC CGGGCATCAA AGTGGATAAG
AGCGCCAAGC CACTGGCGCT GCAGCCCGGG AACAAAATCA CCGAAGGATT GGACAGTTTG
CGCGACCGCC TGGCGGAATA CAAACAGTTG GGTGCAAAAT TTGCCAAATG GCGGGCAGTC
ATGGAAATCG ATGAGCACTC GCTTCCTTCT GCCTATGCCA TCCGCGCAAA CTGTCACGCC
CTGGCTCGCT ATGCCGCTCT CTGTCAGGAA GCCAGCCTGG TGCCCATTGT GGAACCGGAA
GTTCTGATGG ATGGCGCGCA CGATATCGGA CGGTGCGAAA GCATCACTTC CGCCATGCTC
GAGACTTTGT TCGGAGAACT CGACGCTCAT GGCGTAGTGT TCGAAGGGGC CCTGCTCAAG
CCCAATATGG TCATTCCGGG AAAGAAATGC GCACTCCAGG CCAGTTCTCA GCAAGTTGCG
GAAGCAACGA TCCGCTGCCT GCGCCGTTAT GTCCCGGCGG CAGTGCCGGG AATCGTATTT
CTCTCGGGCG GTCAGAGTCC CGAGGAAGCG ACCGATAACC TGAATGCCAT GAACGTCATA
AGGGGAAACT GCCCCTGGCA ACTCAGCTTT TCTTATGGGC GCGCACTCCA GGAACCGGTT
CTTGCTGCCT GGAAGGGGGA AGAAAAAAAT GTGGCCGAAG CACAGCGCGT GTTTTCCAGA
CGTTGCCAGT TAAATGGCTT GGCGCGGGAA GGACTTTATA ACCGCTCGAT GGAAGACAGC
TGA
 
Protein sequence
MNTDELKSVA AAIVAREKGI LAADESSPTI KKRFASIGVE STEENRRRYR ELLFTAEGIE 
RHISGVILYD ETIRQSSKEG VPFPQVLAGR GIIPGIKVDK SAKPLALQPG NKITEGLDSL
RDRLAEYKQL GAKFAKWRAV MEIDEHSLPS AYAIRANCHA LARYAALCQE ASLVPIVEPE
VLMDGAHDIG RCESITSAML ETLFGELDAH GVVFEGALLK PNMVIPGKKC ALQASSQQVA
EATIRCLRRY VPAAVPGIVF LSGGQSPEEA TDNLNAMNVI RGNCPWQLSF SYGRALQEPV
LAAWKGEEKN VAEAQRVFSR RCQLNGLARE GLYNRSMEDS
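
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below assumes the sequences are pasted in with the blank-separated 10-character blocks shown above. It uses the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGAATACCG ATGAACTCAA GTCAGTGGCC"))  # prints: MNTDELKSVA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would confirm the two listings agree; `gc_percent` can be compared with the GC content field in the same way.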