Gene Nmul_A0529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0529 
Symbol 
ID3784518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp604712 
End bp606016 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content50% 
IMG OID637810611 
Producthypothetical protein 
Protein accessionYP_411229 
Protein GI82701663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.474153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCTC AATCCAGCTT CATGATTGTT GCGGCTGTAC GTAACGGGCA GTTGAAAGAC 
CTGCGCGCCT TGCTGGCTTC GATGAATAAC CTGCCCGGTC ATGCTGACCC CAACAATGAA
CTGATGCCAT TTGGGAAGTT TGATCGCCTG CACTTTGCAC GTTTCGTGCT CATAGAAGCA
AGAACCGCGC AGGAAATCAG AGCATTCGGC GTAACGCCGA GACCGTGGCA ACCCACGCTC
GCCTTTCTTG GCGATATTGA CGGGGATATG CAGACTTTTT TTCTGGAACT GATCGAGCGC
GCGGAGCCGG GTCTGAAAAA AATCTTTTCT CATTGCGAAG GTTTTTCCGA GGAAAATCAG
GATCTCCTGG GCTGGATGAA GGCAAACAAT ATAAATGCCA GCGCCACCTA TGTTAACTGG
ATCGGGCGAA CGGTCAGGCA AATCCATGAA GAAGCAGCGC TCCATCGAAG TTTGTCTGCC
TATCTGCCGA AAACTGTTGA CGATGTGGGC CGGGAGAATG TGCGTGCCTT GCGGCAAAAG
CTGTTGTCTT ATGTGGAAAT GGAAAAATAT AAAGGCAGGC TTACGTTAAC CCCGCCAGAA
CCCACGCCCC CCGAATGGAA AATGCGCAAT CTTCTGCATA TGATCGGGGT TCCATTGATC
CTGCTTCTTC TATCTCCGCT ATTACTGGTT ATCGCACTTA TCTTTGCACT ACGTTTGAGA
ATGCTCGAAC GCTCTGACCC TGAGCTCTTT ATCCGGCCCA GCCGTGAACA TTTGGCGGAG
CTTACCGTGC AGGAAGATCG GGATGTCAGT AACCAGTATA GTGTGTTCGG TGACGTGAAA
CCTGGGGGGG TCCGCTTACT GACTTTCAAA TTCGTACTCC TGGTGACCGA CTATTTGGCC
CGGCACATAT ACAACCGTGG ATTTCTCGCC CGAATAAAAA CGATTCATTT TGCCCGGTGG
GTGTTCATGG ACAATAACCA CAGGGTTTTT TTCGCCAGCA ATTACGATGG CAGCCATGAA
AGCTATATGG ATGATTTTAT CAATAAGGTC GGCTGGGGCC TCAATCTTAC CTTCACCAAT
GGTGTCGGCT ACCCTACCAC CCGGTGGATC ATCAAGGAAG GTGCAAACCG GGAACATGCA
TTCAAATATA CGCAAAGGCG GCATCAAATA CCCACCGAGG TTTGGTATAA GGCGTACCCG
GGATTAACGG CCGTTGATCT GGCGCGAAAC AGTCGTATCC GGCAAGGTGT GGAAATTCGG
CAATCCAATG ATGCGGAAAT CCGTGAATGG CTCAGCCTGA TCTGA
 
Protein sequence
MTPQSSFMIV AAVRNGQLKD LRALLASMNN LPGHADPNNE LMPFGKFDRL HFARFVLIEA 
RTAQEIRAFG VTPRPWQPTL AFLGDIDGDM QTFFLELIER AEPGLKKIFS HCEGFSEENQ
DLLGWMKANN INASATYVNW IGRTVRQIHE EAALHRSLSA YLPKTVDDVG RENVRALRQK
LLSYVEMEKY KGRLTLTPPE PTPPEWKMRN LLHMIGVPLI LLLLSPLLLV IALIFALRLR
MLERSDPELF IRPSREHLAE LTVQEDRDVS NQYSVFGDVK PGGVRLLTFK FVLLVTDYLA
RHIYNRGFLA RIKTIHFARW VFMDNNHRVF FASNYDGSHE SYMDDFINKV GWGLNLTFTN
GVGYPTTRWI IKEGANREHA FKYTQRRHQI PTEVWYKAYP GLTAVDLARN SRIRQGVEIR
QSNDAEIREW LSLI