Gene Nmul_A1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1550 
Symbol 
ID3785272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1775740 
End bp1777173 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID637811638 
Productthreonine synthase 
Protein accessionYP_412245 
Protein GI82702679 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTACA TTTCCACTCG TGGCGGCATG CCGCCTAAAA AGTTCTCCAG GATTCTTCTC 
GGCGGCCTTG CTCCGGATGG CGGGCTGACG CTGCCGGAAA CCTATCCCAG GTTCGATGAC
GCAAAATTGC AGGAACTGCG GGACATGGAT TACCCGGAAC TTGCGTTCGA GATCCTTTCC
GGTTTTGCGG ATGACATTCC TGCTGCGGAC TTGCGCGCAA TCATCGGGCG AACCTATACT
GCCCAGTCGT TTCAAAACGA TGAAATCACG CCGCTTAAAA CCCTGGAACC GGGGCTGCAC
ATACTTGGTT TGTCGAATGG CCCGACGCTG GCCTTCAAGG ATATCGCCCT GCAATTGCTG
GGCAACCTGT TCGAGTACGC GTTGGACAAG AATGGCGAGC AGCTGAATAT CCTCGGTGCA
ACTTCCGGCG ATACCGGGCC GAGCGCAGAG TATGCCATGC GGGGCAAGCG CGGCATTCGT
GTGTTCATGC TTTCGCCGCA TGGAAAAATG AGCCCGTTCC AGACAGCTCA AATGTTTTCC
CTGCATGATC CGAATATTTT CAATATCGCC ATTCGCGGCG TGTTCGACGA TTGCCAGGAC
ATTATCAAGG CTGTCAGCAA TGATTATGCC TTCAAGCAGA AATACCGCAT TGGTACGGTT
AATTCCATAA ACTGGGCACG CATTGCGGCG CAGACCGTTT ATTACTTCAA AGGTTATTTT
GCTGCCACCC GCTCGAATAC AGAGCAGGTA TCTTTCGCGG TGCCATCGGG AAATTTCGGC
AACATTTGCG CAGGGCACGT GGCGCGCATG ATGGGCCTGC CTATCAAAAA GCTGATACTT
GCCACCAATG AAAATGATGT GCTGGATGAA TTTTTCCGGA CGGGACATTA TCGCCCGCGC
ACGACCGCCG AGACCGTCCA GACCAGTAGT CCCTCGATGG ATATTTCCAA GGCCTCGAAC
TTCGAGCGCT TCATTTTCGA CTTAACCGGA AGAGATGCCG CCAAAGTGAA GGAATTATGG
CAGGCAGTAG ATGAGGGCGG AGCCTTTGAT CTCGCCGATA CGCCCCTGTG GGAGAGAATC
GAAGACTTTG GCCTTGTATC AGGGACCAGC AGTCATGCAG ACCGAATCGC CACCATCCGC
CGGGTGCATG ACCGATATGG CTTGGTGATA GACCCGCATA CAGCTGACGG CGTGAAAGCG
GGATTGGAAC ATCGTGACGC CAGCGTGCCG CTGATCTGTC TTGAAACGGC ATTGCCAGTC
AAATTCTCAG CAAGCATTGT CGAGGCGATC GGACATGAGC CTGAGCGTCC CGCAGGATAT
GAGAATATTG AAGAAAAAGC ACAGCGCTAT GTAGTCATGG ATGCTGACGC AGGGGCGGTC
AAGGCGTTTA TTGTGGAGCA GGCGGGGCCG CCCCAAACCG CGGCGGCGAT ATAA
 
Protein sequence
MLYISTRGGM PPKKFSRILL GGLAPDGGLT LPETYPRFDD AKLQELRDMD YPELAFEILS 
GFADDIPAAD LRAIIGRTYT AQSFQNDEIT PLKTLEPGLH ILGLSNGPTL AFKDIALQLL
GNLFEYALDK NGEQLNILGA TSGDTGPSAE YAMRGKRGIR VFMLSPHGKM SPFQTAQMFS
LHDPNIFNIA IRGVFDDCQD IIKAVSNDYA FKQKYRIGTV NSINWARIAA QTVYYFKGYF
AATRSNTEQV SFAVPSGNFG NICAGHVARM MGLPIKKLIL ATNENDVLDE FFRTGHYRPR
TTAETVQTSS PSMDISKASN FERFIFDLTG RDAAKVKELW QAVDEGGAFD LADTPLWERI
EDFGLVSGTS SHADRIATIR RVHDRYGLVI DPHTADGVKA GLEHRDASVP LICLETALPV
KFSASIVEAI GHEPERPAGY ENIEEKAQRY VVMDADAGAV KAFIVEQAGP PQTAAAI