Gene Nmul_A1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1251 
Symbol 
ID3786027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1437431 
End bp1439116 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content50% 
IMG OID637811336 
ProductAlpha amylase, catalytic region 
Protein accessionYP_411946 
Protein GI82702380 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTGG GTCCACCCGC CCATTCCATG AGTCGCACTG CCCCTGACAA TTCGAACGCT 
GAAGACGAAT GGTGGAAGAA AACCACCGTT TATCATGTCT ATGTGCGTTC TTTTTACGAC
TCCAATGGCG ATGGAATCGG AGATATCCAG GGCATCATCG AAAAACTGGA TTATCTGCAC
GATCTCGGAT ATGAAACAAT CTGGGTGTCG CCCTTTACGC AAAGTCCACA AAAGGACTTC
GGCTACGACA TCAGCGATTA CCTCTCCATC TCTCCTGAAT ATGGCGATAT GCCGCTCTTT
GAGAAACTGG TTGAAGAGGT GCACCGGCGC AGCATGAAAC TGATATTCGA CCTGGTGCTG
AATCACACTT CGAGCGAGCA TTCCTGGTTC ATCGAATCCG CAAGCTCGCG CGACAATCCC
AAAGCCGACT GGTACGTCTG GAAAGATGGA AAAGGCAAAA AGGGTTTGAG GCGCCCAAAT
AACTGGCGCG CCATGGCAGG TAACAAAGCC TGGACTTACC ATCCGCGACG GAAGCAATTC
TATTACACCG CTTTCCTCCC TTTTCAGCCC GACCTGAACT ACCACAATCC TGAAGTCAAA
CAGGCGATGT TCGAGGTCAT CAGATTCTGG CTGAACAAGG GGGTGGATGG ATTCCGTCTC
GATATCATCA GTGCCATCTA CGAGGACTCA GAATTGCGCA GCAACCCTCC CAGTCCCCGT
TTGACCCCAT CGGATAAATC GCTGTCCATC TTTTTCCAGA ACCTGAAAAA CAATTTTCTC
CACGAAAAGA GCTTCGAATT CGCTATCGAG CTGCGGCGCG TAGTGGATGA GTTCGATAAC
CCCAAAAGGG TGCTTATCGG GGAATCACAT GGAGATGAAG CACTGATCCA TCGCTTCTGT
CGAAACGATG GGCAGCATGG ATTGCACGCT GTTTTTCTTT TCAAGGCCAT TTCCACACCT
TTCAAGGCGG AAAAATACCG CGAAATGCTG ATGACATTCG AGAAGCATTT TCCCGAACCG
CTGATACCCA CTCTCGTCTT TGCCAACCAT GACCGCAACC GTGTCATCAG CCGCCTGGGA
GGGAGTATAG AAAAGGCAAA ACTGCTGGCC CTGTTCCAGT TTACCTGCCG GGGGATTCCC
TTCACCTACT TCGGTGATGA GATTGGCATA CCTCGGGTAA GAATTCCCCT GAAAGACGGA
AAAGATGCGA TTGCCATCCA GCATAAATGG GTACCGCAGT TTCTGGTTGA TCGCAGCAGT
GAAATTCTCA ATCTAGACGA GTGCCGTACC CCGATGCTAT GGAATGAAAG GCCCAGGGCA
GGCTTTTGTG GGAGTTCAGC AGAGCCCTGG TTGCCGGTGG CAGACAGCTT CAGGGAAATA
AATGTGGAAA AACAGATTTC GGAACCGCAT TCCCTTCTCA ATTTTTACAG GAAAATCCTC
CTGTTTCGCA ACAGGACGCC AAGCTTGCAT GCGGGGCGCC TTGAAATCTT GCATGACCTC
TGCAACCGGA AAATTCTCGC CTACCGCAGA ATATTCAATG AAGAGAAGCA CGTAGTGCTC
CTCAACATGT CCCGCCAGCG GGTTAAAATC CCCTTGAATA AACCCGTACT GCTTTCCACG
CATCCTCAAA GCCCTGTGCA TCAATTACAA CCTTTCGAAG GGCGCATCAT CAACGAATCG
CATTGA
 
Protein sequence
MILGPPAHSM SRTAPDNSNA EDEWWKKTTV YHVYVRSFYD SNGDGIGDIQ GIIEKLDYLH 
DLGYETIWVS PFTQSPQKDF GYDISDYLSI SPEYGDMPLF EKLVEEVHRR SMKLIFDLVL
NHTSSEHSWF IESASSRDNP KADWYVWKDG KGKKGLRRPN NWRAMAGNKA WTYHPRRKQF
YYTAFLPFQP DLNYHNPEVK QAMFEVIRFW LNKGVDGFRL DIISAIYEDS ELRSNPPSPR
LTPSDKSLSI FFQNLKNNFL HEKSFEFAIE LRRVVDEFDN PKRVLIGESH GDEALIHRFC
RNDGQHGLHA VFLFKAISTP FKAEKYREML MTFEKHFPEP LIPTLVFANH DRNRVISRLG
GSIEKAKLLA LFQFTCRGIP FTYFGDEIGI PRVRIPLKDG KDAIAIQHKW VPQFLVDRSS
EILNLDECRT PMLWNERPRA GFCGSSAEPW LPVADSFREI NVEKQISEPH SLLNFYRKIL
LFRNRTPSLH AGRLEILHDL CNRKILAYRR IFNEEKHVVL LNMSRQRVKI PLNKPVLLST
HPQSPVHQLQ PFEGRIINES H