Gene Nmul_A1404 details


Gene Information

Locus tag: Nmul_A1404
Symbol:
ID: 3786434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1606903
End bp: 1610205
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 56%
IMG OID: 637811492
Product: Alpha amylase, catalytic region
Protein accession: YP_412099
Protein GI: 82702533
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
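
The start, end, and length fields in this record are consistent with one another under two assumptions: that the coordinates are 1-based and inclusive, and that the gene length includes the stop codon. A minimal sketch of that arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS.

    Assumes the gene length includes the terminal stop codon,
    which does not code for an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1606903, 1610205)
print(length, protein_length_aa(length))  # 3303 1100
```

Both results agree with the Gene Length (3303 bp) and Protein Length (1100 aa) fields listed in the record.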


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACGAG AAGAAGACCC GCTCTGGTAC AAGGATGCCA TTATCTATGA ATTGCACGTC 
AAGACATTCT TCGACAGCAA TGGCGACGGC ATAGGAGACT TTTCGGGCCT GATTTCCAAG
CTCGACTATC TGGCGGAACT GGGCGTTACC GCGCTCTGGC TTCTTCCATT TTATCCGTCT
CCGGGACGGG ATGACGGTTA CGATATTTCC GATTATCACA ATGTGCACCC GGATGTCGGC
ACCCTGGAGG ACTTTCACCG GTTTATCGCG GAAGCTCACA GTCGAGGTTT ACGCGTAATC
ACGGAACTGG TGCTCAATCA TACTTCCGAT CAACATCCAT GGTTTCAGGC AGCGCGCCGC
GCCCCACCTG GGTCCGTCAA GCGCGACTAT TACGTATGGA GCGACGACAA CACCAAGTAC
AGCGGTGCAC GCAGTATTTT CACGGACACA GAGGGATCCA ACTGGGAATG GGATGATGTA
GCCCAGGCCT ATTACTGGCA CCGCTTTTTT TTCCATCAAC CGGACTTGAA TTTTGATAAC
CCCCATGTCT TTAACGCCAT GATGCATGTG ATGCGGCTTT GGCTGGACGC CGGTGTCGAT
GGCATGCGGC TGGATGCCAT GCCTTATCTG TGCGAGCGTG AGGGGACCAA CTGCGAAAAC
CTGCCGGAAA CTCATGCGGT GATCAAACGC ATGCGCTCCG AACTGGATAA ACACTACCGC
AACCGGATGT TCCTGGCTGA GGCGAACCAG TGGCCGGAGG ATGTACGCGA GTATTTTGGC
GAGGGCGACG AATGCCATAT GGCTTTCCAT TTCCCACTCA TGCCGCGAAT GTACATGGCT
ATTGCGCGCG AAGATCGTCA TCCCATTGTA GAGATCATGG AGCAGACACC AGACATCCCC
GAGAATTGTC AGTGGGCAGT GTTTCTGCGC AATCACGACG AACTGACATT GGAGATGGTC
ACCGACCGCG AGCGGGATTA TCTTCACCAG ACCTACGCAA TCGACCCTCA GGCCCGCCTG
CATCTCGGTA TTCGCCGGCG TCTGGCGCCG CTGATGGACA ATGACCGGCA CCGAATCGAA
TTGATGAATC TGCTGCTCAT GACCATGCCA GGGTCGCCCG TGCTCTACTA CGGAGATGAA
ATCGGTATGG GTGATGACCT TCTCCTCGGT GATCGAAATG GGGTGCGCAC GCCCATGCAG
TGGTCGGGTG CAGTTAACGG CGGATTTTCA ACGGCTGATT CTCAACGGCT TTTCCTGCCG
GCCATCATCG ACCCCGTATA TGGGTTTGGG GCAGTGAACG TGGACTCACA GAGACGGAAT
TCGTCATCCC TGCTCAACTG GATGAAGCGC CTGATTGCGA TGCGCAAGGC ACACCGGACC
TTTGGCCGGG GCACGCTGCG TTTTTTACGG CCTGGCAATC GCAAGATTCT TGCTTATCTG
CGTGAGCATG AGGATGAAAC GATTCTGTGC GTGGCTAATC TGTCCCGCGT CGCGCAGCCG
GTAGAGCTGG ATTTAAGCCA ATTCAGGGGA AGGGTACCGG TTGAATTGAT GGGGCGCACG
CCTTTCCCAC CCGTGGGTGA GCTGCCGTAT CTTCTTACCT TGAGCGCTCA TGGCTTTTAT
GCTTTTCGCC TGACAGCCGA TGTGGCCGCC CCTGCCTGGC ATGAGGAGCG GCAGGTATCG
CCGGATCTGC CTGTCCTTGT GCTTGTGGAT TCGGGTTGGG GCACCCTGTT GAACCGGGGC
GAGGGTAACG GGGGCATGAA AGACCTGATG GCTCGTCGTG CCCGCCAACA GCTCGAGGAG
CAGATCATGC CTCGCTTCTT TTATTCCCAA CCCTGGTTTT TGATGCGAAA TCTCCCCGTC
AGAAAATTTG AGCTGGGGGA GATGCACGAG TGGTCCGCGG AGCAGGGAAG CTGGCTGCTT
GCCACCGTTG TTTTAACGCT GGCCAATGAT GAGACTTATC GCTTTGCCGT GCCTTTGGCA
CTGGTGTGGG AGGACGAGGA TGAGGCGATT GTGAGCACGC TGCTGCATGC CACGCTTGCC
AAGGTTCGCC GCCGGGAGCG GACGGGCGTG CTGTTCGACG CCTTCTGGGA TGACGGCTTC
TGCCGCGCCG TGATCTCCAG CATGCATGAG GGTTCCGCAC TCCTGTTTAA GCGGGGGCAA
GTATCTTTCC ATGCGACCAC CGCTTTTCCC GGCCCGGTGG TTTCAGGCGC GTCAACAACG
GTGACCCGGA CAGTTTCGGA GAGAGGGCGA TTGTTCGTGA ACATGGGCGA CCGGCTGGTA
CTGAAAGGAT ACCGCTGGCT TCTTCCCGGC GTGCATCCCG AGCTGGAGAT GTCGCGTTTT
CTGACGGAGA CGGCAAAATT TACCCACATG GCGCAACTCG CCGGCACCGT GGAGTACACG
GACAGTGAGG AAGGCAATTC CACTCTGGCA ATCCTCGAGC ACTACGCCGA GAATCAGGGT
AGCGCCTGGG CTTATACACA GGACTATCTG CAACGCTACC TGGATGAATG CCGCACACAA
CAAAAGCGTC CCATTGATTC GCGGCATATT GCCTACATGA CCCTCATCAA TACGCTGGGA
TTGCGTACGG CGGAATTTCA CCGGGCACTT GCGCAGGATG ATGCAGAGGG AGCTTTCGGC
GTCGAGCCCA TTACCTCCGA AGATCTTGCG CAGTGGGCGA GCACCGTGCG TGCGCAGATG
GATGAAATGT ACAAATTGCT GGAAGCGAAA TGGCCGGATG TACCCAAATC CGCGCAGGAG
GCCGGCAACG ATCTTTTATC AGCCCGGTCG AAATTCTACC GTCGTATCAC CCGCCTTGCA
GCCATACATC CCCAGGCCTT GAAGGCGCGT TGTCATGGGG ATTACAGCCT GCGTCAGGTG
TGGCTCTCGA ATAATGATTT TCTGATTACG AATTACGGCG GCGGTGCCGA ACGCGCATGG
CGTGAGCGCC GCTGGAAACA GAGTCCCCTC CGCGATGTGG CAGGTATGCT GTTTTCGTTT
TCCGAGGTGG CGGCAGCGGC ACTGGAGCAT GTCACGGATG AATATCCGGA ATCAACCATT
ATGCTTGCAC AACAAGCTGA TAAATGGCGA GTGCTTGCCA GCGGCGATTT CCTCAAAAGC
TATCGCAGGG CGATGAAGGG AAATTCCCTG TTCCCCGCTG ATGCCGGAGT AACCGATGCT
TTGGTTACGC TCTTCATGGT GGAGAAAGCT GTTGCCAGCG TGAGTAACGC GCTCGCGCAA
CAATCGAAGG CAGTCGATGG AACCATGCAG CGACTGATAC GGCTGATGCA ACACAGAAGG
TAG
 
Protein sequence
MGREEDPLWY KDAIIYELHV KTFFDSNGDG IGDFSGLISK LDYLAELGVT ALWLLPFYPS 
PGRDDGYDIS DYHNVHPDVG TLEDFHRFIA EAHSRGLRVI TELVLNHTSD QHPWFQAARR
APPGSVKRDY YVWSDDNTKY SGARSIFTDT EGSNWEWDDV AQAYYWHRFF FHQPDLNFDN
PHVFNAMMHV MRLWLDAGVD GMRLDAMPYL CEREGTNCEN LPETHAVIKR MRSELDKHYR
NRMFLAEANQ WPEDVREYFG EGDECHMAFH FPLMPRMYMA IAREDRHPIV EIMEQTPDIP
ENCQWAVFLR NHDELTLEMV TDRERDYLHQ TYAIDPQARL HLGIRRRLAP LMDNDRHRIE
LMNLLLMTMP GSPVLYYGDE IGMGDDLLLG DRNGVRTPMQ WSGAVNGGFS TADSQRLFLP
AIIDPVYGFG AVNVDSQRRN SSSLLNWMKR LIAMRKAHRT FGRGTLRFLR PGNRKILAYL
REHEDETILC VANLSRVAQP VELDLSQFRG RVPVELMGRT PFPPVGELPY LLTLSAHGFY
AFRLTADVAA PAWHEERQVS PDLPVLVLVD SGWGTLLNRG EGNGGMKDLM ARRARQQLEE
QIMPRFFYSQ PWFLMRNLPV RKFELGEMHE WSAEQGSWLL ATVVLTLAND ETYRFAVPLA
LVWEDEDEAI VSTLLHATLA KVRRRERTGV LFDAFWDDGF CRAVISSMHE GSALLFKRGQ
VSFHATTAFP GPVVSGASTT VTRTVSERGR LFVNMGDRLV LKGYRWLLPG VHPELEMSRF
LTETAKFTHM AQLAGTVEYT DSEEGNSTLA ILEHYAENQG SAWAYTQDYL QRYLDECRTQ
QKRPIDSRHI AYMTLINTLG LRTAEFHRAL AQDDAEGAFG VEPITSEDLA QWASTVRAQM
DEMYKLLEAK WPDVPKSAQE AGNDLLSARS KFYRRITRLA AIHPQALKAR CHGDYSLRQV
WLSNNDFLIT NYGGGAERAW RERRWKQSPL RDVAGMLFSF SEVAAAALEH VTDEYPESTI
MLAQQADKWR VLASGDFLKS YRRAMKGNSL FPADAGVTDA LVTLFMVEKA VASVSNALAQ
QSKAVDGTMQ RLIRLMQHRR
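
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute GC content in plain Python. It is an illustration, not the tool that produced this record. The helper names are hypothetical, and the codon table is the standard assignment, which translation table 11 shares for amino acid codons (the two tables differ in permitted start codons). Only the bases A, C, G, and T are handled.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGGGACGAG AAGAAGACCC GCTCTGGTAC"
print(translate(first_blocks))  # MGREEDPLWY
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.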