Gene Nmul_A1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1403 
Symbol 
ID3786433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1605003 
End bp1606880 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content56% 
IMG OID637811491 
Productalpha amylase, catalytic region 
Protein accessionYP_412098 
Protein GI82702532 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGACAC GACTTCACGA CATGCCTTAT GGCACTCAAA TCACGGACCA CGGAGTGCGC 
TTCCGTCTTT GGGCGCCTGG TTGCAAACAG GTGGGATTGT GCCTTGCAGA CGGACGGACG
AATGAGAAGC CCGAACGGGC ACTCCCCATG ACTCCCCGCG CAGGAGGATG GTTCGAGTCG
ACTGTAGCGG AAGCGGGAGC GGGAATACGC TATGGGTTCG ATGTGAACGG TGCCCGCGTG
CCCGATCCCG CTTCGCGCTT CAATCCGGAT GATGTGCATG GTTTAAGTGA GGTGATCGAT
CCTTCGGCCT TCGCCTGGCA GGATGCGAAA TGGCGTGGAC GGCCATGGGA AGAAGCCGTC
ATCTATGAGC TGCATGTAGG CACGTTCTCG CCCGAAGGAA CCTTCAAGGG CGTCTCCCAG
AGACTGGACT ATCTGGTCGA ACTCGGTGTG ACCGCCATTG AGCTCATGCC GGTTGCCGAT
TTTCCGGGCT CACGAAACTG GGGCTATGAC GGCGTTCTGC TTTACGCGCC GGATAGCCGT
TACGGACGTC CGGAGGATCT CAAGGAACTG GTTCAGGCCG CGCATGAACG GGGACTGATG
ATTCTGCTCG ATGTCGTCTA CAATCACTTC GGGCCGGAAG GAAACTATCT TCACCTTTAT
GCAAAGGAAT TCTTTACTGG GCGCCATCAC ACTCCGTGGG GTGCGGCAAT AAATTTCGAT
GGCCCCGCAA GTGGACGGGT ACGCGAGTTT TTCATCCACA ACGCGTTGTA TTGGCTGCAA
GAGTATCATT TCGACGGCCT GCGGCTCGAT GCAGTCCACG CGATCATCGA CCATTCCAGT
CCGCACATCC TTGTCGAACT GGCGGAGCGT GTTCGTGCCG AGGTCGGTAC AGAGCGCCAC
GTGCATCTGA TCCTGGAAAA CGATGCCAAC AATGCCCGCT ATCTGACAAA TAGTTGGTAC
GACGCGCAGT GGAACGACGA TATTCACCAC GCTCTGCATG TGCTTTCTAC ACAGGAGGGC
GATGGATACT ATGTGGATTA TGCCGACAAT CCGGTACGCC ATCTCGGGCG CTGCCTGGCG
GAAGGGTTTG CCTATCAGGG AGAGATATCC GTTTATCGCG ACGATATGGC GCGTGGAGAG
GCGAGCATTC ATCTGCCGCC CCAGGCTTTC ATTTCCTTTC TGCAATCCCA CGACCAGGCG
GGCAACCGTG CATTTGGCGA GCGTATTAGT CATATTGGCG AGGAAGCGTT GACACGTATG
GCGGCAGCCA TCTATTTGCT GGCTCCCGGC ATTCCCATGC TGTTCATGGG CGAGGAGTTT
GCCGCAAAAT CGCCTTTCCT GTTTTTCTGC GACTTCGGAC CGGAGCTGCG CGAAGCTGTG
ACACAGGGGC GACGCAGGGA ATTTGCCCGC TTTGCGCATT TTACGCAAGA CATGGACGAG
ACAGCGATAC CGGACCCGAA TGCTGTTCAG ACTTTTCTTA TCTCGAAAAT CGATTGGGAT
TTACTCAGGA ATGAAGCTCA CTTTGCCTGG CTGGAGTATT ACCGCAACCT TATGAAACTT
CGCAGCGAAA TTATCGTACC GCGCCTTCGC GGAATGAAGG GTAATTCGGC ACATTTTGAG
GTATTCGCGC CTAAATGCCT GTGGGTCTGC TGGCAGTTGG GCGACGATTC CACCTTGCGG
CTATTGGCGA ATTTTTCCGA CGAGAGCGTG GCGGCTCCCC GGTTCGGCGG GCAGATCGTG
TTTGCCTCAC CCGGTGCAAT ACCCCCATCT TCCCGAACAA CCAAAAGTAT CCCGGTGAAA
AGTATGCTGG CGCCTCGTTC GGTAGTCTGG ATGCTTGAAC CGGCTGCCGA TAAAAGTAAA
GGCAGTTTTA CCGGATAA
 
Protein sequence
MLTRLHDMPY GTQITDHGVR FRLWAPGCKQ VGLCLADGRT NEKPERALPM TPRAGGWFES 
TVAEAGAGIR YGFDVNGARV PDPASRFNPD DVHGLSEVID PSAFAWQDAK WRGRPWEEAV
IYELHVGTFS PEGTFKGVSQ RLDYLVELGV TAIELMPVAD FPGSRNWGYD GVLLYAPDSR
YGRPEDLKEL VQAAHERGLM ILLDVVYNHF GPEGNYLHLY AKEFFTGRHH TPWGAAINFD
GPASGRVREF FIHNALYWLQ EYHFDGLRLD AVHAIIDHSS PHILVELAER VRAEVGTERH
VHLILENDAN NARYLTNSWY DAQWNDDIHH ALHVLSTQEG DGYYVDYADN PVRHLGRCLA
EGFAYQGEIS VYRDDMARGE ASIHLPPQAF ISFLQSHDQA GNRAFGERIS HIGEEALTRM
AAAIYLLAPG IPMLFMGEEF AAKSPFLFFC DFGPELREAV TQGRRREFAR FAHFTQDMDE
TAIPDPNAVQ TFLISKIDWD LLRNEAHFAW LEYYRNLMKL RSEIIVPRLR GMKGNSAHFE
VFAPKCLWVC WQLGDDSTLR LLANFSDESV AAPRFGGQIV FASPGAIPPS SRTTKSIPVK
SMLAPRSVVW MLEPAADKSK GSFTG