Gene Nmul_A1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1401 
Symbol 
ID3786431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1597609 
End bp1599714 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content55% 
IMG OID637811489 
Productglycogen debranching protein GlgX 
Protein accessionYP_412096 
Protein GI82702530 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02100] glycogen debranching enzyme GlgX 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCT GGCCAGGCAC TCCCTACCCC CTTGGTGCGA CATACGATGG AGCGGGAACC 
AATTTCTCCC TGTTTTCCGA GGCGGCGGAA CGCGTCGAAC TATGCCTGTT CGATGAAGCC
GGTCGTGAAA CCAGGGTGAA TTTGCCCGAG GTGACAGGAT ACTGCTGGCA CGGCTACCTG
CCGGGCGTAG AGCCTGGACA GCGTTACGGT TTTCGGGTGC ATGGGCCGTG GTCCCCGGAG
CAAGGTAATC GCTGTAATTC CGCGAAGCTG CTGCTCGATC CGTATGCAAA AGGGATCGAT
GGCGACATTA CCTGGGATGA AGCTGTATTC CCCTATCTTT TCGATGACCG CGATGCACGG
AATGACAAGG ATAGCGCGCC TTTCATGCCC CGCAGCATCG TTCATCAGCC GTTTTTCGAT
TGGTCCGGCG ATCGTCAACT CCAGCGCCCC TGGCACGAAA CAGTCATTTA TGAGTTGCAT
GTCAAGGGCT TCACCGCACA GCATCCCGAC ATACCCCCTG AACTGCGGGG CACTTATGCG
GGGCTCGCAC ACTCTGCCTC TATCGATTAT CTGAAACAGC TTGGGGTCAC GGCTGTCGAG
CTGTTGCCAG TGCACTACTT TGTGCAGGAC AAACATCTCC TGGATCAGGG TTTGCGCAAC
TACTGGGGTT ATAACTCCAT TGCCTATTTT GCTCCCCATA GCGCCTACGC CGCGGACAAG
CGTCCGGGAG CCGCAAGCGC CGAATTCAAG CAAATGGTCA AGGCCCTGCA TCAAGCCGAC
ATAGAAGTTA TCCTCGATGT GGTGTACAAC CATACTGCGG AAGGCAACGA TCTCGGGCCG
GTGTTATGCC TGAAGGGAAT CGACAACGCC TATTATTACC AGCTCATGGA AGATCGGCGC
TACTACAGGG ATTACACAGG CACAGGCAAC TGCCTCAATA TGAGGCAACC GCATGTGCTG
CAACTGATCA TGGATTCGCT TCGCTATTGG GTTCTGGAAA TGCACGTCGA TGGCTTCCGC
TTCGATCTCG CATCCGCGCT CGCACGCGAA CTCCATGAAG TGCAAATGCT GAATGCATTT
TTCAACATTA TCCAGCAGGA TCCTGTTACA AATCAGGTCA AGCTGATCGC GGAACCCTGG
GACCTGGGGG AAGGCGGTTA CCAAGTGGGG AAATTCCCGT CAGGCTGGTC TGAATGGAAC
GGCAAGTACC GGGACTGTAT CCGCAATTTC TGGCGCACCC AGGAACCTAC TTTAAGTGAA
TTCGCTTATC GCTTTACCGG AAGCTCAGAC CTTTACGAGG GGAATTCGCG TCATCCTTTT
GCAAGCATCA ATTTTGTCAC CTCGCACGAC GGTTTCACAT TGCGGGATCT TGTCTCCTAT
AACGAAAAAC ACAACCTGGC CAATGGCGAG GACAACAGGG ATGGCACTGA CGACAACCGT
TCCTGGAACT GCGGTGTCGA AGGGCCTACG GATGACATGG AAGTGCTTAC GCTTCGCGCC
CGGCAGCAGC GCAATTTTCT GGCTACCCTG GTATTGTCTC AGGGCGTGCC CATGTTGCTG
GCAGGGGATG AATTGGGTCG CACCCAGCAG GGAAACAATA ATGCGTACTG CCAGGACAAC
GAAATTTCCT GGGTGGATTG GGGGAAGGTG GATACCGGTT TACAGGAATT CACCAGACGC
CTTGCTCGCT TCCGCCGTGA TCACCCGGTC TTCCGCCGAC GCCGGTGGTT TCAGGGGCAA
CCCATTCACG GTGGGGGCCA GGACGATATT GCGTGGTTCA ATTACATGGG AGAACAGGCG
AGCGAGGAAT TATGGGGGAA TGGGGGAATT CAAAGCCTGG GAATTTTTCT CAACGGAGAC
AGTTTCCCGA ATCCCAATGC TCGAGGGGAG CCGGTAAAGG ATGACAGCTT CTACCTGATT
TTCAACGCAC ATTTCGAGCC GATCGATTTC GTCCTGCCCC CCAACCACTG GGGGCTACGC
TGGTTGAGAA TACTCGACAC CAACGAGGGC TGGGTAGAGA ATGCAGCGGA TCAGCCCGGG
CTCGAAGCCG GATCCGCTTT GTCAGTCGCC GCCCGGTCAC TGGTGCTTTT GCAGCGGCAG
GCATGA
 
Protein sequence
MKIWPGTPYP LGATYDGAGT NFSLFSEAAE RVELCLFDEA GRETRVNLPE VTGYCWHGYL 
PGVEPGQRYG FRVHGPWSPE QGNRCNSAKL LLDPYAKGID GDITWDEAVF PYLFDDRDAR
NDKDSAPFMP RSIVHQPFFD WSGDRQLQRP WHETVIYELH VKGFTAQHPD IPPELRGTYA
GLAHSASIDY LKQLGVTAVE LLPVHYFVQD KHLLDQGLRN YWGYNSIAYF APHSAYAADK
RPGAASAEFK QMVKALHQAD IEVILDVVYN HTAEGNDLGP VLCLKGIDNA YYYQLMEDRR
YYRDYTGTGN CLNMRQPHVL QLIMDSLRYW VLEMHVDGFR FDLASALARE LHEVQMLNAF
FNIIQQDPVT NQVKLIAEPW DLGEGGYQVG KFPSGWSEWN GKYRDCIRNF WRTQEPTLSE
FAYRFTGSSD LYEGNSRHPF ASINFVTSHD GFTLRDLVSY NEKHNLANGE DNRDGTDDNR
SWNCGVEGPT DDMEVLTLRA RQQRNFLATL VLSQGVPMLL AGDELGRTQQ GNNNAYCQDN
EISWVDWGKV DTGLQEFTRR LARFRRDHPV FRRRRWFQGQ PIHGGGQDDI AWFNYMGEQA
SEELWGNGGI QSLGIFLNGD SFPNPNARGE PVKDDSFYLI FNAHFEPIDF VLPPNHWGLR
WLRILDTNEG WVENAADQPG LEAGSALSVA ARSLVLLQRQ A