Gene Information |
Locus tag | Nmul_A1402 |
Symbol | |
ID | 3786432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1599755 |
End bp | 1604947 |
Gene Length | 5193 bp |
Protein Length | 1730 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811490 |
Product | putative bifunctional 4-alpha-glucanotransferase/malto-oligosyltrehalose synthase |
Protein accession | YP_412097 |
Protein GI | 82702531 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase [TIGR02401] malto-oligosyltrehalose synthase |
|
|
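The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1599755
end_bp = 1604947

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 5193
print(protein_length)  # 1730
```

Both results match the Gene Length (5193 bp) and Protein Length (1730 aa) fields.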
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.701685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATC CGCTTGACCA GCTGTGCGAA TTATATGGCG TGCTGCCCTG CTACAGCGAT ATCTGGGGAA ATACCCGCTA CACTTCACAG GACGCAAAGC GCGCTCTACT CGGGGCAATG GGGGTTGCAG CGGCCACCGA AGAGGAAATT TCAGCGTCCC TGCACATTTT TATAAGGCGC AAATGGGAGC ATGTGTTGCC ACCCGTACAA GTGGTGCGGG AGTGGATGCG ACCCTATCGT ATTGCAGTCG CACTACCGGC CCTCGAGGGC AAAGGGCGCT ATCGCTGGCG CTTGTGCAGG GAATCCGGGA CCGAAGACCA CGACGAATTT ATTCCTGATG CCCTGGAGGA GGTCGAGCGT TATCGCTTCA CCGCTCCTGG CAATTTCGGC AGTTTCGATG ATCCTGGTGA GCAGGAATTC GTGCGGCGGA TTCTGGTGCT GGATCTCTCT GTGGAACCAG GGTATCACCG GCTTTTCATC GAAAAGAGCG ATGGCCTCGC TCTGGCTGAA ATGCCGTTTA TCATGGCTCC TGCCACCTGT TATTGGCCGC CCGGTCTCGA AGGTCCGAGG CGGGTGTGGG GCCCTGCCCT CCAGTTATAT GGGATTCGTT CCCAGCGCAA TTATGGCATC GGGGATTTCA GCGATCTTCG GCGCCTGGTC GAATTCTATG CCAGCGCAGG GGGAAGCACG CTCCTGCTCA ACCCGCTTCA TGCGTTATTT CCCGATGCTC CGGAACACAC CAGTCCCTAC AGCCCCTCCA GCCGCACCTG GTTCAACCCG CTTTATCTGG ATGTGGAAGC CATACCCGAG TTTGCGGAAT GCGAGGAAGC ACGTTCGATA GTGCTCGCGC CGGAATTCCA GGCGCGGCTG CGCGCGCTTC GTGCTACCGA ACAGGTGGAT TACAGAGGCG TTGCGGAGGC AAAATCGGAG GTTCTCGTCT CCCTTTACCG GTATTTCCGT AATACCCATC TGGCGCGAAA TACCGATCGC GCTCGTGTTT TTCGTGCTTT CCAGGCTGAG CACGGCGAGG TTCTTCGCAA GCAGGCACTG TTCGAGGCGC TTCAGGAATA TTTCCGGGCA GAGGATAGCT CTGTATGGGG GTGGCGGGTG TGGCCCGAAG CATACCGCGA TCCCGAAGCT CCGGAAGTGA CCGCGTTCCG CGAAGCTCAT CTCGAGCGCG TGGAATATTT CGAATACCTG CAGTGGCAGG TTTCCCTGCA ACTGGGCTCA GTGGGCACAC GCTCATGGGA AGTAGGACTC AATATTGGTC TAATGTTCGA TCTGGCGATC GGGGTAGCGG AAGGGGGCGG GGCAACGTGG TCCCGCCGTG AACTCTATGC ACTTGAAGCC AGTGCAGGGG CGCCACCCGA TGACTTCAAC CGTCTGGGGC AGAATTGGGG CTTGCCGCCC TGGATTCCTC ACCAGCTTAC TTCATCCGCT TATGCGCCTT TCATTGAAGT GCTGCGCGCC AACATGCGCG ACAGCGGTTC ACTGCGCATA GATCACGTAA TGGGCCTGCG CCGCCTGTTC TGGGTGGTGC GGGGGCTGCC TGCAACGGAG GGGGCCTACA TACGCTATCC CTTTGAAGAT ATGCTGAGCA TATTGGCACT GGAGAGCCAG CGCAACCGCT GCCTGGTGAT AGGTGAAGAT TTGGGTACAG TACCCCCTGA AGTACGTGAA GCCTTGCATC CCGCAAACGT GATGTCCACA CGGTTATTAT ATTTCGAGCG GGTTGATGGC GGACGATTGA AGCCACCGCA GGCGTACCCG GCAAATGCCG TAGTGGCGGT AACAACCCAT 
GACCTTCCGA CTCTGGCAGG ATTCTGGCAA GGGCTCGATA TTGATTTGCG CGATCGTCAC CATCTGTTTC CCGAAGATGA GATGCGGAAC AGGCAAATCG TGGAGCGAGC GGCAGACCGT GCACAGCTCC TGGTGGCATT GGAAGGGGAA GGCGTATTGC CGTCGAGAGG AGGACTTCAT CAGGTCGGCT TTCCAGAGAT GACGGCGGAA CTCGCCGCAG CAGTGTACAC CTATCTGGCC CGTGCACCGT CCAAACTTCT GCTGCTGCAG CTCGAAGATG GCTTCGGAGT ACTTGAGCAA CCGAATCTGC CCGGGGCGAC AGGAGACACT TATCCATCCT GGCGTCTCAA ACTGCCCCTC AATCTGGAGG AGTGGCAGGG AAGCGCCTGG CTGCAGGGGA TAATCCTGGC ATTGCGCCGG GAACGCCCCT TGTCGCAAGC GCCCTCCTCC TCAGAGGGTG AGGTCGCGCA GGATGTACAA GTGCAGATTC CCAGGGCAAC TTACCGCTTG CAATTGAATC GGGACTTCAA TCTGAGGCAG GCAACGGAAC TGATCCCATA CCTGGATGAG CTGGGCATCA GTCATTGCTA TCTTTCGCCC CTCCTCAAAG CGCGCCCCGG CAGCATCCAT GGCTATGACG TAACCGACCA CGGCAGGCTC AATCCGGAGA TCACGAGCGC CCGCGACTTT GAGCGGTTTG CCGCTGTCTT GAAGCGCCAT GGAATGAGCC AGATCATGGA TGTGGTGCCC AATCACATGT GCATCACCGG CGTTGATAAC GAATGGTGGC TGGATGTACT GGAAAACGGT CCGGCATCCC GCTTTGCCAG CTATTTTGAC ATCGACTGGT ACGTGATGGG AGAGCACCTG CCGGGCCAGG TGCTGCTCCC TGTTCTGGGC GATCACTATG GCACCGTTCT GGAGAACGGG GAGCTCAAAC TCGCGTTCGA CATCGAACAA GGCTCCTTCA GTGTTTTTTA TCATGAGCAC AGGTTTCCTG TAGATCCGCG TGAATATCCC CGTATTCTTG GGCATGATCT GAGAAGGCTC GAGATGCGAC TGGGGGAGCA GCACCCGGAG TTTCTGGAAC TCCAGTCCCT GATTACGGCA TTAACCCATT TGCCGCTGCG AGAGCGCGTT TCCCCCGATG CGGTCGCGGA GCGGGTACGC GACAAGGAGA TACACAAGCG CCATCTGGCT TCACTGTTCG TGAAGAGCGC CGACATCGCC CAGTTCGTCC AGGAAAATAT CACTCTATTC AACGGTGACG CACCGGGGCA ACCCAGGAAT TTCGATCTGC TACATGAACT GCTGGCCGTC CAGGCCTATC GCCTGGCTTT CTGGCGTGCG GCAGCGGACG AAATCAATTA CCGCCGCTTC TTCGACATCA ATGATCTGGC GGCGCTGCGC ATGGACAACC CGGAAGTATT CGAAAGCACG CACCGCCTGG TTCGTGAACT GATCGCGCGC GGCTATGTAA GCGGATTACG CATCGATCAT CCGGATGGTT TATACGCGCC CCAGGAATAC TTCGAGCGCT TGCAGGCGAT GGCTGCGGCA GCACTGTTTC CGGGAACGGT GAAGGACGGC GCCAAGTCGC TCTATATCGT CGCAGAGAAA ATACTTGCGT CATATGAACA TCTTCCGAAG GCCTGGTCCA TCCACGGCAC TACGGGATAC GACTTTGCCG CGGCTTGCAC CGGTCTTTTT GTCGACACCC ACGCCGCCGG GGAGTTTACC CGCATCTATG AGCGTTTTAT CCGGGCACGT CCGGATCTCG ATGCAATGAT ACGGGCGAAC AAGCATCTGA 
TCATGGATCG TGCGCTGGCA GGTGAGCTTC AGGTACTGGC AATACAATTG GCACGTATCG CCAAGGGCGA CCGCCGTACA TGCGATTTCA CCTTCAACAG TCAGCACAGC GCACTTGCCC AGGTTGTCGC CAATTTTCCT GTCTATCGTA CTTATGTATC CGATTGTGAA AGTTCTGCTG ATGATGTCCG CTACGCCGAC TGGGCGGTTG AGGTAGCCAA GAAGCGCAGT CAGGCGGTTG ATACCACCAT CTTCGATTTC GTGCGGGATG TATTGCTGGG ACGGCAGGCC AAAGGTCAAG CCGAAGCATA CAGGAACGCC ATATGCACCT TTGCCATGAA GTTCCAGCAG TACACCAGCC CGGTAATGGC CAAAGCGATG GAGGATACGA CCTTCTATCA ATACAACCGG CTGGTATCGC TCAACGAGGT CGGCAACGAG CCGCAGCGGT TCGGGGTGTC CCTCGCAGCC TTTCACCGGG AGAATCAGGA GCGAGCGAGC CATTGGCCGC ATGCCATGCT GTCCACTTCA AGTCACGACA GCAAGCGGTC GGAGGATGTG CGGGCGCGCA TAAGCGTTCT CTCCGAAATA CCCGATCAGT GGGCACAGGC TCTTAAACGA TGGAACAGGC TCAATCGCAG CAGCCGTTGG GAGCTGGACA ACACTTATGC GCCGAGCCGG AATGACGAGT ACCTGCTATA CCAGATATTG CTGGGCATAT GGCCTTTCGA CACACCGCAT GCCGAGGAAT TGGCAAACCT GTCCGACCGC GTAGTTGCCT ATATGCGCAA GGCCGCGCGA GAAGCGAAGG TGAACAGTTC CTGGATCAAC CCTGATAGCG AATATGAGGC AGCAATGCAG GACTTCGTGC ATGCACTTCT CTCCGAGCAG CCCACCAATC TGTTCCTGCG CGACTTTCTG CCTTTCCAGC AGCGGGTTGC ATGGGTGGGT GCCTTCAACA GCTTGTCACA AGTGTTACTC AAACTGACCT CACCGGGAGT ACCAGACATC TACCAGGGCA ATGAAACCTG GGATTTCAGC CTCGTGGACC CGGACAACAG GCGTCCTGTG GACTACACGG CGCGTCGCAG GTCCTTGCAG GCCATCCGCT CGATGCATGC TGAAGAAGGT CCCGGGGCAT GCGCCCAGCA TCTGATGGAG AACCTGCGGG ACGGCAGGAT CAAGCTCTAT CTGACCTGGA AAGCGCTCAC GTTCCGCCGC GAGCACGAAC AGCTTTTCCG CGACGGCGAC TACCTGCCGC TAAAAGCGCA TGGAGACTGC AGCGAACATG TATGCGTGTT TGCACGGCGC CGGGAAAATG AAATTATTGT CGTTGCCGTA CCGCGATTGC TCGGTAAGCT CATCGGAGAA CAACACAGGT TCCCAGTCGG CAAATCCATC TGGACCGACA CCTGGGTGGA ACTGCCTTCA GACGAGTTGC GCGAGAAGTG GATCAATGTG TTGACTGGCG AGATTCTTGC CACCCAGAGG ACGGAGGAGG CATGTGGGAA GTTTGGGTTG GCTCACCTTT TCGGGACGTT TCCTTATGCG TTGCTATCCC CTTTCGAGCA GACAACGGGA TGA
|
Protein sequence | MSNPLDQLCE LYGVLPCYSD IWGNTRYTSQ DAKRALLGAM GVAAATEEEI SASLHIFIRR KWEHVLPPVQ VVREWMRPYR IAVALPALEG KGRYRWRLCR ESGTEDHDEF IPDALEEVER YRFTAPGNFG SFDDPGEQEF VRRILVLDLS VEPGYHRLFI EKSDGLALAE MPFIMAPATC YWPPGLEGPR RVWGPALQLY GIRSQRNYGI GDFSDLRRLV EFYASAGGST LLLNPLHALF PDAPEHTSPY SPSSRTWFNP LYLDVEAIPE FAECEEARSI VLAPEFQARL RALRATEQVD YRGVAEAKSE VLVSLYRYFR NTHLARNTDR ARVFRAFQAE HGEVLRKQAL FEALQEYFRA EDSSVWGWRV WPEAYRDPEA PEVTAFREAH LERVEYFEYL QWQVSLQLGS VGTRSWEVGL NIGLMFDLAI GVAEGGGATW SRRELYALEA SAGAPPDDFN RLGQNWGLPP WIPHQLTSSA YAPFIEVLRA NMRDSGSLRI DHVMGLRRLF WVVRGLPATE GAYIRYPFED MLSILALESQ RNRCLVIGED LGTVPPEVRE ALHPANVMST RLLYFERVDG GRLKPPQAYP ANAVVAVTTH DLPTLAGFWQ GLDIDLRDRH HLFPEDEMRN RQIVERAADR AQLLVALEGE GVLPSRGGLH QVGFPEMTAE LAAAVYTYLA RAPSKLLLLQ LEDGFGVLEQ PNLPGATGDT YPSWRLKLPL NLEEWQGSAW LQGIILALRR ERPLSQAPSS SEGEVAQDVQ VQIPRATYRL QLNRDFNLRQ ATELIPYLDE LGISHCYLSP LLKARPGSIH GYDVTDHGRL NPEITSARDF ERFAAVLKRH GMSQIMDVVP NHMCITGVDN EWWLDVLENG PASRFASYFD IDWYVMGEHL PGQVLLPVLG DHYGTVLENG ELKLAFDIEQ GSFSVFYHEH RFPVDPREYP RILGHDLRRL EMRLGEQHPE FLELQSLITA LTHLPLRERV SPDAVAERVR DKEIHKRHLA SLFVKSADIA QFVQENITLF NGDAPGQPRN FDLLHELLAV QAYRLAFWRA AADEINYRRF FDINDLAALR MDNPEVFEST HRLVRELIAR GYVSGLRIDH PDGLYAPQEY FERLQAMAAA ALFPGTVKDG AKSLYIVAEK ILASYEHLPK AWSIHGTTGY DFAAACTGLF VDTHAAGEFT RIYERFIRAR PDLDAMIRAN KHLIMDRALA GELQVLAIQL ARIAKGDRRT CDFTFNSQHS ALAQVVANFP VYRTYVSDCE SSADDVRYAD WAVEVAKKRS QAVDTTIFDF VRDVLLGRQA KGQAEAYRNA ICTFAMKFQQ YTSPVMAKAM EDTTFYQYNR LVSLNEVGNE PQRFGVSLAA FHRENQERAS HWPHAMLSTS SHDSKRSEDV RARISVLSEI PDQWAQALKR WNRLNRSSRW ELDNTYAPSR NDEYLLYQIL LGIWPFDTPH AEELANLSDR VVAYMRKAAR EAKVNSSWIN PDSEYEAAMQ DFVHALLSEQ PTNLFLRDFL PFQQRVAWVG AFNSLSQVLL KLTSPGVPDI YQGNETWDFS LVDPDNRRPV DYTARRRSLQ AIRSMHAEEG PGACAQHLME NLRDGRIKLY LTWKALTFRR EHEQLFRDGD YLPLKAHGDC SEHVCVFARR RENEIIVVAV PRLLGKLIGE QHRFPVGKSI WTDTWVELPS DELREKWINV LTGEILATQR TEEACGKFGL AHLFGTFPYA LLSPFEQTTG
|
| |
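The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned and translated to check it against the protein sequence. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon assignments are those of the standard code, which translation table 11 shares for elongation; alternative start codons permitted by table 11 are not handled, which is harmless here because the gene begins with ATG. Only the first 30 bases of the record are used as input.

```python
# Amino acid assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the Gene sequence field above.
gene_prefix = "ATGAGCAATC CGCTTGACCA GCTGTGCGAA"
print(translate(gene_prefix))  # MSNPLDQLCE
```

The output matches the first block of the Protein sequence field. Although the gene lies on the minus strand, the Gene sequence field is already given in coding orientation (it starts with ATG and ends with TGA), so no reverse complement is needed. Passing the full gene sequence to `gc_percent` would give a figure to compare with the GC content field.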