Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2266 |
Symbol | |
ID | 3785428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2575147 |
End bp | 2577531 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637812354 |
Product | sucrose synthase |
Protein accession | YP_412950 |
Protein GI | 82703384 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR02470] sucrose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.856509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCACA AATTTGAAGC CTGGGCTGCC GATCATCGCG GTGATATGTA TACGCTTCTG CGCAGATGGT TCGAACTGGA AAGGCCGCTT TTGCTTCATT CCGATCTGGG AGCGGTTTTT AATGCGTTGA GCGCGGAGCA GGCGTCTTTG CTTGCTGATT CGCAGGTACG AGAAATCGTA AACACTCTGC AGGAAGCGGT ATGCCGGCCG CCAATAGTCT ACATGGCGGC GCGCGAGGAG GCGGGGTGCT GGTGGTACGC ACGCTTGCAT CTGGATAGGC TAATCCCCGA GGCTGTCACC GTGTCGGAAT ATCTCGCTTT CAAGGAGCTG CTGGTAAATC CGGAGGGGGC GAATGAGCCC GTGCTGGAAA TTGATTTTGC GCCTTTTAAT CGCGGTTCTC CAAAGCTCAA GGAAATCCGG TCCATAGGGC AGGGTGTGAT TTTCCTCAAC AAGCAACTTG CCGGAGGACT GTTTGGGCAA CTGGGGTTGG GGTCGGACAA GCTGCTGCAT TTCCTGACAG TCCATTCCAT GGACGGGAAG CAGTTGATGC TGGGCGGCAA TTTTGCCGAT GTGCCGGCGC TGCGTTCCGG ATTGCGCAGG GCTCTGAGCA TGCTTGAGAA GTATCCCGAC GATACCGAGT GGAAGGATGT TGCCGAACCC CTTGGAGGTA TCGGTTTTGC ACCCGGCTGG GGAAATTGCG TGGGCCGTGT GAGCGAGACG ATGAGTTTGC TGGTGGATAT TCTGGAAGCC CCTTCTCCCC AGATCCTGGA GAGCTTCCTC GCCCGTATTC CGATGATCTC GAAACTGCTA ATCCTGTCTC CCCATGGCTA CTTCGGTCAG GATAACGTGT TGGGATTGCC GGACACGGGC GGTCAGGTGG TATACATCCT CGATCAGGTG CGGGCTCTGG AACGGGAAAT GAGCGAGCGC CTGATATTGC AGGGAATAGA TGCGGCGCCA AAAATCCTGA TTGGCACGCG CCTCATTCCT GACGCGGGCG ATACACTTTG CCACCAGCCC CTGGAAAAAA TCCACGGTAC CCAGAATTCA TGGATCGTGC GAGTGCCATT CCGAAAAGGG AGTGGTGAAA TCGTTCGCCA TTGGATTTCC CGGTTCGAGA TATGGCCATA CCTGGAAAAT TTTGCGCATG ACATTGAACG CGAAGCTCTG GCCCAGCTCA GCGGTAGGCC CGACCTGATC ATAGGCAACT ATTCCGATGG AAATCTTGTC GCCTCGCTGA TTTCCAAACG GATTGGTGTA ACCCAGTGCA ATATTGCGCA TGCACTTGAG CAGAGCAAGT ACCTGCACTC GGCGTTGTAT TGGCGGGAGA ACGAAGCCCA GTATCACTTC AACTGCCAAT ATACCGCCGA TCTTATTGCG ATGAACAGCG CTGATTTCAT CATCACCAGC ACCTTTCAGG AAATTGCCGG TACCGAGCAG ACGGTAGGCC AATATGAAAC ATACCAGAAC TATACGATGC CCGGCTTATA CCGGGTGGTG AACGGCATTG ACCTTTTCGA TCCCAAGTTC AATATCGTTT CGCCGGGTGC GGATGCAGAA GTTTATTTTT CTTATCTCGA TCACGAGCGG CGTCTGGACG CCCTGATTCC GGACATCGAG CGTCTCTTGT ACGGGGATGA TCCGGGCGTG CCCTGCCGGG GCTACTTCGC GGATCCTGCG AAACCTTTGA TTTTTACAAT GGCTCGCCTG GACACGGTGA AGAATCTTAC CGGCCTCGCC GCGTGGTTCG GGCAGTGTGA GGCCTTATCG ACTGCCGCCA ACCTTCTGGT AATCGGCGGG CATATCGATC CAGCAGCCTC CTGTGATGGG GAGGAGCGTG CCGAGATCGA GCACATGCAT GCCCTCATGA ACGAGTACAA ACTGGAGGGG CGCATGCGCT GGCTTGGCAC CCGGCTGGAA AAGAATCTTG CCGGTGAGCT GTACCGGCAC GTGGCGGACC GTCGCGGCAT TTTTGTTCAG CCGGCGCGAT TCGAGGCATT CGGGTTGACC ATTATCGAGG CCATGGCCTC CGGTCTGCCT GTATTCGCCA CCTGCTATGG CGGCCCGCGC GAAATCATTC AACATGGGGT TTCGGGCTAT CATTTCGATC CCAACGATGG ATTGGCGGGA GCTTCCGCCA TGGCGGATTT TTTTGAGCGG GTGGCGGCCG ATCCAGGTTT TTGGGACAGG ATTTCGCAGA AAGCCTTGCA AAGGGTCGAA GCGCGCTATA CCTGGCGACT CTATGCCGAG AGAATGATGA CGCTATCACG TATTTACGGT TTCTGGAAGT TCGTCAGTAA ACTGGAGCAT GAAGAAACCG CGCGTTACCT CAACATGTTC TATCACTTGC AGTTTCGGCC GATGGCACAG GCGCTTCCCC AATAA
|
Protein sequence | MLHKFEAWAA DHRGDMYTLL RRWFELERPL LLHSDLGAVF NALSAEQASL LADSQVREIV NTLQEAVCRP PIVYMAAREE AGCWWYARLH LDRLIPEAVT VSEYLAFKEL LVNPEGANEP VLEIDFAPFN RGSPKLKEIR SIGQGVIFLN KQLAGGLFGQ LGLGSDKLLH FLTVHSMDGK QLMLGGNFAD VPALRSGLRR ALSMLEKYPD DTEWKDVAEP LGGIGFAPGW GNCVGRVSET MSLLVDILEA PSPQILESFL ARIPMISKLL ILSPHGYFGQ DNVLGLPDTG GQVVYILDQV RALEREMSER LILQGIDAAP KILIGTRLIP DAGDTLCHQP LEKIHGTQNS WIVRVPFRKG SGEIVRHWIS RFEIWPYLEN FAHDIEREAL AQLSGRPDLI IGNYSDGNLV ASLISKRIGV TQCNIAHALE QSKYLHSALY WRENEAQYHF NCQYTADLIA MNSADFIITS TFQEIAGTEQ TVGQYETYQN YTMPGLYRVV NGIDLFDPKF NIVSPGADAE VYFSYLDHER RLDALIPDIE RLLYGDDPGV PCRGYFADPA KPLIFTMARL DTVKNLTGLA AWFGQCEALS TAANLLVIGG HIDPAASCDG EERAEIEHMH ALMNEYKLEG RMRWLGTRLE KNLAGELYRH VADRRGIFVQ PARFEAFGLT IIEAMASGLP VFATCYGGPR EIIQHGVSGY HFDPNDGLAG ASAMADFFER VAADPGFWDR ISQKALQRVE ARYTWRLYAE RMMTLSRIYG FWKFVSKLEH EETARYLNMF YHLQFRPMAQ ALPQ
|
| |