Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0388 |
Symbol | |
ID | 3784083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 427023 |
End bp | 429038 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810464 |
Product | transketolase |
Protein accession | YP_411088 |
Protein GI | 82701522 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACAT TTCGAGATTT TGAAGCGCCG GTATTCAAGA ACCTGACCAG CGCCATCCGT GCACTGGCGA TGGATGCCGT GCAAAAAGCC AACTCGGGCC ATCCGGGAAT GCCTATGGGC ATGGCTGAGA TCGCTGAGGT ATTGTGGATA CATCACCTAC GCCATAATCC GGCAAATCCG AAGTGGGCCG ACCGTGACCG CTTCGTTTTG TCCAATGGGC ACGGATCCAT GCTCATTTAC GCCCTGCTGC ACCTCACGGG CTACGATTTG CCGATGGAGG AAATCAAGCG TTTCCGTCAG CTTCATTCCA AGACTCCCGG CCATCCGGAA TATGGTTATA CACCCGGCGT CGAGACGACC ACAGGTCCGC TGGGCCAGGG GATCACCAAT GCTGTGGGAA TGGCTCTGGC AGAGAAAATA CTTGCCTCGG AATTCAACCG TCCCGGCTTC GATATCGTCA ATCACCATAC CTATGTATTC CTGGGCGATG GCTGCCTGAT GGAGGGTATT TCCCATGAGG CCTGCTCGCT TGCAGGCACA TTGGGGCTGG GCAAGCTGAT CTGTTTTTAT GACGACAACG GCATTTCCAT CGATGGGCAT GTGGAGGGAT GGTTCACAGA CGACACGCCC AAACGCTTCG AAGCCTATGG TTGGCATGTT GTGCCGAATG TCAACGGACA TGATCCGGTA GCGATAGAGG CCGCCATCGA AGCTGCCAAG CAGGCCGTGG ACAAACCTTC CATGATCTGC TGCAAGACCG TGATCGGAAT GGGCTCGCCC AACAAGGCAA ATACTCACGA GGTGCACGGC GCGGCGCTGG GGGACTTGGA AATAGCCGCT GCGCGCCCGC ATATCGGGTG GAATCACCTG CCATTCGAGA TCCCCGAGGA TGTCTACCAG AACTGGGATG CGCGCGCAAA AGGACAAAAG CTGGAAGACG GCTGGAATCG CAAGTTCGCA GAGTACGCTG CGAAATATCC GACTGAAGCG GCCGAATTCA GTCGGCGGAT GGCAGGTGAA CTGCCGGAGG GGTGGCAGGA GCACGTGGAT GGCCTGGTTG CACGTGTTCA TGCAAAGGAA GAAACCATTG CAAGCCGCAA GGCATCGCAG AATGCGATTG AAGGACTGGC ACCCAAGTTG CCGGAACTGG TCGGCGGCTC GGCAGATCTG GCCGGATCGA ACCTTACCCT CTGGTCGGGT TCAAAAGGCA TCGCCCGGCA GGATGGCGGC AACTACGTAT ATTACGGCGT GCGCGAATTC GGCATGAGCG CCATCATGAA CGGGCTGGCG CTGCATGGCG GAATCATTCC TTACGGCGCC ACTTTCCTCA TGTTCTCAGA ATATGCGCGG AATGCGCTTC GCATGGCCGC CCTGATGAAA ATACGCTGCC TGTTCGTATT CACCCATGAT TCCATCGGTT TGGGCGAAGA TGGTCCTACC CACCAGCCGG TGGAACAGAC CGCCACATTG CGCTACATCC CCAACATGGA TGTGTGGCGT CCGTGCGATA CGGTCGAGTC GACCGTCGCC TGGGCACGGG CAATCGAGCG CAAGGATGGC CCCTCCACAC TGATTTTCAG CCGCCAGAAC CTTCCCTTTC AGAAACGCGA AGGGAATACG ATCAAGCTGA TCGATAAGGG CGGCTATATC CTGTCGGAAG CCTCCGACAA TCAACCGCGG GCAGTCATCA TTGCCACAGG TTCGGAAGTC GGCCTGGCGA TGATGGCGCA AAAAGCGCTG GCCGAAACGG GAATTCATGT GCGCGTCGTT TCGATGCCCT GCACGAACGT ATTCGATCGC CAGGATGTCG ATTATAAAAG CAGCGTGCTC CCCAAGGGTA TAGGGCGCGT GGCGGTGGAA GCAGGCGTGA CGGATTACTG GCGCAAGTAT GTAGGCCTGG AGGGAGCAGT GGTCGGTATC GATACCTTCG GCGAGTCCGC GCCGGCTGGA GAGCTGTTCA AGCACTTCGG CATCACCGTA GAGAATGTGA TAAAGGCGGT AAACAGCGTC ATTTAA
|
Protein sequence | MGTFRDFEAP VFKNLTSAIR ALAMDAVQKA NSGHPGMPMG MAEIAEVLWI HHLRHNPANP KWADRDRFVL SNGHGSMLIY ALLHLTGYDL PMEEIKRFRQ LHSKTPGHPE YGYTPGVETT TGPLGQGITN AVGMALAEKI LASEFNRPGF DIVNHHTYVF LGDGCLMEGI SHEACSLAGT LGLGKLICFY DDNGISIDGH VEGWFTDDTP KRFEAYGWHV VPNVNGHDPV AIEAAIEAAK QAVDKPSMIC CKTVIGMGSP NKANTHEVHG AALGDLEIAA ARPHIGWNHL PFEIPEDVYQ NWDARAKGQK LEDGWNRKFA EYAAKYPTEA AEFSRRMAGE LPEGWQEHVD GLVARVHAKE ETIASRKASQ NAIEGLAPKL PELVGGSADL AGSNLTLWSG SKGIARQDGG NYVYYGVREF GMSAIMNGLA LHGGIIPYGA TFLMFSEYAR NALRMAALMK IRCLFVFTHD SIGLGEDGPT HQPVEQTATL RYIPNMDVWR PCDTVESTVA WARAIERKDG PSTLIFSRQN LPFQKREGNT IKLIDKGGYI LSEASDNQPR AVIIATGSEV GLAMMAQKAL AETGIHVRVV SMPCTNVFDR QDVDYKSSVL PKGIGRVAVE AGVTDYWRKY VGLEGAVVGI DTFGESAPAG ELFKHFGITV ENVIKAVNSV I
|
| |