Gene Nmul_A0388 details

Gene Information

Locus tag: Nmul_A0388
Symbol:
ID: 3784083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 427023
End bp: 429038
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 57%
IMG OID: 637810464
Product: transketolase
Protein accession: YP_411088
Protein GI: 82701522
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
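
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (the page does not state the convention, but it is the one under which the listed values match) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 427023
END_BP = 429038


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 2016 671
```

Under these assumptions the result reproduces the listed Gene Length (2016 bp) and Protein Length (671 aa).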


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACAT TTCGAGATTT TGAAGCGCCG GTATTCAAGA ACCTGACCAG CGCCATCCGT 
GCACTGGCGA TGGATGCCGT GCAAAAAGCC AACTCGGGCC ATCCGGGAAT GCCTATGGGC
ATGGCTGAGA TCGCTGAGGT ATTGTGGATA CATCACCTAC GCCATAATCC GGCAAATCCG
AAGTGGGCCG ACCGTGACCG CTTCGTTTTG TCCAATGGGC ACGGATCCAT GCTCATTTAC
GCCCTGCTGC ACCTCACGGG CTACGATTTG CCGATGGAGG AAATCAAGCG TTTCCGTCAG
CTTCATTCCA AGACTCCCGG CCATCCGGAA TATGGTTATA CACCCGGCGT CGAGACGACC
ACAGGTCCGC TGGGCCAGGG GATCACCAAT GCTGTGGGAA TGGCTCTGGC AGAGAAAATA
CTTGCCTCGG AATTCAACCG TCCCGGCTTC GATATCGTCA ATCACCATAC CTATGTATTC
CTGGGCGATG GCTGCCTGAT GGAGGGTATT TCCCATGAGG CCTGCTCGCT TGCAGGCACA
TTGGGGCTGG GCAAGCTGAT CTGTTTTTAT GACGACAACG GCATTTCCAT CGATGGGCAT
GTGGAGGGAT GGTTCACAGA CGACACGCCC AAACGCTTCG AAGCCTATGG TTGGCATGTT
GTGCCGAATG TCAACGGACA TGATCCGGTA GCGATAGAGG CCGCCATCGA AGCTGCCAAG
CAGGCCGTGG ACAAACCTTC CATGATCTGC TGCAAGACCG TGATCGGAAT GGGCTCGCCC
AACAAGGCAA ATACTCACGA GGTGCACGGC GCGGCGCTGG GGGACTTGGA AATAGCCGCT
GCGCGCCCGC ATATCGGGTG GAATCACCTG CCATTCGAGA TCCCCGAGGA TGTCTACCAG
AACTGGGATG CGCGCGCAAA AGGACAAAAG CTGGAAGACG GCTGGAATCG CAAGTTCGCA
GAGTACGCTG CGAAATATCC GACTGAAGCG GCCGAATTCA GTCGGCGGAT GGCAGGTGAA
CTGCCGGAGG GGTGGCAGGA GCACGTGGAT GGCCTGGTTG CACGTGTTCA TGCAAAGGAA
GAAACCATTG CAAGCCGCAA GGCATCGCAG AATGCGATTG AAGGACTGGC ACCCAAGTTG
CCGGAACTGG TCGGCGGCTC GGCAGATCTG GCCGGATCGA ACCTTACCCT CTGGTCGGGT
TCAAAAGGCA TCGCCCGGCA GGATGGCGGC AACTACGTAT ATTACGGCGT GCGCGAATTC
GGCATGAGCG CCATCATGAA CGGGCTGGCG CTGCATGGCG GAATCATTCC TTACGGCGCC
ACTTTCCTCA TGTTCTCAGA ATATGCGCGG AATGCGCTTC GCATGGCCGC CCTGATGAAA
ATACGCTGCC TGTTCGTATT CACCCATGAT TCCATCGGTT TGGGCGAAGA TGGTCCTACC
CACCAGCCGG TGGAACAGAC CGCCACATTG CGCTACATCC CCAACATGGA TGTGTGGCGT
CCGTGCGATA CGGTCGAGTC GACCGTCGCC TGGGCACGGG CAATCGAGCG CAAGGATGGC
CCCTCCACAC TGATTTTCAG CCGCCAGAAC CTTCCCTTTC AGAAACGCGA AGGGAATACG
ATCAAGCTGA TCGATAAGGG CGGCTATATC CTGTCGGAAG CCTCCGACAA TCAACCGCGG
GCAGTCATCA TTGCCACAGG TTCGGAAGTC GGCCTGGCGA TGATGGCGCA AAAAGCGCTG
GCCGAAACGG GAATTCATGT GCGCGTCGTT TCGATGCCCT GCACGAACGT ATTCGATCGC
CAGGATGTCG ATTATAAAAG CAGCGTGCTC CCCAAGGGTA TAGGGCGCGT GGCGGTGGAA
GCAGGCGTGA CGGATTACTG GCGCAAGTAT GTAGGCCTGG AGGGAGCAGT GGTCGGTATC
GATACCTTCG GCGAGTCCGC GCCGGCTGGA GAGCTGTTCA AGCACTTCGG CATCACCGTA
GAGAATGTGA TAAAGGCGGT AAACAGCGTC ATTTAA
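
The GC content field reported for this gene is the kind of value that can be recomputed from a sequence like the one above. The sketch below shows one common way to do it in Python, assuming GC content means the percentage of G and C bases over the full sequence length; how the source database computed its 57% figure is not stated here. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence above, used as a small example.
first_block = "ATGGGAACAT"
print(gc_percent(first_block))  # prints: 40.0
```

Passing the full gene sequence (spaces and line breaks included) to the same function would give the whole-gene figure for comparison with the listed value.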
 
Protein sequence
MGTFRDFEAP VFKNLTSAIR ALAMDAVQKA NSGHPGMPMG MAEIAEVLWI HHLRHNPANP 
KWADRDRFVL SNGHGSMLIY ALLHLTGYDL PMEEIKRFRQ LHSKTPGHPE YGYTPGVETT
TGPLGQGITN AVGMALAEKI LASEFNRPGF DIVNHHTYVF LGDGCLMEGI SHEACSLAGT
LGLGKLICFY DDNGISIDGH VEGWFTDDTP KRFEAYGWHV VPNVNGHDPV AIEAAIEAAK
QAVDKPSMIC CKTVIGMGSP NKANTHEVHG AALGDLEIAA ARPHIGWNHL PFEIPEDVYQ
NWDARAKGQK LEDGWNRKFA EYAAKYPTEA AEFSRRMAGE LPEGWQEHVD GLVARVHAKE
ETIASRKASQ NAIEGLAPKL PELVGGSADL AGSNLTLWSG SKGIARQDGG NYVYYGVREF
GMSAIMNGLA LHGGIIPYGA TFLMFSEYAR NALRMAALMK IRCLFVFTHD SIGLGEDGPT
HQPVEQTATL RYIPNMDVWR PCDTVESTVA WARAIERKDG PSTLIFSRQN LPFQKREGNT
IKLIDKGGYI LSEASDNQPR AVIIATGSEV GLAMMAQKAL AETGIHVRVV SMPCTNVFDR
QDVDYKSSVL PKGIGRVAVE AGVTDYWRKY VGLEGAVVGI DTFGESAPAG ELFKHFGITV
ENVIKAVNSV I
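
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following Python sketch illustrates this on the first and last codons of the gene. It builds the codon table from the standard 64-letter amino acid string in TCAG order; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so this sketch does not model alternative starts. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon (not included in the output)."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGGAACAT TTCGAGATTT TGAAGCGCCG"))  # prints: MGTFRDFEAP
# Last three codons of the gene sequence, ending in the TAA stop.
print(translate("GTCATTTAA"))  # prints: VI
```

The two outputs match the first ten residues (MGTFRDFEAP) and the final two residues (VI) of the protein sequence listed above.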