Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2691 |
Symbol | |
ID | 3785053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 3091269 |
End bp | 3094082 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812781 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_413370 |
Protein GI | 82703804 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCGC TCGCTTCCCC AAGCGAAAGT ATTGATACTA ACCCTAATTA CATCCATGAC AAGGAACCGA AGGATGATCC TCTGCGCGAG GACATCCGCC TGCTCGGACG CATGCTGGGC GACACCCTGC GTGAACAGGA GGGCGAGCCT ACTTTTGATC TGGTAGAAAA TATTCGCCAG ACAGCTATCC GATTTCGCCG CGACCAGGAT CCGAAGGCGC GACAGGAACT GGACAGACTG CTTAATCAGT TGAGCAACAA GGCTACCGAA GCAGTTGTTC GCGCATTCAG CCAGTTCTCG CAGCTATCGA ACATCGCCGA GGATATGCAT CACAACCGCC GGCGTCGAAG CTACCTTTTA GCCCGGTCGC AACCACAGGC GGGCAGTGTC GCGCGCGCGC TCGACCTGGT TTTCTCCAAG GAAACAAGCA GCGCGGCCCT TGGCCGGTTC TTTGACAAAG CGCTCGTTTC GCCGGTGTTG ACAGCCCATC CGACGGAAGT GCAGCGAAGA AGCATACTTG ACTGCCAGTT GGCAATTGCT CGCTTGTTGA ACGAGCGCGA CCGGGTGCAG CTCACCCCGG ATGAGCTAAG CAAGAACGAA GAGGGATTGC GCACCAACAT TCAAATATTA TGGCAGACGC GCATGCTGCG CTCGGCGCGT CTGTCCGTAT ATGACGAAAT CAAAAATGGC CTGGCCTATT ATCACTATAC TTTTCTTACA GAAGTGCCGC ATCTGTATGC GGAAATCGAA GATCTGCTTG AACGCCGGAT GGGCGATAAA GCTCCCCGTA TTCCTCCGTT TCTTCGTATC GGCAGCTGGA TAGGCGGCGA CCGCGACGGC AATCCCTTCG TTACTCATGA GGTATTACTG CACGCCGCCG AGCGGCAATC CGCCCTGGCA CTGGATTTTT ACATGGGCGA GGTCCACCGG ATCGGCCGCC GGTTAAGCCT GACTGATCGC CTGGTCGACG TGGACGAAGC CTTGGCGGCG TTGGCCGAGG CTTCGCCCGA TCGGGCGCCG AGTCGCGCTG ATGAGCCTTA TCGCCGCGCC CTGATCGGTA TTTACGCGCG CCTCGCTGCC ACCAGCATGG GGCTCGGTCA CGCCATCAGG CAACGGCGCC CGGTCGGTCC GGCGGAGCCT TACACTGACA GCCTGGAACT CGTGCGCGAT CTCGATATCG TCATCCATTC GCTCGAACAA CATAAGTCGG AGTTACTCGC GCGGGGCGAT CTACGTCGCG TGCGGCGGGC GGCCGAAGTA TTCGGTTTTC ATCTCGCGCC GTTGGACATG CGCCAGCACA GCCACATTCA TGAGCAGGTC GTGGCCGAAC TGTTCGAACG CGGCGCCAAC CTGAAAGGCT ATTCAGATCT GCCGGAAGCA GAACGTGTGC GCTGTCTCCT GACCGAAGTC AGCAGTCCGC GTCTTCTGCG CTCGCCTTAC CTCGACTATT CCGAGCTTGC CCAGAGTGAA TTGCATATCG TGGAGACGGC GGCCGAAATC CACCGACGTT TTGGTCCGGC GGCATTGCCC AATTACGTCA TTTCCAAAGC TGATGGGATA TCCGACATCC TGGAAGTGGC GCTGCTGCTG AAAGAAGTAG GACTGCTGCG GGCAGGAGAA AAACCCTGCC TGCATATCAA CATTGTTCCA TTGTTCGAAA CCATTGCCGA CCTGCGTGGA TGTGCCCGCA TCATGGATGA ATTGTTCTCG ATCCCTTATT ACCGCAAGCT GGTGGATTCG CGCAATGACG TTCAGGAAGT GATGCTCGGC TACTCCGATA GCAACAAGGA TGGCGGATTC CTCGCAGCCA ACTGGGAACT TTACAAGGCC GAAACCGAAC TCACGAAGGT CTTTGCCAAG CATAAAGTCG AGCTGCGGCT GTTCCATGGA CGCGGCGGCA CGGTCGGACG CGGCGGCGGA CCCAGCTATC AGGCAATACT GGCGCAGCCG CCGGGGAGCG TCAACGGGCA GATACGCATC ACCGAGCAGG GCGAGGTCAT CGGCAGCAAA TATTCCGATC CGGAGATCGG ACGGCGTAAT CTGGAAACGC TGGTCGCAGC AACCATCGAG GCGACGCTGC TGAGTCATGA TACGCTTGGC CAATGCGCGG ATGAATATTA TGGAGTCATG GAAGTACTGG CAGGCGATGC CCTGCGGGCT TATCGCAGCC TGGTATACGA GACCCCCGGA TTTAACCGGT ATTTTCAGGA ATCGACTCCG ATAAAGGAAA TCGCGGGACT CAATATCGGC AGCCGTCCCC CTTCCCGCAA AAAATCCGAC CTGATAGAGG ATCTGCGTGC CATTCCATGG ACGTTCAGCT GGGGTGTCAA CCGCGCGATG ATTACCGGCT GGTATGGTTT CGGCACCGCA GTGGAAATGT TCGTGCAGCG CGAAGGCAAG GGCGACAACG GACTGGGACT CCTGCAGAAA ATGTATCAAG CCTGGCCGTT CCTGCAGACA TTGCTGTCGA ACATGGACAT GGTGCTCGCC AAAACTGACA TGGGTATTGC TTCGCGCTAC GCTGAACTTG TTACGGATGT GGAACTGCGG CGCGAAGTTT TCGGGCGGAT ACAAAAAGAA TGGGAACTTA GCGTGAAATG GCTTTTTGCC GTGACCGGTC GAACTGAACT ATTGCAAGAT AACCCCACGC TGGCGCGCAG CATTCGCAAT CGTACCCCTT ATATCGATCC ACTCAATCAT TTGCAGGTGG AACTGTTGCG CCGGTACCGG TCCGGGGACG CTACAGATGC GGTAACACGC GCCATCCAGC TTACTATCAA CGGAGTTGCC GCCGGATTGA GGAACAGCGG GTAA
|
Protein sequence | MASLASPSES IDTNPNYIHD KEPKDDPLRE DIRLLGRMLG DTLREQEGEP TFDLVENIRQ TAIRFRRDQD PKARQELDRL LNQLSNKATE AVVRAFSQFS QLSNIAEDMH HNRRRRSYLL ARSQPQAGSV ARALDLVFSK ETSSAALGRF FDKALVSPVL TAHPTEVQRR SILDCQLAIA RLLNERDRVQ LTPDELSKNE EGLRTNIQIL WQTRMLRSAR LSVYDEIKNG LAYYHYTFLT EVPHLYAEIE DLLERRMGDK APRIPPFLRI GSWIGGDRDG NPFVTHEVLL HAAERQSALA LDFYMGEVHR IGRRLSLTDR LVDVDEALAA LAEASPDRAP SRADEPYRRA LIGIYARLAA TSMGLGHAIR QRRPVGPAEP YTDSLELVRD LDIVIHSLEQ HKSELLARGD LRRVRRAAEV FGFHLAPLDM RQHSHIHEQV VAELFERGAN LKGYSDLPEA ERVRCLLTEV SSPRLLRSPY LDYSELAQSE LHIVETAAEI HRRFGPAALP NYVISKADGI SDILEVALLL KEVGLLRAGE KPCLHINIVP LFETIADLRG CARIMDELFS IPYYRKLVDS RNDVQEVMLG YSDSNKDGGF LAANWELYKA ETELTKVFAK HKVELRLFHG RGGTVGRGGG PSYQAILAQP PGSVNGQIRI TEQGEVIGSK YSDPEIGRRN LETLVAATIE ATLLSHDTLG QCADEYYGVM EVLAGDALRA YRSLVYETPG FNRYFQESTP IKEIAGLNIG SRPPSRKKSD LIEDLRAIPW TFSWGVNRAM ITGWYGFGTA VEMFVQREGK GDNGLGLLQK MYQAWPFLQT LLSNMDMVLA KTDMGIASRY AELVTDVELR REVFGRIQKE WELSVKWLFA VTGRTELLQD NPTLARSIRN RTPYIDPLNH LQVELLRRYR SGDATDAVTR AIQLTINGVA AGLRNSG
|
| |