Gene Nmul_A2691 details

Gene Information

Locus tag: Nmul_A2691
Symbol:
ID: 3785053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3091269
End bp: 3094082
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 57%
IMG OID: 637812781
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_413370
Protein GI: 82703804
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
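
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following minimal sketch checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3091269
end_bp = 3094082
gene_length_bp = 2814
protein_length_aa = 937

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_aa = codons - 1

print("span (bp):", span)
print("span divisible by 3:", span % 3 == 0)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the computed span and protein length agree with the Gene Length and Protein Length fields of the record.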


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCGC TCGCTTCCCC AAGCGAAAGT ATTGATACTA ACCCTAATTA CATCCATGAC 
AAGGAACCGA AGGATGATCC TCTGCGCGAG GACATCCGCC TGCTCGGACG CATGCTGGGC
GACACCCTGC GTGAACAGGA GGGCGAGCCT ACTTTTGATC TGGTAGAAAA TATTCGCCAG
ACAGCTATCC GATTTCGCCG CGACCAGGAT CCGAAGGCGC GACAGGAACT GGACAGACTG
CTTAATCAGT TGAGCAACAA GGCTACCGAA GCAGTTGTTC GCGCATTCAG CCAGTTCTCG
CAGCTATCGA ACATCGCCGA GGATATGCAT CACAACCGCC GGCGTCGAAG CTACCTTTTA
GCCCGGTCGC AACCACAGGC GGGCAGTGTC GCGCGCGCGC TCGACCTGGT TTTCTCCAAG
GAAACAAGCA GCGCGGCCCT TGGCCGGTTC TTTGACAAAG CGCTCGTTTC GCCGGTGTTG
ACAGCCCATC CGACGGAAGT GCAGCGAAGA AGCATACTTG ACTGCCAGTT GGCAATTGCT
CGCTTGTTGA ACGAGCGCGA CCGGGTGCAG CTCACCCCGG ATGAGCTAAG CAAGAACGAA
GAGGGATTGC GCACCAACAT TCAAATATTA TGGCAGACGC GCATGCTGCG CTCGGCGCGT
CTGTCCGTAT ATGACGAAAT CAAAAATGGC CTGGCCTATT ATCACTATAC TTTTCTTACA
GAAGTGCCGC ATCTGTATGC GGAAATCGAA GATCTGCTTG AACGCCGGAT GGGCGATAAA
GCTCCCCGTA TTCCTCCGTT TCTTCGTATC GGCAGCTGGA TAGGCGGCGA CCGCGACGGC
AATCCCTTCG TTACTCATGA GGTATTACTG CACGCCGCCG AGCGGCAATC CGCCCTGGCA
CTGGATTTTT ACATGGGCGA GGTCCACCGG ATCGGCCGCC GGTTAAGCCT GACTGATCGC
CTGGTCGACG TGGACGAAGC CTTGGCGGCG TTGGCCGAGG CTTCGCCCGA TCGGGCGCCG
AGTCGCGCTG ATGAGCCTTA TCGCCGCGCC CTGATCGGTA TTTACGCGCG CCTCGCTGCC
ACCAGCATGG GGCTCGGTCA CGCCATCAGG CAACGGCGCC CGGTCGGTCC GGCGGAGCCT
TACACTGACA GCCTGGAACT CGTGCGCGAT CTCGATATCG TCATCCATTC GCTCGAACAA
CATAAGTCGG AGTTACTCGC GCGGGGCGAT CTACGTCGCG TGCGGCGGGC GGCCGAAGTA
TTCGGTTTTC ATCTCGCGCC GTTGGACATG CGCCAGCACA GCCACATTCA TGAGCAGGTC
GTGGCCGAAC TGTTCGAACG CGGCGCCAAC CTGAAAGGCT ATTCAGATCT GCCGGAAGCA
GAACGTGTGC GCTGTCTCCT GACCGAAGTC AGCAGTCCGC GTCTTCTGCG CTCGCCTTAC
CTCGACTATT CCGAGCTTGC CCAGAGTGAA TTGCATATCG TGGAGACGGC GGCCGAAATC
CACCGACGTT TTGGTCCGGC GGCATTGCCC AATTACGTCA TTTCCAAAGC TGATGGGATA
TCCGACATCC TGGAAGTGGC GCTGCTGCTG AAAGAAGTAG GACTGCTGCG GGCAGGAGAA
AAACCCTGCC TGCATATCAA CATTGTTCCA TTGTTCGAAA CCATTGCCGA CCTGCGTGGA
TGTGCCCGCA TCATGGATGA ATTGTTCTCG ATCCCTTATT ACCGCAAGCT GGTGGATTCG
CGCAATGACG TTCAGGAAGT GATGCTCGGC TACTCCGATA GCAACAAGGA TGGCGGATTC
CTCGCAGCCA ACTGGGAACT TTACAAGGCC GAAACCGAAC TCACGAAGGT CTTTGCCAAG
CATAAAGTCG AGCTGCGGCT GTTCCATGGA CGCGGCGGCA CGGTCGGACG CGGCGGCGGA
CCCAGCTATC AGGCAATACT GGCGCAGCCG CCGGGGAGCG TCAACGGGCA GATACGCATC
ACCGAGCAGG GCGAGGTCAT CGGCAGCAAA TATTCCGATC CGGAGATCGG ACGGCGTAAT
CTGGAAACGC TGGTCGCAGC AACCATCGAG GCGACGCTGC TGAGTCATGA TACGCTTGGC
CAATGCGCGG ATGAATATTA TGGAGTCATG GAAGTACTGG CAGGCGATGC CCTGCGGGCT
TATCGCAGCC TGGTATACGA GACCCCCGGA TTTAACCGGT ATTTTCAGGA ATCGACTCCG
ATAAAGGAAA TCGCGGGACT CAATATCGGC AGCCGTCCCC CTTCCCGCAA AAAATCCGAC
CTGATAGAGG ATCTGCGTGC CATTCCATGG ACGTTCAGCT GGGGTGTCAA CCGCGCGATG
ATTACCGGCT GGTATGGTTT CGGCACCGCA GTGGAAATGT TCGTGCAGCG CGAAGGCAAG
GGCGACAACG GACTGGGACT CCTGCAGAAA ATGTATCAAG CCTGGCCGTT CCTGCAGACA
TTGCTGTCGA ACATGGACAT GGTGCTCGCC AAAACTGACA TGGGTATTGC TTCGCGCTAC
GCTGAACTTG TTACGGATGT GGAACTGCGG CGCGAAGTTT TCGGGCGGAT ACAAAAAGAA
TGGGAACTTA GCGTGAAATG GCTTTTTGCC GTGACCGGTC GAACTGAACT ATTGCAAGAT
AACCCCACGC TGGCGCGCAG CATTCGCAAT CGTACCCCTT ATATCGATCC ACTCAATCAT
TTGCAGGTGG AACTGTTGCG CCGGTACCGG TCCGGGGACG CTACAGATGC GGTAACACGC
GCCATCCAGC TTACTATCAA CGGAGTTGCC GCCGGATTGA GGAACAGCGG GTAA
 
Protein sequence
MASLASPSES IDTNPNYIHD KEPKDDPLRE DIRLLGRMLG DTLREQEGEP TFDLVENIRQ 
TAIRFRRDQD PKARQELDRL LNQLSNKATE AVVRAFSQFS QLSNIAEDMH HNRRRRSYLL
ARSQPQAGSV ARALDLVFSK ETSSAALGRF FDKALVSPVL TAHPTEVQRR SILDCQLAIA
RLLNERDRVQ LTPDELSKNE EGLRTNIQIL WQTRMLRSAR LSVYDEIKNG LAYYHYTFLT
EVPHLYAEIE DLLERRMGDK APRIPPFLRI GSWIGGDRDG NPFVTHEVLL HAAERQSALA
LDFYMGEVHR IGRRLSLTDR LVDVDEALAA LAEASPDRAP SRADEPYRRA LIGIYARLAA
TSMGLGHAIR QRRPVGPAEP YTDSLELVRD LDIVIHSLEQ HKSELLARGD LRRVRRAAEV
FGFHLAPLDM RQHSHIHEQV VAELFERGAN LKGYSDLPEA ERVRCLLTEV SSPRLLRSPY
LDYSELAQSE LHIVETAAEI HRRFGPAALP NYVISKADGI SDILEVALLL KEVGLLRAGE
KPCLHINIVP LFETIADLRG CARIMDELFS IPYYRKLVDS RNDVQEVMLG YSDSNKDGGF
LAANWELYKA ETELTKVFAK HKVELRLFHG RGGTVGRGGG PSYQAILAQP PGSVNGQIRI
TEQGEVIGSK YSDPEIGRRN LETLVAATIE ATLLSHDTLG QCADEYYGVM EVLAGDALRA
YRSLVYETPG FNRYFQESTP IKEIAGLNIG SRPPSRKKSD LIEDLRAIPW TFSWGVNRAM
ITGWYGFGTA VEMFVQREGK GDNGLGLLQK MYQAWPFLQT LLSNMDMVLA KTDMGIASRY
AELVTDVELR REVFGRIQKE WELSVKWLFA VTGRTELLQD NPTLARSIRN RTPYIDPLNH
LQVELLRRYR SGDATDAVTR AIQLTINGVA AGLRNSG
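
Both sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that blocking, translate a nucleotide fragment, and compute GC content. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The function names `unblock`, `translate` and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def unblock(text):
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 0 and stop at the first stop codon."""
    dna = unblock(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = unblock(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCATCGC TCGCTTCCCC AAGCGAAAGT ATTGATACTA ACCCTAATTA CATCCATGAC"
last_line = "GCCATCCAGC TTACTATCAA CGGAGTTGCC GCCGGATTGA GGAACAGCGG GTAA"

print(translate(first_line))  # MASLASPSESIDTNPNYIHD
print(translate(last_line))   # AIQLTINGVAAGLRNSG
```

The two printed fragments correspond to the first twenty and last seventeen residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.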