Gene Information |
Locus tag | Arth_0614 |
Symbol | |
ID | 4446895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 659594 |
End bp | 662458 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688412 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_830113 |
Protein GI | 116669180 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA CTGCAGTAAA TCCAGGCGCT CATCTCGCGG CAGGCACTGA TCCTGCCGGT TCCGATTCCA GCGCTGATCT CGCGTCCGAA CTCCGGGCGG ATGTCCGGCG CGTCTCCACG CTGCTGGGCG AATCGCTGGT CCGCCAGCAC GGCCCGGAGC TCCTCGACCT CGTCGAGCAG GTCCGCCTGC TGACCAAGGA GTCCAAAGAG GCGGCCCGCG GAGGCGCGGA CGCCACCGGC CCTTGGAGCT CGCACGACGT CGTCGCCCAG GTCCGCGAAC TGCTCGGCTC CCTGCCCATT GACCAGGCCA CCGACCTGGT CCGTGCTTTT GCGTTCTACT TCCACCTGGC CAACGCCGCC GAACAGGTCC ACCGGGTCCG GGGCCTGCGG ACCCGGCAGG AGAAGGACGG CTGGCTGGCC AAGGCCGTGG AGGAGATCTC CGGCCAGGCC GGACCGGCAG TCCTCCAGGA AGTGGTCAAC GAGCTGGATG TCCGCCCCAT CTTCACAGCC CACCCCACAG AAGCCTCCCG CCGCTCCGTC CTGGACAAGA TCCGCCGGCT TTCCGACGTC CTGGCAGTTT CCACGGAGGA AGGCACCTCG GCCCGCCGCC GCCAGGACCG CCAGCTCGCC GAGGTGATCG ACCAGATGTG GCAGACGGAC GAGCTCCGCC AGGTCAGGCC CACCCCCGTG GACGAGGCCC GCAACGCCAT CTATTACCTG AACAGCATCC TCACCGATGC CATGCCTGAA ATGCTTTCGG ACCTCTCCGA GCTCTTGGGT GAGCACGGCG TGACGCTGCC GTCGCAGAAA GCCCCCATCC GCTTCGGCTC GTGGATCGGC GGCGACCGAG ACGGCAACCC GAACGTCACT GCCGGCGTCA CCCGCGAGAT CCTGCAGATC CAGAACCAGC ACGCCGTCCG GATCAGCATT GCCATGATCG ACGAGCTCAT TGCCATCCTG TCCAACTCCA CCGCGCTGGC CGGGGCGGAC CGGGAGCTGC TGGACTCGAT CGATACGGAC CTCAAGAACC TGCCGGGCCT GGACCGCCGG GTCCTCGAAC TCAATGCCCA GGAGCCCTAC CGGCTCAAGC TCACCTGCAT CAAGGCCAAG CTCCTCAATA CCGCCAAGCG CGTGGCGGCC GGCTCCTCCC ATGAGCCTGG CCGGGACTAC ACGTCAACGG CCGGCGTCCT GGCCGAGCTG GGGCTGCTGG AGGAATCGCT GCGCAACCAC CACGCCGGGC TTGCCGCCGA CGGTGCGCTG GCACGGGTGC GCCGCGCCAT CGCCTCCTTC GGACTGCACC TCGCCACCCT GGACATCCGC GAGCACGCCG ACCACCACCA TGACGCCGTC GGGCAACTCA TGGACCGCCT GGGCGGTCCC GGCGTGCGGT ACTCCGAACT CAGCCGTGCC GAGCGCCTCG AGGTGCTGGG AGCGGAACTG GGATCCCGCC GTCCGCTGTC CGGCCACCCG ATAAAGCTCG ACGGCGTGGC TGACGGCACG TACGACGTCT TCCGTGAGAT CCGCCGGGCC CTTAAGACCT ATGGCCCGGA CGTCATCGAG ACATACATCA TTTCCATGAC CCGCGGCGCC GACGACGTCC TTGCCGCAGC TGTGCTGGCC CGCGAAGCGG GGCTCGTCAG CCTCTTCGGT GACGCGCCCT ACGCCAAGAT CGGTTTTGCT CCGCTGCTGG AAACGGTGGA GGAGCTGCGG GCTTCAGCGG AGATCGTGGA CCAGCTGCTG TCCGATCCGT CCTACCGGGA ACTCGTCCGG CTGCGCGGCG ACGTACAGGA AATCATGCTT 
GGCTACTCGG ACTCCAACAA GGAATCCGGC GTGATGACCA GCCAGTGGGA GATCCACAAG ACCCAGCGGA AGCTGCGCGA TGTCGCGGCC AAGCACGGCG TCCGGGTGCG CCTCTTCCAC GGCCGCGGCG GATCCGTGGG CCGCGGCGGC GGGCCCACCT ATGACGCGAT CATGGCGCAG CCCAACGGCG TCCTCGAAGG TGAAATCAAG TTCACCGAGC AGGGTGAAGT CATCTCGGAC AAGTACTCGC TGCCTGAGCT GGCCCGCGAG AACCTGGAGC TTTCCCTGGC CGCCGTGATG CAGGGCTCAG CGCTGCACCG CACCCCGCGC ACGTCCGAGG ACCAGCGCGA GCGCTACGGC CACGTCATGG AAACCATTTC CGATGCCGCC TTCGCACGCT ACCGCTCCCT CATCGACGAT CCGGACCTGC CGGCCTACTT CATGGCCTCC ACTCCCGTGG AACAACTGGG CTCCCTCAAC ATCGGCTCAC GTCCGTCCAA GCGCCCGGAC TCCGGCGCCG GTCTGGGCGG GCTGCGCGCC ATCCCGTGGG TCTTCGGCTG GACCCAGTCG CGGCAGATCG TGCCGGGCTG GTTCGGCGTG GGCTCCGGGT TGAAGGCTGC CCGCGAAGCC GGCAACACGG AACAGCTCAC GGAAATGATG GACCGCTGGC ACTTCTTCCG CTCGGTCCTG TCCAACGTGG AAATGACGCT GGCCAAGACG GATATGGACA TTGCCGGCTA CTACGTCTCG ACGCTGGTAC CTGAAGAGCT GCACCATTTG TTCCGGGCCA TCCGGGAGGA ATACGACCTC ACAGTCGCCG AGATCCAGAA GCTCACCGGC GAGCACGTGC TCCTGGACGC GCAGCCCACC CTCAAGCGCT CCCTGGAGAT CCGCGACCAG TACCTCGATC CCATCAGCTA CCTCCAGGTG GAGCTGATGC GCCGCATGCG GGCCGAGGGT TCCGAGGGCG AGAGCATTTC CGGAGCAGAG ATCGACGAAC GCCTCCAGCG CGCCATGCTC ATCACGGTCA ACGGAGTGGC GGCGGGCCTC CGCAACACCG GATAA
|
Protein sequence | MADTAVNPGA HLAAGTDPAG SDSSADLASE LRADVRRVST LLGESLVRQH GPELLDLVEQ VRLLTKESKE AARGGADATG PWSSHDVVAQ VRELLGSLPI DQATDLVRAF AFYFHLANAA EQVHRVRGLR TRQEKDGWLA KAVEEISGQA GPAVLQEVVN ELDVRPIFTA HPTEASRRSV LDKIRRLSDV LAVSTEEGTS ARRRQDRQLA EVIDQMWQTD ELRQVRPTPV DEARNAIYYL NSILTDAMPE MLSDLSELLG EHGVTLPSQK APIRFGSWIG GDRDGNPNVT AGVTREILQI QNQHAVRISI AMIDELIAIL SNSTALAGAD RELLDSIDTD LKNLPGLDRR VLELNAQEPY RLKLTCIKAK LLNTAKRVAA GSSHEPGRDY TSTAGVLAEL GLLEESLRNH HAGLAADGAL ARVRRAIASF GLHLATLDIR EHADHHHDAV GQLMDRLGGP GVRYSELSRA ERLEVLGAEL GSRRPLSGHP IKLDGVADGT YDVFREIRRA LKTYGPDVIE TYIISMTRGA DDVLAAAVLA REAGLVSLFG DAPYAKIGFA PLLETVEELR ASAEIVDQLL SDPSYRELVR LRGDVQEIML GYSDSNKESG VMTSQWEIHK TQRKLRDVAA KHGVRVRLFH GRGGSVGRGG GPTYDAIMAQ PNGVLEGEIK FTEQGEVISD KYSLPELARE NLELSLAAVM QGSALHRTPR TSEDQRERYG HVMETISDAA FARYRSLIDD PDLPAYFMAS TPVEQLGSLN IGSRPSKRPD SGAGLGGLRA IPWVFGWTQS RQIVPGWFGV GSGLKAAREA GNTEQLTEMM DRWHFFRSVL SNVEMTLAKT DMDIAGYYVS TLVPEELHHL FRAIREEYDL TVAEIQKLTG EHVLLDAQPT LKRSLEIRDQ YLDPISYLQV ELMRRMRAEG SEGESISGAE IDERLQRAML ITVNGVAAGL RNTG
|
| |
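The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 659594
END_BP = 662458
REPORTED_GENE_LENGTH = 2865    # bp
REPORTED_PROTEIN_LENGTH = 954  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    residues = protein_length(length)
    print(length, length == REPORTED_GENE_LENGTH)
    print(residues, residues == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields listed in the record.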