Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1996 |
Symbol | |
ID | 4445475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2250584 |
End bp | 2252452 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639689805 |
Product | hypothetical protein |
Protein accession | YP_831477 |
Protein GI | 116670544 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0906204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTGA AGTCCTTCCC CAAACCCTCC GAGCTTGCGG TCCCCGCCGG CGCCGAGGGC TGGGAAAAGA TCTACCCCTA TTACCTGGTC TTCCAGGACA AGCTCAAGGA ACAGGAAGAC GCCAAGTTCT GGTTCTGCGA CAGCCAGCAC TGGCCCACCG TCTTCAAGCC CTTCGAGACC ATCGGCGGAG AATTCGCTGT CAAGTGCCTG GGCCAGTACA ACGCCCGGCA CCTGATGATC CCCAACGCCA ACGGCATCGA ATTCCGGGTG CACCTGGGAT ACCTCTACAT GTCGCCCATT CCGGTGCCGG AGGACCAGAT CGCCGCCCGC GTGCCCCTGT TCGAACAGCG CGTGGGGCAC TACTTCCAGA ACTGGGACAA GCTCCTCAAG CAGTGGCACG TCAAGGTCAA GGGCACCATC GACGAGATGG AAACCATCTC CTTCCCCGGA CTCCCGGACA TGGTCCCGAT GGAGGACATC CTCTCCGGAA AGGGCAAGGA CGGCTCGGAG AAGCTGCTGG AGAGCTACGA CCGGCTGATC CAGCTGGCCT ATCAGAACTG GCAGTACCAC TTCGAGTTCC TCAACCTCGG CTACATCGCT TACCTGGACT TCTTCAACTT CTGCAAGCAG GTCTTTCCCA ACATCCCGGA CCAGTCCATC GCCACCATGG TGCAGGGTGT GGACATGGAA CTCTTCCGCC CGGACGACGA GCTCAAGCAG CTCGCCAAAC TCGCCGTCGA ACTGGGGCTG CAGCCGCATT TCAGCAACAC GGACGACGTC GACGCCACCT TGCGTGCCAT CGCGGCGACC CCCGGCGGGG ACCGCTGGAC GGCCCAGTAC GAGGCTGCCA AGGACCCGTG GTTCAACTTC ACCGTGGGCA ACGGCTTCTA CGGCCACGAC AAGTATTGGA ACGAACACCA GGAAATCCCG CTCGGCTACA TCGCGGACTA CATCCGCCGC GTGGACGAGG GCCAGGAGAT CATGCGCCCG ATCGAGGCCC TGATCATCGA GCGTGACCGC ATCATCGAGG AATACCGGGA TCTGCTGGAA GGCGAAAACC AGGCGCTCTT CGACGCCAAG CGCGGGCTTG CCGCCACCGC CTACCCGTAC GTGGAGAACC ACAACTTCTA CATCGAGCAC TGGACCATGG GCGTCTTCTG GCGCAAGATT CGCGAACTCA GCCGCATGAT GCAGGCCGAG GGCTTCTGGA CCGAACCGGA CGACCTGCTC TACCTGGGCC GCAACGAGGT CCGCGACGCG CTCTTCGACC TGGTCACCGG CTGGGGTGTC GGCGCCAAAC CGATCGGCCC GGACTACTGG CCGGAGGAGA TCGAACGCCG CCGCGGGATC GTGGACGCGC TCAAGACCGC CCGGCCCGCC CCCGCCCTGA ACACCCCGCC GGAAATCATC ACCGAACCCT TCACCCGGAT GCTCTGGGGC ATCACCACGG AACAGGTCCA GCAGTGGCTG GGCGCAGGCG AGGCCGTCGA AGGCGGCGGC CTGCGCGGCA TGGCCGCCTC GCCCGGCGTC GTGGAAGGCC TGGCCCGCGT GGTCACCGAT GCGGACCAGC TCTCCGAGGT GCAACAGGGC GAGATCCTGG TCGCCACGGT CACCGCGCCG TCCTGGGGCC CGATCTTCGG CAAGATCAAG GCCACGGTCA CGGACATTGG CGGCATGATG AGCCACGCGG CCATCGTGTG CCGCGAGTAC GGCCTCCCGG CCGTGACCGG CACCGGCTCG GCGTCCACCA CCATCAAGAC CGGCCAGCGG CTGCGGGTGG ACGGCACCAA GGGCACGGTC CAGATCCTCG ACGCCGAAGA CGAACTGGTC GTCGCAGGAC CGGGCGCGCA CAGTCACAGC CATGTCTGA
|
Protein sequence | MSLKSFPKPS ELAVPAGAEG WEKIYPYYLV FQDKLKEQED AKFWFCDSQH WPTVFKPFET IGGEFAVKCL GQYNARHLMI PNANGIEFRV HLGYLYMSPI PVPEDQIAAR VPLFEQRVGH YFQNWDKLLK QWHVKVKGTI DEMETISFPG LPDMVPMEDI LSGKGKDGSE KLLESYDRLI QLAYQNWQYH FEFLNLGYIA YLDFFNFCKQ VFPNIPDQSI ATMVQGVDME LFRPDDELKQ LAKLAVELGL QPHFSNTDDV DATLRAIAAT PGGDRWTAQY EAAKDPWFNF TVGNGFYGHD KYWNEHQEIP LGYIADYIRR VDEGQEIMRP IEALIIERDR IIEEYRDLLE GENQALFDAK RGLAATAYPY VENHNFYIEH WTMGVFWRKI RELSRMMQAE GFWTEPDDLL YLGRNEVRDA LFDLVTGWGV GAKPIGPDYW PEEIERRRGI VDALKTARPA PALNTPPEII TEPFTRMLWG ITTEQVQQWL GAGEAVEGGG LRGMAASPGV VEGLARVVTD ADQLSEVQQG EILVATVTAP SWGPIFGKIK ATVTDIGGMM SHAAIVCREY GLPAVTGTGS ASTTIKTGQR LRVDGTKGTV QILDAEDELV VAGPGAHSHS HV
|
| |