Gene Arth_1996 details


Gene Information

Locus tag: Arth_1996
Symbol:
ID: 4445475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2250584
End bp: 2252452
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 65%
IMG OID: 639689805
Product: hypothetical protein
Protein accession: YP_831477
Protein GI: 116670544
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID:
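
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2250584
end_bp = 2252452

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1869 622
```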


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0906204
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCTGA AGTCCTTCCC CAAACCCTCC GAGCTTGCGG TCCCCGCCGG CGCCGAGGGC 
TGGGAAAAGA TCTACCCCTA TTACCTGGTC TTCCAGGACA AGCTCAAGGA ACAGGAAGAC
GCCAAGTTCT GGTTCTGCGA CAGCCAGCAC TGGCCCACCG TCTTCAAGCC CTTCGAGACC
ATCGGCGGAG AATTCGCTGT CAAGTGCCTG GGCCAGTACA ACGCCCGGCA CCTGATGATC
CCCAACGCCA ACGGCATCGA ATTCCGGGTG CACCTGGGAT ACCTCTACAT GTCGCCCATT
CCGGTGCCGG AGGACCAGAT CGCCGCCCGC GTGCCCCTGT TCGAACAGCG CGTGGGGCAC
TACTTCCAGA ACTGGGACAA GCTCCTCAAG CAGTGGCACG TCAAGGTCAA GGGCACCATC
GACGAGATGG AAACCATCTC CTTCCCCGGA CTCCCGGACA TGGTCCCGAT GGAGGACATC
CTCTCCGGAA AGGGCAAGGA CGGCTCGGAG AAGCTGCTGG AGAGCTACGA CCGGCTGATC
CAGCTGGCCT ATCAGAACTG GCAGTACCAC TTCGAGTTCC TCAACCTCGG CTACATCGCT
TACCTGGACT TCTTCAACTT CTGCAAGCAG GTCTTTCCCA ACATCCCGGA CCAGTCCATC
GCCACCATGG TGCAGGGTGT GGACATGGAA CTCTTCCGCC CGGACGACGA GCTCAAGCAG
CTCGCCAAAC TCGCCGTCGA ACTGGGGCTG CAGCCGCATT TCAGCAACAC GGACGACGTC
GACGCCACCT TGCGTGCCAT CGCGGCGACC CCCGGCGGGG ACCGCTGGAC GGCCCAGTAC
GAGGCTGCCA AGGACCCGTG GTTCAACTTC ACCGTGGGCA ACGGCTTCTA CGGCCACGAC
AAGTATTGGA ACGAACACCA GGAAATCCCG CTCGGCTACA TCGCGGACTA CATCCGCCGC
GTGGACGAGG GCCAGGAGAT CATGCGCCCG ATCGAGGCCC TGATCATCGA GCGTGACCGC
ATCATCGAGG AATACCGGGA TCTGCTGGAA GGCGAAAACC AGGCGCTCTT CGACGCCAAG
CGCGGGCTTG CCGCCACCGC CTACCCGTAC GTGGAGAACC ACAACTTCTA CATCGAGCAC
TGGACCATGG GCGTCTTCTG GCGCAAGATT CGCGAACTCA GCCGCATGAT GCAGGCCGAG
GGCTTCTGGA CCGAACCGGA CGACCTGCTC TACCTGGGCC GCAACGAGGT CCGCGACGCG
CTCTTCGACC TGGTCACCGG CTGGGGTGTC GGCGCCAAAC CGATCGGCCC GGACTACTGG
CCGGAGGAGA TCGAACGCCG CCGCGGGATC GTGGACGCGC TCAAGACCGC CCGGCCCGCC
CCCGCCCTGA ACACCCCGCC GGAAATCATC ACCGAACCCT TCACCCGGAT GCTCTGGGGC
ATCACCACGG AACAGGTCCA GCAGTGGCTG GGCGCAGGCG AGGCCGTCGA AGGCGGCGGC
CTGCGCGGCA TGGCCGCCTC GCCCGGCGTC GTGGAAGGCC TGGCCCGCGT GGTCACCGAT
GCGGACCAGC TCTCCGAGGT GCAACAGGGC GAGATCCTGG TCGCCACGGT CACCGCGCCG
TCCTGGGGCC CGATCTTCGG CAAGATCAAG GCCACGGTCA CGGACATTGG CGGCATGATG
AGCCACGCGG CCATCGTGTG CCGCGAGTAC GGCCTCCCGG CCGTGACCGG CACCGGCTCG
GCGTCCACCA CCATCAAGAC CGGCCAGCGG CTGCGGGTGG ACGGCACCAA GGGCACGGTC
CAGATCCTCG ACGCCGAAGA CGAACTGGTC GTCGCAGGAC CGGGCGCGCA CAGTCACAGC
CATGTCTGA
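
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the values in the record. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the sequence are used as sample input, so the result will not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, used only as sample input.
first_line = "ATGTCCCTGA AGTCCTTCCC CAAACCCTCC GAGCTTGCGG TCCCCGCCGG CGCCGAGGGC"
last_line = "CATGTCTGA"

print(len(clean_sequence(first_line)))                    # 60
print(round(gc_fraction(clean_sequence(last_line)), 3))   # 0.444
```

Passing the complete sequence text to `clean_sequence` gives a string whose length and GC fraction can be compared with the gene length and GC content fields.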
 
Protein sequence
MSLKSFPKPS ELAVPAGAEG WEKIYPYYLV FQDKLKEQED AKFWFCDSQH WPTVFKPFET 
IGGEFAVKCL GQYNARHLMI PNANGIEFRV HLGYLYMSPI PVPEDQIAAR VPLFEQRVGH
YFQNWDKLLK QWHVKVKGTI DEMETISFPG LPDMVPMEDI LSGKGKDGSE KLLESYDRLI
QLAYQNWQYH FEFLNLGYIA YLDFFNFCKQ VFPNIPDQSI ATMVQGVDME LFRPDDELKQ
LAKLAVELGL QPHFSNTDDV DATLRAIAAT PGGDRWTAQY EAAKDPWFNF TVGNGFYGHD
KYWNEHQEIP LGYIADYIRR VDEGQEIMRP IEALIIERDR IIEEYRDLLE GENQALFDAK
RGLAATAYPY VENHNFYIEH WTMGVFWRKI RELSRMMQAE GFWTEPDDLL YLGRNEVRDA
LFDLVTGWGV GAKPIGPDYW PEEIERRRGI VDALKTARPA PALNTPPEII TEPFTRMLWG
ITTEQVQQWL GAGEAVEGGG LRGMAASPGV VEGLARVVTD ADQLSEVQQG EILVATVTAP
SWGPIFGKIK ATVTDIGGMM SHAAIVCREY GLPAVTGTGS ASTTIKTGQR LRVDGTKGTV
QILDAEDELV VAGPGAHSHS HV
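
The protein sequence can be related back to the gene sequence by translating it codon by codon with translation table 11. The sketch below is an illustration rather than the method used to produce this record: the `translate` helper is hypothetical, it stops at the first stop codon, and it does not handle alternative start codons (this gene begins with ATG, so that case does not arise here). For amino acid and stop assignments, table 11 matches the standard genetic code.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
# Table 11 differs from the standard code only in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 9 bases of the gene sequence.
print(translate("ATGTCCCTGA AGTCCTTCCC CAAACCCTCC"))  # MSLKSFPKPS
print(translate("CATGTCTGA"))                         # HV
```

The two outputs match the first ten and last two residues of the protein sequence above.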