Gene Arth_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1972 
Symbol 
ID4445486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2225118 
End bp2227535 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content66% 
IMG OID639689781 
Productphosphoenolpyruvate synthase 
Protein accessionYP_831453 
Protein GI116670520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID[TIGR01418] phosphoenolpyruvate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAG ACATCCTGTG GTTCTCAGAA CTCGGACTCA AGGACCTGGA CCGGGTGGGC 
GGAAAAAACG CCTCCCTTGG TGAGATGGTG CAGAACCTGA CCTCTGCCGG CGTCCAGGTC
CCTGACGGCT TCGCCACGAC GGCCGACGCC TACCGCACCT TCCTGGCTGA CTCCGGCCTG
GACCAGAAGA TCGCCGACCG GCTGGTCGGG CTGGACACCG ACGACGTGAC GGCACTCGCC
GCGGCCGGCC AGGAAATCCG GACCCTGATG CGCGAGACGC CCTTCCTGCC GGACTTTGAA
GCGCAGATCC GCGCCTCGTA CCAGGAACTG GTGGACAAGC ACGGGGGCTC CGATGACCTG
TCCTGGGCCG TCCGTTCCAG CGCAACGGCG GAAGACCTTC CCGATGCCTC CTTCGCCGGG
CAGCAGGAGA CCTTCCTGAA CGTCCGCGGG ATCGAGAACA TCCTGCTCGC CATCAAGGAC
GTCTTCGCGT CCCTCTACAA CGACCGCGCC ATCGCCTACC GGGTGCACCA TAAGTTCGAA
CACGCCGAGG TCGCGCTGTC GGCCGGCATC CAGCGGATGG TTCGTTCCGA CGTGGGCGCT
TCCGGTGTCA TGTTCACCAT GGACACCGAG TCCGGGTTCA AGGACGCTGT CTTCGTCACC
TCCTCGTACG GCCTCGGCGA AGCCGTGGTC CAGGGCGCCG TCAACCCCGA CGAGTTCTAC
GTCTACAAGC CCGCCCTGCA GGCCGGCCGA CCCGCCATCC TCAAGCGCGG ACTGGGCGAA
AAAGCCCTGC AGATGACCTA CACGAGCAGC CGGGAGATCG GCCGCACCAT CGACTTCGTA
CCGGTTGAGG CTTCGCTGCG GAACCGCTTC AGCCTCACTG ACGACGACGT CGAGCAGCTC
GCCCGCCACG CGGTCGCCAT CGAAAACCAC TATGGCCGTC CGATGGACAT CGAATGGGGC
AAGGACGGGA TTGACGGCGG CCTGTACATC CTGCAAGCGC GCCCCGAGAC CGTGCAGTCC
CGCCGTGCGT CCGGCAGCCT GAGCCGTTTC CGCCTCAACG GGACCGGCCG GGTGCTCGTG
GAGGGCCGCG CCATCGGCCA GCGCATCGGT GCCGGCAGCG TCCGCATCCT TACCGCGATC
GACCAGATGG CCGCTTTCAA GACCGGCGAC GTCCTCGTCG CCGACATGAC CGACCCGGAC
TGGGAACCGA TCATGAAACG TGCCTCCGCG ATCGTCACCA ACCGCGGCGG ACGCACGTGC
CACGCGGCCA TCATTGCCCG CGAACTGGGG ATTCCCGCCG TCGTCGGCAC CGGGAACGCC
ACTGACGCAC TCTCGGACGG TCTCGAAGTC ACCGTCTCCT GCGCCGACGG TGAAACCGGC
GTCATCTACG AAGGGCTCCT GGACTTCAGC GTCGAGGAAA CCGAGATCAC CCAGCTGCCG
GAGGCTCCGG TCAAGGTCAT GATGAACGTC GGCACGCCCG AACAGGCCTT CACGTTCGCC
CAGCTGCCGA ACGACGGAGT AGGCCTAGCC CGGCTCGAAT TCATCATCAA CCGCCAGATC
GGGATCCACC CCAAAGCACT GTTGAACCTG GAGGACCAGC CGGCGGAGGT CATTGCGGAG
ATCCGGGAGC GGATCGCCGC CTACGACAGC CCGCGCGACT ACTACATCAA GCGCCTTGCC
GAAGGCGTGG CCACCATCGC CGCGGCCTTC GCGCCGGAGC CGGTGATTGT CAGGATGTCC
GACTTCAAGT CCAACGAGTA CGCCAACCTG ATCGGCGGCC CCGCGTACGA ACCGCATGAA
GAGAACCCGA TGCTCGGCTT CCGCGGCGCG TCACGCTACC TGGAACCGTC CTTCCGGGAC
TGCTTCGACC TGGAGTGCGA GGCCCTGTCC TTCGTGCGCA ACGAGATGGG ACTGACCAAC
GTCAAACTGA TGATCCCCTT CGTGCGGACC GTGGACGAGG CCAGTGGCGT CATCGACCTG
CTCGCCGAGA ACGACCTGCG CCGCGGCGAA AATGGCCTCG AGGTCATCAT GATGTGCGAG
ATTCCATCCA ACGCGCTGCT CGCCGACGAT TTCCTGGACT ATTTCGACGG ATTCTCCATC
GGTTCCAACG ACATGACCCA GCTGGCCCTT GGCCTGGACC GGGACTCCGC GATTGTCTCG
GGCGGCTTCG ATGAACGCGA CCCCGCCGTC AAGAAGCTCC TGAGCATGGC GATCAAGGCG
TGCAAGGCGC GCGGCAAATA TGTGGGCATC TGCGGCCAGG GGCCGAGCGA CCATGCGGAC
TTCGCCGAAT GGCTGGTGGC GGAAGGCATC GACTCGGTTT CGTTGAACCC GGACACCGTG
GTGGACACCT GGCTCCGGCT CGCCGGCGCG GCAGGCCTGG CCGGGGCCGG AGCGGCCGTC
GGCGCAGAAG CGCGCTGA
 
Protein sequence
MTTDILWFSE LGLKDLDRVG GKNASLGEMV QNLTSAGVQV PDGFATTADA YRTFLADSGL 
DQKIADRLVG LDTDDVTALA AAGQEIRTLM RETPFLPDFE AQIRASYQEL VDKHGGSDDL
SWAVRSSATA EDLPDASFAG QQETFLNVRG IENILLAIKD VFASLYNDRA IAYRVHHKFE
HAEVALSAGI QRMVRSDVGA SGVMFTMDTE SGFKDAVFVT SSYGLGEAVV QGAVNPDEFY
VYKPALQAGR PAILKRGLGE KALQMTYTSS REIGRTIDFV PVEASLRNRF SLTDDDVEQL
ARHAVAIENH YGRPMDIEWG KDGIDGGLYI LQARPETVQS RRASGSLSRF RLNGTGRVLV
EGRAIGQRIG AGSVRILTAI DQMAAFKTGD VLVADMTDPD WEPIMKRASA IVTNRGGRTC
HAAIIARELG IPAVVGTGNA TDALSDGLEV TVSCADGETG VIYEGLLDFS VEETEITQLP
EAPVKVMMNV GTPEQAFTFA QLPNDGVGLA RLEFIINRQI GIHPKALLNL EDQPAEVIAE
IRERIAAYDS PRDYYIKRLA EGVATIAAAF APEPVIVRMS DFKSNEYANL IGGPAYEPHE
ENPMLGFRGA SRYLEPSFRD CFDLECEALS FVRNEMGLTN VKLMIPFVRT VDEASGVIDL
LAENDLRRGE NGLEVIMMCE IPSNALLADD FLDYFDGFSI GSNDMTQLAL GLDRDSAIVS
GGFDERDPAV KKLLSMAIKA CKARGKYVGI CGQGPSDHAD FAEWLVAEGI DSVSLNPDTV
VDTWLRLAGA AGLAGAGAAV GAEAR