Gene Arth_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1688 
Symbol 
ID4445794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1879993 
End bp1881324 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content67% 
IMG OID639689509 
Producttryptophan synthase subunit beta 
Protein accessionYP_831182 
Protein GI116670249 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.998465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGACG CGCCAACAAC CGGCTCTGAT GAGGGCACCG CGGACGCATT TCTGCAAGGA 
GACCGGTCCC TGCGCCACGC GCCGGGTCCG TACTTCGGCT CCTACGGCGG GCGCTGGATG
CCCGAATCCC TTATCGCGGC CCTGGATGAG CTGGAAGACA CTTTCGAAAA GGCCAAGGCC
GACCCGGAAT TCGTCGCCCA GATCAAGGAC CTGAACAAGA ACTACTCCGG CCGTCCGTCC
CTGCTGACCG AGGCCAAGCG CTTCGCGGAG CACGCCGGGG GAGTCCGCAT CTTCCTCAAA
CGCGAGGACC TGAACCACAC CGGTTCGCAC AAGATCAACA ACGTCCTGGG CCAGGCCCTG
CTGGCCAAGC GCATGGGCAA GACCCGCGTG ATCGCCGAGA CCGGTGCGGG CCAGCACGGC
GTAGCCAGCG CAACGGCCGC CGCCCTGCTG GGCCTCGAGT GTGTGGTGTA CATGGGCGCC
GAGGACTGCC GGCGCCAGGC CCTGAACGTG GCCCGCATGG AGCTCCTGGG CGCCACGGTC
ATTCCGGTGA CCAGCGGATC GCAGACGCTC AAGGACGCCA TCAACGAGGC GCTCCGCGAC
TGGGTGGCGA ACGTGGACCA CACCCACTAC CTGCTCGGCA CGGCCGCCGG TGCCCACCCG
TTCCCGGCGA TGGTGCGGTA CTTCCACGAG GTCATCGGTG AAGAAGCCCG CGCCCAGATC
CTGGAACAGG CCGGCAGGCT GCCGGACGCC GTCTGTGCCT GCATCGGCGG CGGCTCCAAC
GCGATCGGCA TCTTCCATGG CTTCCTGGAC GATCCTTCCG TGCGGATTTA CGGCTTCGAG
GCCGGCGGCG ACGGCGTGGA AACCGGCCGG CACGCCGCCA CCATCAGCCT GGGCAAGCCG
GGTGTGCTCC ACGGTGCGCG CTCGTACCTG ATGCAGGACG ACGACGGGCA GACCATCGAG
TCGCACTCCA TCTCCGCGGG CCTGGACTAT CCCGGCGTCG GCCCGGAGCA TGCCTACCTT
TCGGACATCG GCCGCGTCAG CTACGAACCC ATCACGGATG CCGAAGCCAT GGATGCCTTC
CGGGTCCTGT GCCGGACCGA GGGCATCATT CCGGCCATCG AATCGGCACA TGCCCTGGCG
GGAGCCATCA AGGTGGGGCA GCGCCTCGCC GCCGAAGCTG CAGCCGAAGG CCAGCCCGCG
GACAGCAAGA TCGTGATCGT TAACCTCTCC GGCCGCGGGG ACAAGGACGT GGCCACGGCC
GCCGAATGGT TCGACCTGCT GGACAAGGAT TCCGTTGAGG CCGAGATCGG CAAAGAAGGG
GAACAGCTGT GA
 
Protein sequence
MVDAPTTGSD EGTADAFLQG DRSLRHAPGP YFGSYGGRWM PESLIAALDE LEDTFEKAKA 
DPEFVAQIKD LNKNYSGRPS LLTEAKRFAE HAGGVRIFLK REDLNHTGSH KINNVLGQAL
LAKRMGKTRV IAETGAGQHG VASATAAALL GLECVVYMGA EDCRRQALNV ARMELLGATV
IPVTSGSQTL KDAINEALRD WVANVDHTHY LLGTAAGAHP FPAMVRYFHE VIGEEARAQI
LEQAGRLPDA VCACIGGGSN AIGIFHGFLD DPSVRIYGFE AGGDGVETGR HAATISLGKP
GVLHGARSYL MQDDDGQTIE SHSISAGLDY PGVGPEHAYL SDIGRVSYEP ITDAEAMDAF
RVLCRTEGII PAIESAHALA GAIKVGQRLA AEAAAEGQPA DSKIVIVNLS GRGDKDVATA
AEWFDLLDKD SVEAEIGKEG EQL