Gene Arth_2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2087 
Symbol 
ID4445391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2355646 
End bp2356656 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content63% 
IMG OID639689895 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_831567 
Protein GI116670634 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.313178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGACCC GTATTGGTAT CAACGGCTTT GGCCGCATTG GCCGCAATTA CTTCCGTGCT 
GCACTGGCAC AGGGCGCTGA CCTCGAGATC GTTGCAGTCA ACGACCTCAC CAGCCCCGAA
GCGCTGGCCC ACCTCTTCAA GTACGACTCC GTAGGCGGCC GCCTCAAGGA GACCATCGAG
GTCAAGGACG GCAACATCGT CGTCAACGGC AACGTCGTTA AGGTTCTCGC CGAGCGCGAC
CCCGCGAACC TCCCCTGGGG AGAGCTGGGC GTTGACATCG TCATCGAGTC CACCGGCTTC
TTCACCAAGG CCGCTGCCGC CAAGAAGCAC CTCGACGCCG GCGCCAAGAA GGTCCTGATC
TCCGCCCCGG CTTCGGACGA GGACATCACC ATCGTGATGG GCGTCAACCA CGAGCTTTAC
GACAACGCCA AGCACCACAT CATCTCCAAC GCATCCTGCA CTACCAATTG CCTCGGCCCG
CTGGCCAAGG TCATCAACGA CGAGTTCGGC ATCGAACGCG GCCTCATGAC GACGGTCCAC
GCGTACACGG CCGACCAGAA CCTGCAGGAC GGTCCGCACA ACGACCTCCG CCGTGCCCGC
GCCGCCGCCA TCAACATGGT CCCCACCTCC ACCGGTGCGG CCAAGGCAAT CGGCCTGGTG
CTTCCGGAAC TCAAGGGCAA GCTGGACGGC TACGCCATCC GCGTCCCCGT CCCCACCGGC
TCCGCCACCG ACCTCACGGT CACCGTTTCC CGTGAGACCA CCGTTGAGGA AGTCAACGCA
GCCCTGAAGA AGGCATCCGA GTCCGAGTCG CTCCAGGGCT TCCTGACCTA CACGGATGAG
CCGATCGTCT CATCGGACAT CGTGGGCGAC CCGGCGTCGT CGATTTTCGA CTCCGGCCTG
ACGAAGGTCA TCGGCAACCA GGTCAAGGTT GTTTCCTGGT ATGACAACGA ATGGGGCTAC
TCGAACCGCC TGGTCGACCT CACGGAGCTC GTCGCATCCA AGCTGGGCTA G
 
Protein sequence
MTTRIGINGF GRIGRNYFRA ALAQGADLEI VAVNDLTSPE ALAHLFKYDS VGGRLKETIE 
VKDGNIVVNG NVVKVLAERD PANLPWGELG VDIVIESTGF FTKAAAAKKH LDAGAKKVLI
SAPASDEDIT IVMGVNHELY DNAKHHIISN ASCTTNCLGP LAKVINDEFG IERGLMTTVH
AYTADQNLQD GPHNDLRRAR AAAINMVPTS TGAAKAIGLV LPELKGKLDG YAIRVPVPTG
SATDLTVTVS RETTVEEVNA ALKKASESES LQGFLTYTDE PIVSSDIVGD PASSIFDSGL
TKVIGNQVKV VSWYDNEWGY SNRLVDLTEL VASKLG