Gene Arth_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0225 
Symbol 
ID4447316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp235433 
End bp236530 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content70% 
IMG OID639688021 
ProductPDZ/DHR/GLGF domain-containing protein 
Protein accessionYP_829726 
Protein GI116668793 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTCAG CGCGAGCCGG GACCGGCGTC CGGGCCGGCG CAGCCCTGGC CCTCTTGGCA 
ATCCTGGGCT CCGCCGTGAC CGGATGCACG GGGGCGGACC AGCGCCCGGC CCCTGCGTCC
TCCTCAGCTG CAGGATCCCC TGCTACAGGC TCCTCCGGGA CAGCCCCCGC TACCCGGCCG
GGCACCCCGC AGGCTGCCGC GGACATCCCG TCCATCGTGG AGAACGTCCA GCCGTCCGTT
GTCACGGTGC TCACCCACGG CGGACTGGGC AGCGGCGTGG TCTTCGCGGC GGAGGGACTG
ATCCTCACCA ACGAGCATGT GGTGCGGGGC AACACCGAGG TGGAGATCGC CTTTGCCGAC
GGCCAACGGG TTGCAGGCAC AGTCAAGGCC ACGGATGCCA TTTCGGACCT GGCCCTCGTG
GAGGCGAAGC GCACAGGGCT TCCTGCCGCA AAGTTCCAGT CCGAACTCCC CCGGGTGGGC
GAACTGGCAA TCGTGATCGG CTCTCCGCTG GGTTTCGAAA ACACGGCGAC GGCGGGCATC
ATCTCCGGTT TGCACCGCGA GATCCCGGGG TCGGCGGCCA GCAGCCAGTC GCTGGTGGAC
CTGATCCAGA CTGATGCGGC CATCAGCCCC GGGAACTCCG GCGGCGCGGT GGTGAATTCC
CGGGGCGAGG TGATCGGCAT CAGCGAGGCG TACATCCCGC CGCAGTCCGG GGCGGTGGCG
CTGGGCTTCG CGATCCCCGC GGCGACCGCC GTCCGGGTAG CCGGGCAGCT GCGCGAGGAC
GGGACAGCGG ACCACGCCTT CATCGGACTC CGCCCGGGCG AGATCACCTC GCAGATCGCA
GACACGCTGG GACTGGAAAA CACCCGCGGC GCCCTGGTGC TCTCGGTGGT GGACGGCGGC
CCCGCGGACC GTGCGGGCAT CCGGCCCGGG GACGTCCTTG TCTCCTTGGA CGGCAAGGAA
CTGGCCTCAC CCGAGGACCT GCTCGCCGAA CTCCGCGGCA AGAACCCGGA CCAGACCGTT
AACGTCGGCT ACCGCCGGGG CACGGAAGCC AAGGAAGCCA AGGTCACCCT CGCTGCCCGC
CCCGCCTCGG GGGGCTGA
 
Protein sequence
MISARAGTGV RAGAALALLA ILGSAVTGCT GADQRPAPAS SSAAGSPATG SSGTAPATRP 
GTPQAAADIP SIVENVQPSV VTVLTHGGLG SGVVFAAEGL ILTNEHVVRG NTEVEIAFAD
GQRVAGTVKA TDAISDLALV EAKRTGLPAA KFQSELPRVG ELAIVIGSPL GFENTATAGI
ISGLHREIPG SAASSQSLVD LIQTDAAISP GNSGGAVVNS RGEVIGISEA YIPPQSGAVA
LGFAIPAATA VRVAGQLRED GTADHAFIGL RPGEITSQIA DTLGLENTRG ALVLSVVDGG
PADRAGIRPG DVLVSLDGKE LASPEDLLAE LRGKNPDQTV NVGYRRGTEA KEAKVTLAAR
PASGG