Gene Arth_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3854 
Symbol 
ID4447553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4336688 
End bp4337815 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content65% 
IMG OID639691678 
Productdiguanylate phosphodiesterase 
Protein accessionYP_833329 
Protein GI116672396 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATGG ATGAGACTTC TTTTCCAGCA TTGACCGATG GAGAATCCAC TGCTTTCCGG 
GGACCGCAGT CGGTGCCGCC TTCGGACGTG AGTGATCCGA CGATCCGTGG ACAGGCGCGG
GACATCATCG AGTCTGTACT GGGTGACGCC ACTCCGCAGA GCGAGTGGGC TCGTGAACAG
TTGCGAAGCC GGATGGAATC GTACCCGGGT AATCCCGAAC GCGCCCTCCT TGAGCACCTC
ATGGCCACCC GGAGCATCAC GGACGAGCAG CCGGAAGAAA CCGAGGCCAC CCTTCCCAGC
CCCGACTTGC CGACTCCCGA TGAGGACTAC GGCAACACGG TGCTGTTCAC CCGCCGGAGC
AGGCGCCGCA TCGAGGCGAT CCTCGGCGAC AGGATGCTCC TCACCGCGTT CCAGCCCATC
CACGAGCTTC GCAGCCGGAA CGTCGTTGGT GTTGAGGCGC TGACGCGTTT CGTCAGTGAC
GACGGCGCGA GTGCGGACCA CTGGTTCAAT GAGGCTGCTG CCGTAGGCCT CGGACCCGAC
CTTGAATTCG CTGCCCTGCA GGCGGCACTC GTTGCCGCCG AACAACTGCC GGCCCACGTC
TACGTGGCTC TGAACCTGTC ACCGGTCACC TGCCTGGACC CCCGGCTCCG GGCGTTCGTG
GAGCAATCCC AACTGGCCGT GGACCGGATC GTCATCGAGT TGACGGAGCG GCTTGCCGAG
CATGAATACG ATCCCGTCGT GGCAGCGCTG GCACCCCTCC GCTTGCGCGG ACTGCGGGTA
GCTGTCGACG GCGCCGGAGC GGGTTTCGGC TCGATGAGCC AGGTCACGCA CCTCAGTCCG
GACATCATCA AGCTCGACCG CAGCCTCATC GCGGGAATCG ACCATGCCGC GGGCCAGAAG
ACCCTGGGCG CGGCCATGGT GGAGTTCGCC CGGCAAATCG GCGCGGACCT GGTTGCCGAA
GGAATCGAAA CCCAGGCCGA GCTCACCTCG GTGATGGACC TTGGGATGGC CTACGGGCAG
GGATACCTCC TGGGCCGTCC CTCGGTCCAG CCCCTCGACT GGGCCGCCTG GCGAACCTCC
TCCGATCACG AAGCCTCCAT TTCGGGGTCC GCCGGCCCGG CCAACTAG
 
Protein sequence
MSMDETSFPA LTDGESTAFR GPQSVPPSDV SDPTIRGQAR DIIESVLGDA TPQSEWAREQ 
LRSRMESYPG NPERALLEHL MATRSITDEQ PEETEATLPS PDLPTPDEDY GNTVLFTRRS
RRRIEAILGD RMLLTAFQPI HELRSRNVVG VEALTRFVSD DGASADHWFN EAAAVGLGPD
LEFAALQAAL VAAEQLPAHV YVALNLSPVT CLDPRLRAFV EQSQLAVDRI VIELTERLAE
HEYDPVVAAL APLRLRGLRV AVDGAGAGFG SMSQVTHLSP DIIKLDRSLI AGIDHAAGQK
TLGAAMVEFA RQIGADLVAE GIETQAELTS VMDLGMAYGQ GYLLGRPSVQ PLDWAAWRTS
SDHEASISGS AGPAN