Gene Arth_3769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3769 
Symbol 
ID4447854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4246881 
End bp4249079 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content67% 
IMG OID639691593 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_833244 
Protein GI116672311 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCGC AAGCACCGGA CAACGCTTCA GGACCAGCGG CCACGCCCCG GGACCCCGCA 
ACCGGGCCTG CCGCGTCCGG GCGCACGTGG AGCCTGCGGC GCGAATGGTC CAGGGCGTTC
CTGCTGATGC TCGCCGCACT GCTGATCGGC GCCGTGGTCA CGATCGTGGG TGTGCGCGGC
CTGATGGACC AGGTGCAGGA AACGTCCGGC CGGTTGCAGC TCGAGCTCGA AAAAGTAGAG
ACCCTCCGGT CGGCACTCGA CAGCCACGAG CAATTGGGGC ACCAGTTGCT CTCGAGTGCC
CCCGTGGACC GCTCCGCTTT CCTCCGGCAG CAGGAGGAAA TAACCCGCGT CTTCGAGGAC
GTCGCCGCCG TCCTGCCGGC TGAGACAGAA GTGCGGGAAA CGATTGCCGC GGCCCGGCAG
TCGTGGGAGG AGACCTTGGC GGAACACGGC CTGTGGGGCA GCGGGGTGCT TTCACTCCGG
GGCAGCCACG TGGAGGAGAC CCCGGCATTG GCCGCCTCCA GCGCAGGAAT CCGCAGCAAG
CTGGCGGACA TCCGGCGCTA CTCCCTGGCG GAAATGGACA AGGGACTCGC GCACAGCGCC
GAACTCGAGC GGTTCCTGGT TGTCGCGCGC AGCTTCCTGT TCGTGGTGGC AGTGGGCGCT
ACGGTCTACT TCCGCCGCAG GATGGTCAAG GACCTCATGC GCCCGGTAGG GAGCCTGCAC
CAGGGTGTCC GCAAGCTGCA GGCAGGCAAT TACAACCACC GCATCGAGGT CTTCCGCCGC
GACGAACTCG GCGAACTGGC GGAGGCATTC AACGGCATGG CCGCCGCCGT CCACAGCAGC
CATGAGACAC TTACCCACCG GGCGACGCAC GACCCCCTCA CCGGCCTGGC AAACCGGGCG
GCCCTGATGG AACGCCTCAC GGCATCCTTT GGGGCCGGAA GCACCCGCAG GAACCGGAAC
GAGGGGCTGC TGTTCATCGA CATCGACGAC TTCAAGGAGG TCAACGATTC ACTGGGACAT
GAGCGGGGTG ATGCCCTGCT CATCCAGCTG GCCGCCCGCC TCAAGGGCTG CGTCCGCTCC
GACGACGTGG TTGCCCGGCT CGGCGGCGAC GAGTTCGCGA TCGTTGTGAT GGACGACGAC
GCCGGCTCGG TCACCGCCTG GATCGCCGAT CGGATCCACC GGGCGTTGCG CGCACCGTTC
TTCGTCGGCG ACGACCGGCT GACCGTCACG GCCAGCATGG GCGCGGCACA ACGGCGTCCG
GAAACCACAG ACGCTGCCGA GCTGCTGCGG CAGGCGGACT TCGCGATGTA TCTGGCCAAA
CACGGCGGAA AGGCACGTTT CCAGCTTTTC GACGCCGAGG GCTACGACCA TATGACGTAC
CGCGCCGCCC TCAAACGGGA CTTGGCCTCC GCCGTGCCCG CCGGCCAGCT CCGCCTCGAG
TACCAGCCCG TTGCCGATCT GAACAGCGGA GCCATCCTCG GCGTCGAGGC TCTGGTGCGC
TGGGAGCACC CCACGCTGGG GCTGCTTGCG CCGTCGGAGT TCATTCCGCT CGCCGAGGAG
ACGGGCGACA TCGACGCCGT CGGCTGCTGG GTGCTGGACA ACGCGAGCCG CCAGGCGGCC
AGCTGGCGGA AATCCCTGCC GCACTGCGGA GACCTCTGGA TTGCCATCAA CCTCTCCACC
ATCCAGCTGC CCAACCCGCG GAACCTGGTC GCCATCACGC GCATCCTTGC CGACCCCGCG
GCACAGGCGG AAAAGGTGGT ACTCGAAGTC ACCGAAACAG CCCTTGCTGG CAGCACGGAC
GGCGGCATCG CCGCACTCAA GGCCCTGAAG GAGACCGGTG TGCGCGTGGC TATCGACGAC
TTTGGGACGG GATATTCCTC GCTGAGCTCC TTGGCGGTGC TGCCGGCGGA CATCCTCAAG
ATTGACCGTT CGTTCCTTGG ACGGCAATCG TCCGAGGCGC AATCGGCGGC GATGCTCGAA
GGCATCCTGG GGCTGGCCCG CATGCTCTCC CTCGACGTGA TCGCCGAAGG GGTGGAGGAG
CCCGGACAGC TGGACCTTCT CCGCGCCCTG GATTGCCCGA TGGGCCAGGG CTACTTCCTG
GCCCGCCCGG GTTCCGCCGA GGCGATCGAG ACGCTCCTGG CTTCGGGTGC CCGCCTTCAG
CCCCGCCCGG CAGCACTCGA GCAGGCCGTC GATTCATAA
 
Protein sequence
MDAQAPDNAS GPAATPRDPA TGPAASGRTW SLRREWSRAF LLMLAALLIG AVVTIVGVRG 
LMDQVQETSG RLQLELEKVE TLRSALDSHE QLGHQLLSSA PVDRSAFLRQ QEEITRVFED
VAAVLPAETE VRETIAAARQ SWEETLAEHG LWGSGVLSLR GSHVEETPAL AASSAGIRSK
LADIRRYSLA EMDKGLAHSA ELERFLVVAR SFLFVVAVGA TVYFRRRMVK DLMRPVGSLH
QGVRKLQAGN YNHRIEVFRR DELGELAEAF NGMAAAVHSS HETLTHRATH DPLTGLANRA
ALMERLTASF GAGSTRRNRN EGLLFIDIDD FKEVNDSLGH ERGDALLIQL AARLKGCVRS
DDVVARLGGD EFAIVVMDDD AGSVTAWIAD RIHRALRAPF FVGDDRLTVT ASMGAAQRRP
ETTDAAELLR QADFAMYLAK HGGKARFQLF DAEGYDHMTY RAALKRDLAS AVPAGQLRLE
YQPVADLNSG AILGVEALVR WEHPTLGLLA PSEFIPLAEE TGDIDAVGCW VLDNASRQAA
SWRKSLPHCG DLWIAINLST IQLPNPRNLV AITRILADPA AQAEKVVLEV TETALAGSTD
GGIAALKALK ETGVRVAIDD FGTGYSSLSS LAVLPADILK IDRSFLGRQS SEAQSAAMLE
GILGLARMLS LDVIAEGVEE PGQLDLLRAL DCPMGQGYFL ARPGSAEAIE TLLASGARLQ
PRPAALEQAV DS