Gene Daro_3954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3954 
Symbol 
ID3567492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4249835 
End bp4251838 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content53% 
IMG OID637682428 
ProductPAS:GGDEF:hemerythrin HHE cation binding region 
Protein accessionYP_287152 
Protein GI71909565 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain
[TIGR02481] hemerythrin-like metal-binding domain 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.567366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCAG ACATGAACTC ATTGCTCGAC ATTCGGACGC TGATACTGGT CATCACCATT 
GTCGTGACCT GTCGCACGTT CATATTAGGG TACGTCTGGA AAATCACCCG TCCCTATCGG
CCCGCAAGCT ATTGGGCTAT CGGGTCGCTG TTGATCAGCG TCGGCGAGTT GTTGGTCGGC
CTGAGGGATC ACATATCTCC TGTTGTTTCA ATCCTGCTCG CGCAGTGCCT GATCATGACT
GGCTGGATGA GCATCACCGG AGGGATTTCC ATCGCGGCGC GGCACAAGCC GGCATGGCGG
ACGGGGGGCG CGTTCATCGC CCTGGCGATC CTCTGCACTT TCTGGTTTCT CGTCATATCC
CCTGATTTTG CGATAAGGAC CTACGCGGCC AGCCTGCCTG TCATCGGCTT CGACCTGTAT
GCCGCTTTCG TGTGCCTGCG CTTCAAGGAA GGGGCACAAC GCGCAACGTT CAGGATACTG
GCGATTACCT TGCTGGCACA GGTGATCTCC AATCTCGTCA AGACCGCTTA CATCGGCATC
AACGAATTGA CCCAGCTTTT TGACGCCCGT TGGCAGGTCG GTCAGTTCTA TGTGGTGTCG
GTAATCACGG CATCGGTATC GACCGCGTTA TTCGTGTTGC TCGCCGTACA ACGCCTTCAG
GAACAATTGA ATGCCGAACT CCTGGCGCGG CGAGAAATCG ACCACTCAGT ACAGCTTGCC
GCCATGGTCT ATCAGGCAAG CAGTGAAGGC ATGCTCGTCA CGGAACCCGA TGGCAGAATC
ATCTCCGTCA ATCCTGCTTT CACCGCCATC TCTGGTTATA CGCAAGATGA GCTGGTCGGA
AAAACTCCCC GGATATTCAA GTCGGGAAAA CAAGAGCCCG AGTTTTATGC GGCGATGTGG
CGGGAAATCA TTGCGACGGG GCACTGGAAA GGCGAACTGT GGAATCGACA CAAGGACGGA
AACCTGTCTG CCGAGGCGCT CAGTATCAAT ACCATACGCA ACCCGGACGG ATCGCCACAA
CGGTTTGTAG CGCTCTATCA CGATGTCACG TCACAAAAAC AATCCGCAGA GGTCATTTAC
CATCAGGCCA ACCACGATCA ATTAACCAAT TTAGCCAACA GAAACCACTT CTTTCAGCAG
CTTTCGAAAG AACTATCCCG CGCCCGCCGC ACCGGAAAGC GCGTTGGGCT GCTGTTCACC
GACCTGAATC GATTCAAGCC AATTAACGAC CAATTTGGTC ACGAAGCAGG CGACACGGTT
CTAAAAACCG TCTCGCAACG GTGGTTGGCT TGTGTACGCG ACAACGACCT TCTGGCAAGG
ATAGGCGGCG ACGAATTTGC GCTGATTATT TGTGATCTAA ACGATGTATC GCAAACCAAG
ATCATCGCGA GAAAGCTCAT TGCGACGCTG GAAGCACCGA TCGCAATTGG TAACGATCAA
AGCTGCACGG TCGGCACAAG CATAGGGATC GCCATCTATC CTGACAACGC CATGGAAATG
GATTCCCTGA TTTCTGTCGC CGACGCGGCC ATGTATGCCA GCAAGTCGTC TGGCAACAAC
ATCATTCGAC TCGCCGACGT TGTGGCGTCA GAAAAGAAAA ATCAATCGGA CTGGATTGTT
TTTGAGTCGG GCCATCTTGT TGGCGTCAAA AAAATCGACG AGCAGCATCA GGAACTGGTC
CGCATGGTAA ACAAGATCAA TCGAGCCATT CATTCGCGAT GTGAAGATGC CTCACTGAAA
CTGATGTTCA ATGAACTCGT TGCATTCACC GAACACCATT TCTCGACCGA ATTACGGTTC
ATGATCGATT TTCACTATCC GGAAACCGAC GTTCATGACC AGCAGCACCA GGCACTTGTG
GCGCAACTCA GTAATCTGAT TACCCGGTTC GAGCAAGGTG ATGAGTTGCA ATCGCTCCAG
ATGATCAAAG ACTGGCTACT GAGGCACATC GAACATGCCG ACAAGCCGCT AGGGGCATAT
CTAGCCTCTA AAGGTGCCAA TTAG
 
Protein sequence
MMSDMNSLLD IRTLILVITI VVTCRTFILG YVWKITRPYR PASYWAIGSL LISVGELLVG 
LRDHISPVVS ILLAQCLIMT GWMSITGGIS IAARHKPAWR TGGAFIALAI LCTFWFLVIS
PDFAIRTYAA SLPVIGFDLY AAFVCLRFKE GAQRATFRIL AITLLAQVIS NLVKTAYIGI
NELTQLFDAR WQVGQFYVVS VITASVSTAL FVLLAVQRLQ EQLNAELLAR REIDHSVQLA
AMVYQASSEG MLVTEPDGRI ISVNPAFTAI SGYTQDELVG KTPRIFKSGK QEPEFYAAMW
REIIATGHWK GELWNRHKDG NLSAEALSIN TIRNPDGSPQ RFVALYHDVT SQKQSAEVIY
HQANHDQLTN LANRNHFFQQ LSKELSRARR TGKRVGLLFT DLNRFKPIND QFGHEAGDTV
LKTVSQRWLA CVRDNDLLAR IGGDEFALII CDLNDVSQTK IIARKLIATL EAPIAIGNDQ
SCTVGTSIGI AIYPDNAMEM DSLISVADAA MYASKSSGNN IIRLADVVAS EKKNQSDWIV
FESGHLVGVK KIDEQHQELV RMVNKINRAI HSRCEDASLK LMFNELVAFT EHHFSTELRF
MIDFHYPETD VHDQQHQALV AQLSNLITRF EQGDELQSLQ MIKDWLLRHI EHADKPLGAY
LASKGAN