Gene RPB_3874 details


Gene Information

Locus tag: RPB_3874
Symbol:
ID: 3911678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4429091
End bp: 4430302
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 67%
IMG OID: 637885775
Product: diguanylate cyclase
Protein accession: YP_487478
Protein GI: 86750982
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
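
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4429091
end_bp = 4430302
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span == gene_length_bp)
print(expected_protein_length, expected_protein_length == protein_length_aa)
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length.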


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAGCG TCCCGACACT CTGGGTGGTC TTTCTGATCA ACTTCCTCGC GCTGGGTCTG 
GTCTGGACCC ACGTGATGCG GAGCTATCCG AACTTCACAC CGGCGCGCTA CTGGACCGCG
GCCTGCTTTG TGGTTTCACT CGGCGCGGGC TTCGGGATGC TGCGGGGCGT GATGGACACC
AAGGTGCCCC TCATCGTCGG CGGCAGCACC GTCGTTCTGG CCGCCTATCT GATCGCAATG
GGCGTCTTCT GCTTTTACGG CCGGCACATG AGCTGGCGGC TCGCCCTCGG CGCGACCGCA
GCCTGCAGCG CGGGTCTCGG CTTCTTCCTG CTCGTGATCG ATTCGATGAT GATGCGGATC
CTGATCTATT CGGTGGCCCA GGCGGTCCCG ATCGCGATGA CGTTGCCACT GATGCTGTCG
CGCAGCGGCC GCCGCAATCC CGGCGCGCGG ATGGCCGCAG CGGTGGCGAT CGCGATGCTG
CTGGTCTACG CGGTCCGCTC CGGGGCTGCG ATCATCGGCG TCGGCGGCGA ATTGTCGGTG
GTCAATTTCA ATGACTTTCA GGCTTCGCTG GTGCTGATGC TGGTGTTCCT GTCGATGACG
CTGAACTTCT CGTTCCTGCT GATGGCGATC GACCGGCTGC GCTCCGAAGT CGAGAGCCTC
GCGCTGATCG ACGACCTCAC CGGCATCGCC AACCGCCGCC ATTTGCTGCA GCGGATGGTG
GCGCAGTGTC GGACCGCGAT GCAGACCGGC GAGCCGTTCA CGGTGCTGGC GATCGACCTC
GACGGCTTCA AGGCGATCAA TGACGGCCAC GGTCACGCCG CCGGCGACGA GTGCCTGCGC
CGGTTCTCCG GCGCCGCGCA GGCGCGGCTG CGTCCCGGCG ACCTGTTTGC GCGCACCGGT
GGCGACGAGT TCTGCGTGGT GATGCCGGCG ACCACCCTGC GCGAAGGCGC GATGGTCGCC
CGCCACATCC TCGAAGAAAG CCGCGCGCTC TCGACCCAAG GCGGCGATGC GGCCAGCGCC
ATCGCCATCG CCGCTTCGAT CGGCGTCGCG CAGTGGACGC CGCAGGTCGG ATTGCATCCC
GACCGGCTGA TCGCGGCCGC GGACGCCGCG CTGTACAACG CCAAGAAACT CGGCAAGGAT
CGCTACGCGA TCCACGAGCC GAAGCTGGAA CCGCTGCCTC CGTTCCTCGA GCCGGTGCGC
AAGATCGCCT GA
 
Protein sequence
MLSVPTLWVV FLINFLALGL VWTHVMRSYP NFTPARYWTA ACFVVSLGAG FGMLRGVMDT 
KVPLIVGGST VVLAAYLIAM GVFCFYGRHM SWRLALGATA ACSAGLGFFL LVIDSMMMRI
LIYSVAQAVP IAMTLPLMLS RSGRRNPGAR MAAAVAIAML LVYAVRSGAA IIGVGGELSV
VNFNDFQASL VLMLVFLSMT LNFSFLLMAI DRLRSEVESL ALIDDLTGIA NRRHLLQRMV
AQCRTAMQTG EPFTVLAIDL DGFKAINDGH GHAAGDECLR RFSGAAQARL RPGDLFARTG
GDEFCVVMPA TTLREGAMVA RHILEESRAL STQGGDAASA IAIAASIGVA QWTPQVGLHP
DRLIAAADAA LYNAKKLGKD RYAIHEPKLE PLPPFLEPVR KIA
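
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC percentage could be computed. The helper names `clean` and `gc_percent` are hypothetical. Only the first and last rows of the gene sequence are included here to keep the example short, so the printed GC value describes that first row only, not the whole gene.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGTTGAGCG TCCCGACACT CTGGGTGGTC TTTCTGATCA ACTTCCTCGC GCTGGGTCTG"
last_row = "AAGATCGCCT GA"

head = clean(first_row)
tail = clean(last_row)

print(len(head))                     # number of bases in the first row
print(head.startswith("ATG"))        # does the gene begin with an ATG codon?
print(tail.endswith("TGA"))          # does it end with TGA (a stop codon in table 11)?
print(round(gc_percent(head), 1))    # GC percentage of the first row only
```

To check the listed GC content for the whole gene, the full gene sequence would be passed through `clean` and then `gc_percent` in the same way.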