Gene Rpal_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2021 
Symbol 
ID6409681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2192773 
End bp2193918 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID642711907 
Productdiguanylate cyclase 
Protein accessionYP_001991019 
Protein GI192290414 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACCAA CTGCCGCGCT GTCGGGCACC CTGCCCGATC CTCAAACCAG GGCCGAAGAC 
GCGCGCACCC TTGCCGTCCG CGGCCGGCGG ATGCGTCAGC GCCGGCACAT GCTGGACCTG
ATCGCCGCCA GCTTCATGAT CGATGCCGCG ATCCTGCTGA TCTACGTCCA AGCCGGGACA
ATCCCGTCGA TGGTGGTGGT GGCGTTCGCC GTCTCCGGCG CCCTGGTCGG TGGCATCCTG
TTCGCTCTGT CGGAATCCGG ATTCAACGAT CGCTTCTCCG ATCACTATCT GACGGTGCCG
ATCGCGATCA CGACGCTGGC ACTGATCATC GCCTTCACGG TATGGGCGCC GCAGATCGGG
GCGTTCTTCC TGTTCGAACT GTTCGTCGTG TTCGGCTTCG CGTCGCTGCG CGCCGACCGT
CGGCAGGCGC TGATCCTCTG GTCCGTGCTG CTGTGCGTGC TGGTGCCGCT GTTCCTGCTC
ACCGATCTGC CGATCGGGAT GCCGCACGAC AATTCGCTCG AACGCCTCGC CACCCTGCTG
GCGCTGGTGC TGACGGTCGG ACGCTGCACC TTCATCGGGG TGTTTTCGAA AGCCTTGCGC
GATTCGCTGT ACAAGCGCGG AGTCCAGCTG CGCGAAGCCT ATCAGCGGAT CGAGGAACTG
GCCGAACTCG ACGAACTGAC CGGGGCCTAT AACCGCCGCT GTATCATGCG GCACCTGGAC
AATGAGATCG AACGCGCGGC GCGGCACTGC GAGTCGCTGT CGCTGGCGTT GATCGACCTC
GACTGGTTCA AGCGCATCAA TGATACCTTT GGCCATCCGA TCGGCGACGA GGTGCTGCGC
ACCTTCGCTA TCACCATCTT CGCCAACATC CGCAGCTTCG ACCGGTTCGG CCGCTACGGC
GGCGAAGAAT TCCTGCTGCT GCTGCCGAAC ACATCCGACG ACGAAGCCCG CCTGATCCTC
GACCGGCTTC GCATGATCGT CGCCGATCTC GACTGGAGCG CGTTCTCGCC CAGCATGATG
GTGACGCTGT CCGCCGGCGT CACCACCATG ACGCTCCAAG AATCCACCGA AGCCGTGCTG
GCACGGGCCG ATCGCGCGCT CTACGAGGCC AAGCACGGCG GACGAAACCG AATTTCCTGC
GCCTGA
 
Protein sequence
MGPTAALSGT LPDPQTRAED ARTLAVRGRR MRQRRHMLDL IAASFMIDAA ILLIYVQAGT 
IPSMVVVAFA VSGALVGGIL FALSESGFND RFSDHYLTVP IAITTLALII AFTVWAPQIG
AFFLFELFVV FGFASLRADR RQALILWSVL LCVLVPLFLL TDLPIGMPHD NSLERLATLL
ALVLTVGRCT FIGVFSKALR DSLYKRGVQL REAYQRIEEL AELDELTGAY NRRCIMRHLD
NEIERAARHC ESLSLALIDL DWFKRINDTF GHPIGDEVLR TFAITIFANI RSFDRFGRYG
GEEFLLLLPN TSDDEARLIL DRLRMIVADL DWSAFSPSMM VTLSAGVTTM TLQESTEAVL
ARADRALYEA KHGGRNRISC A