Gene RPD_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1669 
Symbol 
ID4022149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1884126 
End bp1885733 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content66% 
IMG OID637961864 
Producthypothetical protein 
Protein accessionYP_568807 
Protein GI91976148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.118231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCG ATGGCGTAAG TGGGCGTACT TCCTACATCG GCTCCGGAAT TCTCAATCTC 
CGCAGTCAAC TCGACAACCT GAGCCAGCAG CTTTCGAGCG GGCGGATCTC GTCGACCTAT
GCCGGCGACG GCACTGGCCG CACGTTGGCG ATCGGGATGC GCGAGCAGTT GGCGAACATC
TCCAGCTATT CCGACACGAT GGTCAACATC AACACCCGTA TCGGCGTCGC CAATCTGTCG
CTGCAGCGGC TGAACGCCAT CGGCAGCGAG GTCAAGGGCG CGGCTGCGAG CGCCGGCTCG
ACGCTCGACA ACACCGGGCA GACGCCGGGC CAAAAGACGG CCGGGCTCGA CTTTTTCGAT
TCCGTGGACA TGCTGAATGC GCAGACCGGG GACCGCTATC TGTTCTCCGG GCGCGCCACC
GACACCGCGC CGGTGACGGC TGCCGACCAG ATCATGAACG GCAACGGCGC CGGCATCGCC
GGGCTGAAGC AGGTCGTCAG CGAGCGCCAC GATGCCGATG TCGGCACTAA CGGCATGGGA
CGCACGATCA TCTCCGCCGG CGCGACGCCG ACCTCGGTGC AGATCGGCGA GGATTTCGCC
GCCAATCCCT TTGCGCTGCC GACGCCGATC GGCGCTTCGC CGTTCGGACT GAAGATCGCA
TCGATTGCCA CCACGATCGT CGGCGCCACG GTGACCCAGC CGATCGAGAC GCCGCCGACC
ACGCCGCCGG CCGCGCCCAA TCCGGTGGCG ATGGGGATCG ATCTCGGCGC CACCATTCCG
AAGAATGGCG ACACCGTCTC GTTCACGTTC AACATGCCGG ACGGAACCCA GGAGACACTC
AAGCTGACCG CGTCGTCGCA GACGCCGTTG CCCGCGAATA GCTTCGCGAT CGATCCGGGC
GACCCGTTGG CGGTGCCGCC GGTGCCGGCC TCGCCGTCGA TCACCGCCGC GAACATGCAA
TCCGCGCTGA CGGACGCGGT GAAGAAATTG TCAGGCAGCG CGCTGTCGGC CGCGTCGGCG
ATCAAGGCCG GCGACGACTT CTTCAACAAG ACGCCGCCTC TGCGCGTCGC CGGCACAGCG
CCGTTCGGCA ACGCTACCGC GCAGGTCGTC GCAACCAAAG CCGACACCGT GTTCTGGTAC
AATGGCGAGC CGGACTCGCC GAGCGATCCG GCGCGCGGCA CCGCGATCGG CCGGGTCGAC
GATTCGATCA CTGTCCAGTA TGGCGCCCGC GCCGACGAGC AGGCGCTGCG CAAGCAGTTG
CAGACCGTCG CGGTGTTCGC GGCGGTCACC ACCTCGGCGA CCGATCCTTA CGGCTCCGAC
AAGATGGCCG CGCTCAACCA GAGGGTGGCG GCGAATCTCG CGACCGAGCC GGGCCAGCAA
TCGATCCAGA ACATCCAGGC CGACCTTGCC GGCTCGCAGG CGGCGATGAA GGCCACCAAA
GATCGCCAGA CCCAGACCAA GGCGCTGGCG CAGACGATGC TCGACTCGAT CGAGGGCGTG
AACAACGACG AGGTCGCGAC CAAGCTTCTG GCGCTGCAGA CCAGTCTGCA GGCGTCGTAT
CAGGTGACTT CGCAGCTCTA CCAAATGAGC CTCGTCAAGT TTCTTTAG
 
Protein sequence
MAIDGVSGRT SYIGSGILNL RSQLDNLSQQ LSSGRISSTY AGDGTGRTLA IGMREQLANI 
SSYSDTMVNI NTRIGVANLS LQRLNAIGSE VKGAAASAGS TLDNTGQTPG QKTAGLDFFD
SVDMLNAQTG DRYLFSGRAT DTAPVTAADQ IMNGNGAGIA GLKQVVSERH DADVGTNGMG
RTIISAGATP TSVQIGEDFA ANPFALPTPI GASPFGLKIA SIATTIVGAT VTQPIETPPT
TPPAAPNPVA MGIDLGATIP KNGDTVSFTF NMPDGTQETL KLTASSQTPL PANSFAIDPG
DPLAVPPVPA SPSITAANMQ SALTDAVKKL SGSALSAASA IKAGDDFFNK TPPLRVAGTA
PFGNATAQVV ATKADTVFWY NGEPDSPSDP ARGTAIGRVD DSITVQYGAR ADEQALRKQL
QTVAVFAAVT TSATDPYGSD KMAALNQRVA ANLATEPGQQ SIQNIQADLA GSQAAMKATK
DRQTQTKALA QTMLDSIEGV NNDEVATKLL ALQTSLQASY QVTSQLYQMS LVKFL