Gene RPD_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1116 
Symbol 
ID4021592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1270021 
End bp1271262 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID637961308 
Producthypothetical protein 
Protein accessionYP_568255 
Protein GI91975596 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.048386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT TTGCCATCGC GGCCGCTCCC TCTTCCGTGA CGGCGCCGCG GCTGAGAGCG 
CTGCAGCTTG CCCTGCTGTG GTTCGTCGGC GCCAGCGGCG CGATCGTCTT CATCGAGCCC
AGCCCTTATG AATTCGCGAT CCTGCTGTCT ATCATCGTGT TCCTGGCCTC CGGATTGCGG
ATCACGCCCG TCCTGATCGT GCCGATCGGG CTGCTGATCG GCGTCGAGCT GGGCTACACC
ATCGGCGCCG CCGATCTGCT CGGCGACACC ATCATCCTGA ACTGGCTGTT GACGTCGTGG
TACATGGCGA TCACCGCGAT ATTCTTCGCG CTGGTTTCGT TGCAAGACAC CGGCGAGCGG
ATCGAGGCGA TCGCCAAGGG CTATCTGGTC GGCGGCATCA TCGCATCGCT CGCGGGGATC
GCCGGCTATT TCAACCTGAT CCCCGGCGCG GGAGATCTGC TGACCTACGC CGGACGCGCG
CGCGGCACCT TCAAGGACCC CAACGTGCTC GGCGCGTTCC TGATCTTTCC GGCGATCTAC
GCGCTGCAGC GGGTGATCGA AGGATCGTTC TGGAGCGCGG TGCGCAATGC GATCGCCTTC
GGCATCATCG CGCTCGCGAT CTTCCTGGCC TTTTCGCGCG CGGCCTGGGG CACGCTGGCC
GGCGCGTCGG TGCTGATGAT CGCGCTGACC TTCATCACCG CACCGACCCA GCAGAGACGA
CTGCGGATCG TGGTGCTCGC TGCGCTCGCC GCGGCGATGC TGGTCGCGGC GATCGCCGTG
TTGCTGTCGA TCGACCAGAT CGACGAGCTG TTCAGGCAAC GCGCCAGCCT GTCGCAGCCT
TATGACAGCG GCAGGTTCGG CCGCTTCGGC CGGCATCTGC TCGGCGCCGG CATGGCGCTG
GACTATCCGA CCGGGATCGG ACCGCTGCAA TTCCGCCGAT TCTTTCCCGA GGACACCCAC
AATTCGTTCC TCAACGCCTT CATGTCAGGC GGCTGGATCA GCGGCATTCT GTATCCTGCT
CTGGTGTTCA TCACCGCGGC CTACGGCCTC CGCAACGTCT TCGTCCGCAC GCCCTGGCAG
CGGACCTATA TCGCGATCGT CGCGACGCTG ATCGTGACGC TGCTCGAGAG CTTCATTATC
GATACCGATC ATTGGCGGCA CTATTTTATG CTGATCGGCT TGACCTGGGG CGCGGCAATT
GCGAGCAGTC GAATCCGGTT TCAGAGCAAC GCAGCGCCCT GA
 
Protein sequence
MTDFAIAAAP SSVTAPRLRA LQLALLWFVG ASGAIVFIEP SPYEFAILLS IIVFLASGLR 
ITPVLIVPIG LLIGVELGYT IGAADLLGDT IILNWLLTSW YMAITAIFFA LVSLQDTGER
IEAIAKGYLV GGIIASLAGI AGYFNLIPGA GDLLTYAGRA RGTFKDPNVL GAFLIFPAIY
ALQRVIEGSF WSAVRNAIAF GIIALAIFLA FSRAAWGTLA GASVLMIALT FITAPTQQRR
LRIVVLAALA AAMLVAAIAV LLSIDQIDEL FRQRASLSQP YDSGRFGRFG RHLLGAGMAL
DYPTGIGPLQ FRRFFPEDTH NSFLNAFMSG GWISGILYPA LVFITAAYGL RNVFVRTPWQ
RTYIAIVATL IVTLLESFII DTDHWRHYFM LIGLTWGAAI ASSRIRFQSN AAP