Gene RPB_4363 details


Gene Information

Locus tag: RPB_4363
Symbol:
ID: 3912178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4946921
End bp: 4948192
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 68%
IMG OID: 637886269
Product: twin-arginine translocation pathway signal
Protein accession: YP_487961
Protein GI: 86751465
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID:
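
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4946921, 4948192)
print(length_bp)                  # 1272
print(protein_length(length_bp))  # 423
```

Both values agree with the Gene Length and Protein Length fields of the record.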


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0121845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGAAAC CGATCCTGAT CAGCCGTCGC GGCGTGATCC GCGCCGCCGC GGCATCCACG 
GCGTTGCTCG CCTGCCCGGC GATCGCCAAA GCGCGTCCGA AGGTCGTGGT GATCGGCGGC
GGCGCCGGCG GCGCCACCGC GGCGAAGTAT CTGCGCCACG GCGACGATTC CGTCGAGGTG
ACGCTGGTCG AGGCCAACCG CATCTACGTC ACGCCGTTCA CGTCGAACCT GTATCTCGGC
GGGCTGAAGC CGTTCGAGGC GTTGAACTAC GGCTATGAGG GCATTGCGGC GCGCGGCGTC
GGCATGGTGT TCGACAGCGT CGCCGCGATC GACCGCGACG CCAAACAGGT GCGCACCGCG
AGTGGGGCGC GGCTGTCCTA CGACCGCCTG GTGCTGTCGC CCGGCATCGA TTTCCGCTGG
GACGCGATCC CCGGCTATTC CGAGGCCGCC GCCGAGACGA TGCCGCACGG CTATCGCGGC
AGCGCGCAGT TCCAGTTGCT GAAGCGCCGG CTCGACGCGC TGTCCGACGG CGCGCTGATC
GTGATCATCG CGCCGCCCAA TCCGTATCGC TGCCCGCCGG CGCCTTACGA GCGCGCCTCG
ATGATGGCCC ATGCGCTGAA GAGCCGGGGC GTGAAGAACG CCCGCATCGT CATCCTCGAC
GCCAAGGATC ATTTCGCGAT GCAGACGTTG TTCATCGACG GCTGGGAGCG GCATTATCCC
GGCATGATCG AATGGCAGGA CCCGACCATC CACGGCGGCA TCAAGGCAGT CGATCCGAAG
GCGATGACCG TGACCACCGA TTTCGAGACC CACAAGGCGG CGCTGGTCAA CGTCATCCCG
CCGCAGATCG CGGGGAAGCT CGCGCGCGAT TCCGGCCTCG CCGACGCCAG CGGCTTCTGC
CCGGTCGATG CCGGCACCAT GATCTCGCTG ATCGATCCGT CGATCCAGGT GATCGGCGAT
TCCGCGACCG GCGGTGAATT TCCCAAATCC GGCTTCGCCG CCAACAACGA GGCGAAGGGC
GCGGCGATGA TCCTGCGCGC CGAATTGCTC GGCGAGCGGC GGATGCCGAT CCGCTTCACC
AACCATTGCT GGAGCGACAT CGCCCCCGAC GACGCCGTCA AGAACGGCGC CCGCTACACC
CCGCAGGACG GCAAGATCGT GGCGTCCGAT CCCTACACCT CGCAGCTCGA CGAAAGCGCG
GAGCTGCGCG CCAAGCAGGC GCGCGAGGCG GCGGGCTGGT ACATCGGCAT GACGACGGAC
ATCTTCGGCT GA
 
Protein sequence
MPKPILISRR GVIRAAAAST ALLACPAIAK ARPKVVVIGG GAGGATAAKY LRHGDDSVEV 
TLVEANRIYV TPFTSNLYLG GLKPFEALNY GYEGIAARGV GMVFDSVAAI DRDAKQVRTA
SGARLSYDRL VLSPGIDFRW DAIPGYSEAA AETMPHGYRG SAQFQLLKRR LDALSDGALI
VIIAPPNPYR CPPAPYERAS MMAHALKSRG VKNARIVILD AKDHFAMQTL FIDGWERHYP
GMIEWQDPTI HGGIKAVDPK AMTVTTDFET HKAALVNVIP PQIAGKLARD SGLADASGFC
PVDAGTMISL IDPSIQVIGD SATGGEFPKS GFAANNEAKG AAMILRAELL GERRMPIRFT
NHCWSDIAPD DAVKNGARYT PQDGKIVASD PYTSQLDESA ELRAKQAREA AGWYIGMTTD
IFG
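
The sequences above are printed in blocks of ten characters, and the gene opens with GTG while the protein opens with M. The following is a minimal sketch of how the blocks could be cleaned and translated, assuming the standard amino-acid assignments used by translation table 11 and that an initiating GTG is read as methionine. The helper names are hypothetical, and `START_CODONS` is a deliberately small subset of the start codons that table 11 permits.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def translate(dna: str, is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_start and codon in START_CODONS:
            residue = "M"  # an initiating codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate("GTGCCGAAAC CGATCCTGAT CAGCCGTCGC"))  # MPKPILISRR
print(translate("ATCTTCGGCT GA", is_start=False))     # IFG
```

The two calls use the first thirty bases and the last twelve bases of the gene sequence, and their output matches the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.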