Gene Rpal_3454 details


Gene Information

Locus tag: Rpal_3454
Symbol:
ID: 6411128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3697207
End bp: 3698685
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 67%
IMG OID: 642713333
Product: hypothetical protein
Protein accession: YP_001992430
Protein GI: 192291825
COG category: [S] Function unknown
COG ID: [COG0397] Uncharacterized conserved protein
TIGRFAM ID:
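
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon. The function and variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields; names are illustrative only.
START_BP = 3697207
END_BP = 3698685


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1479 492
```

Under these assumptions the computed values agree with the reported Gene Length (1479 bp) and Protein Length (492 aa).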


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.160092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCTC ATTTTCCGTT CGACAACAGC TACGTGGCGC TCCCGCCGAA CTTCTTCGCG 
CGGGTTGCGC CGACGCCGGT CGCCGCCCCC CGGTTGATCA AGCTGAACCG CCCGCTCGCG
GTGCAGCTCG GGCTTGATCC GGACCTGCTC GAGACGCCCG AGGGCGCGGA GATTTTATCC
GGTAACCAAA TGCCGGAGAC CGCAGCCTCG ATCGCGATGG CCTATGCGGG CCACCAGTTC
GGCAACTTCG TGCCGCAGCT CGGCGACGGC CGGGCGATCC TGCTCGGCGA GGTGGTCGAC
CGCAACGGGG TTCGCCGCGA TATCCAGCTG AAGGGCGCCG GCCGGACGCC GTTTTCGCGG
ATGGGCGACG GCCGCGCCGC GCTCGGCCCG GTGCTGCGCG AATACATCGT CAGCGAAGCG
ATGGCAGCTC TTGGCATCCC GACCACCCGC TCGCTCGCCG CGGTGCTGAC CGGCGAAACG
GTGCTGCGCG ATCCGATCCA GCCGGGCGCT GTGCTGACGC GGGTGGCCTC CAGCCATATC
CGGGTCGGCA CCTTCCAGTA TTTCGCCGCC CGCGGCGATC TCGCCAGCGT CCGGGCGCTC
GCCGACCATG CCATCGCCCG CCACTACCCG GAGGCGGCTC AGGCGCCCTC GCCTTATTTG
GCCCTGCTCG AAGGCGTGAT CGGCCGTCAG GCGGAACTGG TGGCGAGCTG GATGATGGTC
GGCTTCATCC ATGGGGTGAT GAACACCGAC AACTGCTCGG TTGCCGGCGA GACCATCGAT
TACGGCCCCT GCGCCTTCAT GGACACCTTC GATCCGAAGA CCGTTTACTC CTCGATCGAC
CAGTTCGGCC GTTACGCCTA CGGCAACCAG CCCCCGATCG CCTTGTGGAA CCTGACCCGG
CTGGCCGAAT GCCTGGTCCG GCTATTGGCC GATGACGACG ACAAGGGCAT CGAAATCGCC
CAGACCGCGC TCGGCGGCTT TGCGGAGCGG TTCAACGCCG CGTATCTGGC CAAGCTGGCG
GCCAAGCTCG GCCTGTTCAC CAGCCAGCCG GACGATCAAC AGTTGTCGCA GGAATTCCTG
ACCGCCCTGG CCAAGGGCGA AGCGGACTTC ACCCTCGCCT TCCGCCGGCT GAGCGACGCG
GCTGTCGATC CGTCGGACCT CGGTGAGGTT CGCGCCCTGT TTGCCGATCC GGCGGCGTTC
GACGAGTGGG CCCCGCGGTG GCGCGCCCGG ATCGCAGCCG AGCCGCAGGA TGCAACGACT
CGCCAGGCCG CGATGCGGCG GGTCAACCCG GCCTATACCC CGCGTAATCA CCGGATCGAA
GCGGTGATCC GGGCCGCGGT CGACCGGGAC GATTTCGCTC CCTTCGAAGA GATCCTGACG
GTGCTCGCCA ACCCCTTCGA GGAAAAGGCG GAATTCGCCC GCTATGCGGA GCCGCCGCAG
CCCCATGAAG AGGTGCTGGA AACCTTCTGC GGAACTTGA
 
Protein sequence
MTAHFPFDNS YVALPPNFFA RVAPTPVAAP RLIKLNRPLA VQLGLDPDLL ETPEGAEILS 
GNQMPETAAS IAMAYAGHQF GNFVPQLGDG RAILLGEVVD RNGVRRDIQL KGAGRTPFSR
MGDGRAALGP VLREYIVSEA MAALGIPTTR SLAAVLTGET VLRDPIQPGA VLTRVASSHI
RVGTFQYFAA RGDLASVRAL ADHAIARHYP EAAQAPSPYL ALLEGVIGRQ AELVASWMMV
GFIHGVMNTD NCSVAGETID YGPCAFMDTF DPKTVYSSID QFGRYAYGNQ PPIALWNLTR
LAECLVRLLA DDDDKGIEIA QTALGGFAER FNAAYLAKLA AKLGLFTSQP DDQQLSQEFL
TALAKGEADF TLAFRRLSDA AVDPSDLGEV RALFADPAAF DEWAPRWRAR IAAEPQDATT
RQAAMRRVNP AYTPRNHRIE AVIRAAVDRD DFAPFEEILT VLANPFEEKA EFARYAEPPQ
PHEEVLETFC GT
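
The relationship between the gene sequence and the protein sequence can be checked by translating codons in frame. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon table in NCBI order and relies on the fact that translation table 11 assigns the same amino acids as the standard table (it differs in the permitted start codons). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the 10-base spacing kept as shown.
print(translate("ATGACAGCTC ATTTTCCGTT CGACAACAGC"))  # MTAHFPFDNS
# Last 9 bases, which end in the TGA stop codon.
print(translate("GGAACTTGA"))  # GT
```

The two printed fragments match the first ten and last two residues of the protein sequence above. Passing the full gene sequence to `translate` would allow the same comparison over all 492 residues.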