Gene RPB_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2394 
Symbol 
ID3909394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2746679 
End bp2748049 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content63% 
IMG OID637884293 
Producthypothetical protein 
Protein accessionYP_486010 
Protein GI86749514 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4961] Flp pilus assembly protein TadG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.600984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.360592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGA CGTCCATAGC CCCTTGGTCG AGGGACGTTT CGAGCCGTTT CGTCAAGACG 
GACGGAGGCA ACGTCGCCAT CATCTTCGCG ATCGCGCTGC TGCCGATGAT CGGCTTCATC
GGCGCGGCGA TCGACTATTC CCGCGCCAAC AAGGCGCGCA CCTCGATGCA GGCGGCGCTC
GATTCGGCCG CCCTGATGGT GTCGAAGGAT CTGGCGTCCG GCGTCATCAC CGCCGGACAG
GTGTCGGCGA AGGCGCAGAG CTATTTCGCC AGCCTCTATA ACAACACCGA GGCCCCCAAC
ATCACCGTCA CCGCGACCTA CACGGCGAAG GACAGCACCG GCTCGTCGAC GGTCTTGCTG
AAGGGCACCG GCGATATCAG CACCGAATTC ATGAACATGT TCGGCTTCCC GACGCTCGGG
ATCGGCAGCG CCGCCACCGC GACCTGGGGC GGTACGAGGC TGCGCGTCGC GATAGCGCTC
GACGTCACCG GGTCGATGGC ATCCGCCGGC AAGATGCCGG CGATGCAGTC TGCAGCCAAG
ACGCTGGTGG ACAATCTCCG CGCCAACGCC CAGACCGCTG ACGATCTCTA TATTTCCATC
ATTCCGTTCG CCCAGATGGT CAACGTCGGC AAGAGCAACA AGAACGCCAG CTGGATCAAA
TGGGACTACT GGGAAGACAC CACCGGCAGC TGCAATTGGT GGTGGCTCAC GACCAAGTCG
AGCTGCGAAA GTGCGGGCCG GACCTGGAGC TCGACCAATC AAAGCCAATG GGGCGGCTGC
GTCACCGATC GCGATCAGCC GGCAGACACG ACCAAGGATG CGCCGACGAC GGCGGCGACA
CGCTTTCCCG CGGCGAACTA CAGCGCCTGC CCCGAACAGA TCCTGCCGAT GACGTCGGCG
TATTCGTCGA GTAACGCGAC GACGATCAAG GACAAGATCG ACGCCCTCTC TCCGAACGGC
GGCACCAACC AGCCGATCGG CATGCACTGG GCGTGGATGT CGTTGCAGGA TGGCGCTCCG
CTCAACACGC CGGCCAAGGA CGCGGACTAC AAGTACACCG ACGCGATCAT CCTGCTGTCG
GACGGCATGA ACACGATCGA CCGCTGGTAC GGCAACGGCT CGAGCTGGTC GAAGGATGTC
GACGCCCGAC AGAAGCTGCT GTGCGACAAC ATCCGCGCCG CATCGGCCGC GAGCACGACG
AAGACCGTGA TCTACACCAT CCAGGTCAAC ACCGACGGTG ATCCGGAATC GGAAGTCCTG
AAATATTGCG CCGACAGCGG CAATTTCTTC GCCACCACCA CCGCGTCCGG CATCAGCACC
GCGTTCGCCC AGATTGGCGC CTCCCTGTCC AAGCTGCGCA TCGCCAAATA G
 
Protein sequence
MPMTSIAPWS RDVSSRFVKT DGGNVAIIFA IALLPMIGFI GAAIDYSRAN KARTSMQAAL 
DSAALMVSKD LASGVITAGQ VSAKAQSYFA SLYNNTEAPN ITVTATYTAK DSTGSSTVLL
KGTGDISTEF MNMFGFPTLG IGSAATATWG GTRLRVAIAL DVTGSMASAG KMPAMQSAAK
TLVDNLRANA QTADDLYISI IPFAQMVNVG KSNKNASWIK WDYWEDTTGS CNWWWLTTKS
SCESAGRTWS STNQSQWGGC VTDRDQPADT TKDAPTTAAT RFPAANYSAC PEQILPMTSA
YSSSNATTIK DKIDALSPNG GTNQPIGMHW AWMSLQDGAP LNTPAKDADY KYTDAIILLS
DGMNTIDRWY GNGSSWSKDV DARQKLLCDN IRAASAASTT KTVIYTIQVN TDGDPESEVL
KYCADSGNFF ATTTASGIST AFAQIGASLS KLRIAK