Gene RPB_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0468 
Symbol 
ID3909813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp514959 
End bp516500 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content70% 
IMG OID637882355 
Producthypothetical protein 
Protein accessionYP_484090 
Protein GI86747594 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.101632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.77905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCGA TCGCCACGTC CCGCGCCCGC CTGTCGCTGA TCGCGCGCTG GATCGACGCC 
CTGCTCGATC CCAAACGGCA GGAGCGTACC GTCATCCTGT CGCTGGCGGT CTATGCGGCG
ATCTGGACCG CGTACCGCAC CATCGCCACC TGGCCGCGCG ACCTGCACGC CGACGAAACC
GAGCTGTACG CCTGGTCGCA GCATCTGGCG TTCGGCACTG ACAAGCATCC GCCGTTCTCG
GCCTGGGTTG CACGCGCCTG GTTCAGTGTC GTGCCGGTGT CGGATCTGAC GTTCCATCTG
CTGGCCACCG TCAACATCGC CGTCACGCTG TACATCGCCT GGCGGACGAT GCGGCGCTAT
ATGACGGCCG AGAAGGCGCT GTTCGGGCTG GCGACACTGA CGCTGATCCC GTTCTTCAAT
TTCATCGCGC TGAAATACAA CGCCAATGCG GTGCTGCTGC CGCTGTGGGC GCTGACCATC
CATGGTTTTC TGCGCGCGTT CGAACAGCGC GGCTGGCTGT GGCCGACGCT GGCGGGGGTG
TTCGCCGGCG CGTCGATGCT GGGCAAATAC TGGTCGATCG TGCTGGTCGG TTCGCTCGGG
CTCGCCGCGC TGCTGGATCG GCGGCGGGCG CGGTTCTTCG CCTCCCCGGC GCCGTGGCTG
ATGATCGTCG CGGGCGGGCT GGTGCTGGCG CCGCATGTCG CCTGGCTGGT CGAGCACCGC
TTCCCGACCT TTGCCTATGC GGCGGCGCGG GAAGCCGACG GCCTCGGGCA CAATGCGCTC
GACACGCTAC GCTATCTCGC CGGCTGTGTC GGCTATGCGG CGCTGGCGCT GATCGCCACC
TGGCTGCTGC TGCGGCCGTC GCGCGCGGCA TTGATCGAGA GCGTCTGGCC GGCGGACCCG
CAGCGCCGGC TGATCGTCAC GATCCAGGTG CTGATGATCG TGGCGCCGGC GCCGGTGGCG
CTCGTGACCG GCATCCGCAT CGTGCCGCTG TGGACGATGC CGGCCTGGAC GCTGCTGCCG
ATCGTGCTGC TGTCGTCGCC GCTGATCGCG GTCGGCCGCG ACGCGCTGCG GCGGATGCTG
ATCGGGGCCG CGGCGCTGGC GCTGACGATC CTCGCCGCGG CGCCGGGCGT GGCGGTGGCG
ATCCACAGCA GCAGCCCGCC GGAGCCGTTC GAATACGCTT CCTTGCTCGC CGACGACATC
GCGCGGGTCT GGCAGCGCCA CACCGACAGG CCGATCGCAC TGGTGGCGGG CGAAACCGTG
CTGGCGCAGA ACACCGCGTA TTATCTGCGC ACCGACAGCC GCGCCTTCGC GACCGCCGAT
CTGGCGACGC TGAAAGCCGA CGCCGCCGCG CGCGGCGCGG CGCTGGTGTG CCCGGCGGCG
GATCAGTCCT GCCTGTCGGT CGCCGAGCAG ATCGTCGCGG CGCAGCCGCA GATCCTGCGC
AGCAAGGTCT GGCTCAGCCG GCCGCTGCTC GGGATCGCCG GCGGCACGGT GCAGGACGTG
TTCTTCCTGG TGCTGCCGCC ATCAGCGACG GGGAAGACGT AG
 
Protein sequence
MTSIATSRAR LSLIARWIDA LLDPKRQERT VILSLAVYAA IWTAYRTIAT WPRDLHADET 
ELYAWSQHLA FGTDKHPPFS AWVARAWFSV VPVSDLTFHL LATVNIAVTL YIAWRTMRRY
MTAEKALFGL ATLTLIPFFN FIALKYNANA VLLPLWALTI HGFLRAFEQR GWLWPTLAGV
FAGASMLGKY WSIVLVGSLG LAALLDRRRA RFFASPAPWL MIVAGGLVLA PHVAWLVEHR
FPTFAYAAAR EADGLGHNAL DTLRYLAGCV GYAALALIAT WLLLRPSRAA LIESVWPADP
QRRLIVTIQV LMIVAPAPVA LVTGIRIVPL WTMPAWTLLP IVLLSSPLIA VGRDALRRML
IGAAALALTI LAAAPGVAVA IHSSSPPEPF EYASLLADDI ARVWQRHTDR PIALVAGETV
LAQNTAYYLR TDSRAFATAD LATLKADAAA RGAALVCPAA DQSCLSVAEQ IVAAQPQILR
SKVWLSRPLL GIAGGTVQDV FFLVLPPSAT GKT