Gene RPB_1057 details

Gene Information

Locus tag: RPB_1057
Symbol:
ID: 3908909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1212175
End bp: 1213812
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 68%
IMG OID: 637882950
Product: AMP-dependent synthetase and ligase
Protein accession: YP_484678
Protein GI: 86748182
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
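
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
length_bp = gene_length(1212175, 1213812)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1638 545
```

Both results agree with the listed Gene Length (1638 bp) and Protein Length (545 aa).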


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCT GGGGAGATTC CGGCATCGGC GGAGCGCTGC GCCGCGAGAC CCATTATGGC 
GACCGGACGA TGCTGTGCTT CGCCGAGCGC CCGCCGACGC TGGCGGCGAT GTTCGACGAT
CTGGTGACGA AATTCGGCGA CCGGCCGGCG ATCGTCGACG AGGACGTCAC GCTGAGCTAT
CGCGACCTCG ACGGACGCGT GCGCAGCATC GCCGCGTCGC TGATCGGCCT CGGCGTCGCG
CCGGGCGATC GGGTCGCGCT GTTTCTCGGC AATTGCTGGG AATTCGTCGC CTGCGCGCTG
GCCTGCAACC GGATCGGCGC GCGGCTGGTG CCGATCGGCA CGCGGCAGCG CAAGGCCGAG
CTGGACTTCC TGCTGACCAA CAGCGGCGCC AAGGTGCTGG TGTTCGAGGC CGACCTCGCC
GATCAGATTC CGGCGCAGGC CGACGTCCCG ACGCTGACGC ATCGCTTCGC CGCGCATGGC
GACGCCGCCG GCGCGCGGCC GTTTGCCGAT CTGCTGGCGG CGTCGCCGGC CGATGCGCCG
GTCGTGGCGA TGCACGAGGA CGACACCGCG GTGATCCTCT ACACCTCCGG CACCACCGGC
AAGCCCAAGG GCGCCGAGCT GACGCATCTC AGCATCCTGC ATTCGGCCTA CGCCTTCGCG
CGCGCGCATG AACTGACCGA GCACGATCGC GGCCTGGTCG CCGTGCCGCT GTCGCATGTC
ACCGGCCTGG TCGGCGTCAC CTATGCCACG CTCGCCGCCG GCGGCTGCGT CGTGCTGATG
CGGCAGTCCT ACAAGACACC CGACTTTCTG GCGCTGGCGA GCCGCGAGAA AATCACCTGG
TCGATTCTGG TGCCGGCGAT CTACACGCTG GTGGCGATGG CGCCGGAGTT CGACAGGCAC
GATTTGTCGG CGTGGCGGAT CGGCTGCTTC GGCGGTGCGC CGATGCCGGT GCCGACCATC
GAGATGCTGA GCAAGCGGCT GCCGAATCTA CAATTGCGCA ACGCCTATGG GGCGACCGAG
ACGACCTCGC CGACCACGAT CATGCCGCAG GCCCATTGGC GCGATCACAT GGACAGCGTC
GGACAGCCGA TCCCCTATGC GCAGGTCCGG GTCGTCGATG CCGATGGCAA CGAGGTCGCG
CCCGGCCAGC CGGGCGAATT GCTGATCGCC GGGCCGATGG TGGTGCCGCG CTATTGGCAG
CGCGAGGATG CCAACGCGGC GGAATTCATC GGCGGCTATT GGCGCAGCGG CGATATCGGT
TCGATCGACG CGGAGGGATT TGTGCGGGTG TTCGACCGCA AGAAGGACAT GATCAACCGC
GGCGGCTTCA AGATCTTCTC GGCCGAGGTC GAGAACGTGA TCTGCGGCCT CGACGGCGTG
CTGGAGACCG CGATCGTCGG CACGCCCGAC CCGGTGCTCG GCGAACGCGT CAACGCCATC
GTGGTCACGA GCGAAGGCGC GCAGTTGAGC GAAGGCGATG TCGCCGCCTA CTGCGCGGCG
CGGATGTCGG ACTACAAGGT GCCGGAAAGC ATCATCCTCC GCACCGAGCC ACTGCCGCGC
AACGCCAACG GCAAGATCCA GAAGACGATG CTGCGCGAGA CGATCGCCGA GCGGGCAGCG
CGTCACAGCG CAGCGTAG
 
Protein sequence
MKFWGDSGIG GALRRETHYG DRTMLCFAER PPTLAAMFDD LVTKFGDRPA IVDEDVTLSY 
RDLDGRVRSI AASLIGLGVA PGDRVALFLG NCWEFVACAL ACNRIGARLV PIGTRQRKAE
LDFLLTNSGA KVLVFEADLA DQIPAQADVP TLTHRFAAHG DAAGARPFAD LLAASPADAP
VVAMHEDDTA VILYTSGTTG KPKGAELTHL SILHSAYAFA RAHELTEHDR GLVAVPLSHV
TGLVGVTYAT LAAGGCVVLM RQSYKTPDFL ALASREKITW SILVPAIYTL VAMAPEFDRH
DLSAWRIGCF GGAPMPVPTI EMLSKRLPNL QLRNAYGATE TTSPTTIMPQ AHWRDHMDSV
GQPIPYAQVR VVDADGNEVA PGQPGELLIA GPMVVPRYWQ REDANAAEFI GGYWRSGDIG
SIDAEGFVRV FDRKKDMINR GGFKIFSAEV ENVICGLDGV LETAIVGTPD PVLGERVNAI
VVTSEGAQLS EGDVAAYCAA RMSDYKVPES IILRTEPLPR NANGKIQKTM LRETIAERAA
RHSAA
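
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which codons may act as starts), and that the sequence contains only A, C, G and T. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, written for this example only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over "TCAG".
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAGTTCT GGGGAGATTC CGGCATCGGC"))  # MKFWGDSGIG
print(translate("CGTCACAGCG CAGCGTAG"))               # RHSAA
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.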