Gene RPB_4080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4080 
Symbol 
ID3911887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4651251 
End bp4652807 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID637885984 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_487684 
Protein GI86751188 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.656205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.161644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCA CCCACGGCCT GCGGCGCGCG CTGCAGATCA ACGCCGACGG CCTCGCCACC 
GTGTTCAACG GCCGCCGCCG CAGCTGGCGC GACGTCGGCG AGCGCGTCGC CCGCCTCGCC
GCCGGCCTGC GTGCGCTCGG CGTCGGCGAA GGCGAGCGCG TCGCCGTGCT GTCGATGAAT
TCCGACCGCT ATCTGGAGCT GTATCTGGCC GCCGGCTGGT GCGGCGGCGT GATCGTGCCG
CTCAACATCC GCTGGAGCGC ACTGGAGAAC GAGGACGCGC TGCGCGACTG CCGTGCGGTG
GCTCTGGTCG TCGACAAGGC GTTCGCCGCG ACAGGCGCGA CGCTGGCGCA AGCGATTCCA
GGTCTGGCAA TGGTCTTTGC CGACGATGGC GATGTGCCGG CAGGCATGAA GAGCTACGAA
GACGTCATCG CGACCAACGC GCCGATCCCC GACGCGATGC GCAAGGCCGA GGACCTCGCC
GGTATTTTCT ATACCGGCGG CACCACCGGG CGTTCCAAGG GCGTGATGCT GAGCCACGGC
AACCTGATGG CCAACGCGCT CAATGCTTTG GGCGAAGGTC TGTTTCCCAG CACCTCGGTG
TATCTGCACG CGGCGCCGAT GTTCCATCTC GCCAACGGGG CGGCGATGTA TTCGCTGCTG
CTGTCCGGCG GCTCCAATGT CATGGTGCCG TCGTTCACGC CGGAAGGCGT GATGGCGGCG
ATGCAGAACG ACCGCGTCAC CGATGTGCTG CTGGTGCCGA CCATGATCCA GATGTTCGTC
GATCATCCGG CGCTGAAGAG CTACGACCTG TCGTCGCTGA AGAACATCAT CTACGGCGCC
TCCCCGATCA GCGAGGCGGT GCTCGCCCGC GCCAGCGCCG CGCTGCCCCA CGTGCAGTTC
ACCCAGGCCT ACGGCATGAC CGAATTGTCG CCGATCGCCA CGCTGCTGCA CTGGAAGGAG
CATATCGGCG ACGGCAAGGC CAAGGGACGG CAGCGCGCGG CCGGCCGCGC TACGCTCGGC
TGCGAGGTCC GCATCGTCGA CGCCGACGAC CAGCCGGTGC CGTACGGCAC CGTCGGCGAG
ATCTGCGTGC GCGGCGACAA TGTGATGATG GGCTATTGGG AGCGTCCGGA GGAGACCGCG
CGGGCGCTGG CGGGCGGCTG GATGCACACC GGCGACGGCG GCTACATGGA CGAGCACGGC
TTCGTCTACG TCGTCGACCG CGTCAAGGAC ATGATCATCT CCGGCGGCGA GAACGTCTAT
TCGGTCGAGG TCGAGAACGC GGTGGCGCAG CATCCGGCCG TGGCGCAATG CGCGGTGATC
GGGATTCCGC ACGAGGCCTG GGGCGAGCAG GTCCACGCCG TGGTCGTCAC CAAGGCCGGC
GCGAGCGTCA CCGCCGACGA ACTGATCGCG CATTGCAAGG CGCTGATCGC CGGCTACAAA
TGCCCGCGCA GCGTCGACAT CACCGAGACG CCGCTGCCGC TGTCGGGCGC CGGCAAGATC
CTCAAGCGCG AATTGCGACA GCCCTATTGG GAGAACCGCG AACGCCGCGT GAGCTGA
 
Protein sequence
MNITHGLRRA LQINADGLAT VFNGRRRSWR DVGERVARLA AGLRALGVGE GERVAVLSMN 
SDRYLELYLA AGWCGGVIVP LNIRWSALEN EDALRDCRAV ALVVDKAFAA TGATLAQAIP
GLAMVFADDG DVPAGMKSYE DVIATNAPIP DAMRKAEDLA GIFYTGGTTG RSKGVMLSHG
NLMANALNAL GEGLFPSTSV YLHAAPMFHL ANGAAMYSLL LSGGSNVMVP SFTPEGVMAA
MQNDRVTDVL LVPTMIQMFV DHPALKSYDL SSLKNIIYGA SPISEAVLAR ASAALPHVQF
TQAYGMTELS PIATLLHWKE HIGDGKAKGR QRAAGRATLG CEVRIVDADD QPVPYGTVGE
ICVRGDNVMM GYWERPEETA RALAGGWMHT GDGGYMDEHG FVYVVDRVKD MIISGGENVY
SVEVENAVAQ HPAVAQCAVI GIPHEAWGEQ VHAVVVTKAG ASVTADELIA HCKALIAGYK
CPRSVDITET PLPLSGAGKI LKRELRQPYW ENRERRVS