Gene RPB_3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3020 
Symbol 
ID3910819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3442308 
End bp3443663 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content66% 
IMG OID637884926 
Productacetyl-CoA carboxylase biotin carboxylase subunit 
Protein accessionYP_486633 
Protein GI86750137 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.203212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATA AAATTCTCAT AGCCAATCGC GGCGAGATCG CTCTGCGGGT TCTGCGTGCG 
TGCAAGGAAC TGGGCATTGC CACGGTCGCC GTGCATTCGA CCGCCGACGC GGACGCGATG
CATGTCCGGC TCGCGGACGA AAGCGTCTGT ATCGGCCCGC CGCCGTCCAA GGACAGCTAT
CTCAACATCC CGGCGCTGCT GGCGGCCTGC GAGATCACCG GCGCCGACGC GGTGCATCCG
GGCTATGGAT TTCTATCCGA GAATGCCCGT TTCGCCGAGA TTCTCGCCGA CCACAATCTG
CATTTCATCG GCCCCAAGGC CGAGCACATC CGGCTGATGG GCGACAAGAT CGAGGCGAAG
AAGACCGCCA GGCGCCTGGG CATCCCCGTG GTGCCGGGCT CGGACGGCGC GGTCGGCCCG
GACGACGACG CGATGTCGAT CGCCAGGGAG ATCGGCTTTC CGGTGCTGGT CAAGGCCGCC
GCCGGCGGCG GTGGCCGCGG CATGAAGGTC GCGCACACCG CCGAAGACCT GTCGATGGCG
ATCTCGACCG CGGGCAACGA GGCCAAGGCC GCCTTCGGCG ACGCCTCGGT CTATCTGGAG
AAGTATCTGC AGAAGCCGCG CCACATCGAA ATCCAGGTGC TGGGTGACGG CCGCGGCGGC
GCGATCCATC TCGGCGAGCG TGACTGCTCG CTGCAGCGGC GGCACCAGAA GGTCTGGGAA
GAGAGCCCCT CCCCGGTGAT CAGCGCGGAA GCCCGCGCCC GGATCGGCGG CATCTGCGCC
AAGGCGATGC AGGACATGAG CTATGTCGGC GTCGGCACCA TCGAATTCCT CTACGAGGAC
GGCGAATTCT ACTTCATCGA GATGAACACC CGGATCCAGG TCGAGCATCC GGTCACGGAG
ATGATCACCG GGATCGATCT GGTGCTGGAG CAGATCCGGA TCGCCGCCGG CGGCGACCTG
CCGGTGTCGC AGGACGAGAT CGTGCTCAAC GGCCACGCCA TCGAGTGCCG GATCAACGCC
GAGAATCCGG TGAGCTTCCG GCCGTCGCCG GGCAAGATCG CGCGTTATCA TCCACCCGGC
GGCCTCGGCG TCCGGATCGA TTCCGCAGTC TTCCAAGGCT ACACCATCCC GCCTTATTAC
GACTCGCTTG TCGGCAAGCT GATCGTCCAC GGCAAGACCC GCGGCGAGTG CCTGATGCGG
CTGCGGCGGG CGCTGGACGA GATGGTGGTC GACGGCATCG AGACCACACT GCCGCTGTTC
CGCGCACTGG TGCGGGAACC GGGGATCATC GACGGCGATT ATCATATCCA CTGGCTGGAG
CAGTATCTCG CCGGCGTCGC CCTCGAGGGC CGCTGA
 
Protein sequence
MFDKILIANR GEIALRVLRA CKELGIATVA VHSTADADAM HVRLADESVC IGPPPSKDSY 
LNIPALLAAC EITGADAVHP GYGFLSENAR FAEILADHNL HFIGPKAEHI RLMGDKIEAK
KTARRLGIPV VPGSDGAVGP DDDAMSIARE IGFPVLVKAA AGGGGRGMKV AHTAEDLSMA
ISTAGNEAKA AFGDASVYLE KYLQKPRHIE IQVLGDGRGG AIHLGERDCS LQRRHQKVWE
ESPSPVISAE ARARIGGICA KAMQDMSYVG VGTIEFLYED GEFYFIEMNT RIQVEHPVTE
MITGIDLVLE QIRIAAGGDL PVSQDEIVLN GHAIECRINA ENPVSFRPSP GKIARYHPPG
GLGVRIDSAV FQGYTIPPYY DSLVGKLIVH GKTRGECLMR LRRALDEMVV DGIETTLPLF
RALVREPGII DGDYHIHWLE QYLAGVALEG R