Gene RPB_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2601 
SymboldnaE 
ID3910393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2983527 
End bp2986982 
Gene Length3456 bp 
Protein Length1151 aa 
Translation table11 
GC content67% 
IMG OID637884501 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_486215 
Protein GI86749719 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.742859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAG CCGGATTCGT CCATCTGCAC GTTCACTCGG CCTATTCGCT GCTGAAAGGC 
TCGATGAAGA TCGCGCGGCT GGCGGACCTC GCCAAGGCCG ACCACCAGCC GGCGCTGGCG
CTGACCGACA CCGACAACAT GTTCGGTGCG CTGGAATTCT CCGATAAGCT CGCGGGCAGC
GGCATTCAGC CGATCGTCGG CCTCGAACTC GGCATCGACT TCGGCGACCA GGATCCGACC
TCACGCAACG CGGCGCTGGC GGCGCCGGCG CGGGTGGTGC TGCTGGCGAC GCGCGAGCGC
GGCTACCGCA GCCTGATGCG GCTGAACTCG CGGGCCTTCC TCGAAACCCC GGTGAACCAG
CCGCCGCACA TCAAGTTCGA CTGGCTGGAT GGCGAGACCG ACGACGTGAT CGCGCTGACC
GGCGGGCCGG AGGGGCCGAT CTCGCTGGCG ATGCTGACCG ACCCGGCGCT CGGCCGGCTG
CGCTGCGAGC GGCTGGCGCA GGCGTTCGGC GACCGGCTCT ATGTCGAGCT GCAGCGCCAC
AACACCGATG TCGAACGGCG CGCCGAAGCC GGCCTGATCG ACATCGCCTA CGACATGGGG
CTGCCGCTGG TCGCGACCAA CGAGCCGCAT TTCGCCACCG CCGACGATTT CGAGGCCCAC
GACGCGCTGC TGTGCATCGC CTCCGGCAAG CTGATCGCCG AGACCGATCG CGTCCAGCTC
ACGCCCGACC ACCGCTTCAA GACCCGCGCC GAAATGGCGG TGCTGTTCGC CGATTTGCCG
GAGGCGCTGG CCTCGACGGT CGAGATCGCG CAGCGCTGCG CCTATCGGCC GCTGACCCGC
AAGCCGATCC TGCCGCTGTT CACCGTCGGC GCCAGCGTCA ACGACGCCGA GGAGGCGGCC
GCCGCGGAAG CGGCCGAGCT GCGTCGCCAG GCCGAGCAGG GGCTCGCGGA CCGCATGCGC
GTCCACGGCC TGTCGCAGGG CATGACCGAA GAGGACTACC AGAAGCGCCT CGCCTTCGAA
CTCGACGTCA TCACCCGGAT GAAATACGCC GGCTACTTCC TGATCGTGTC GGACTTCATC
AAATGGGCCA AGGCCCACGG CATCCCGGTC GGGCCGGGCC GCGGCTCCGG TGCCGGCTCG
CTGGTGGCGT ATTCGCTGAC CATCACCGAC CTCGATCCGA TCCGCTTCGG TCTGCTGTTC
GAGCGCTTCC TCAATCCGGA ACGCGTCTCG ATGCCGGACT TCGACATCGA TTTCTGCCAG
GACCGCCGCG GCGAAGTGAT CGAATACGTC CAGCACCGTT ACGGCCGCGA CCAGGTCGCG
CAGATCATCA CCTTCGGCAC GCTGCAGGCG CGCGGCGTGC TGCGCGACGT CGGCCGCGTG
CTGCAGATGC CGTACGGCCA GGTCGACAAG CTGACCAAGC TGGTGCCGCA GAACCCGGCC
GCACCCGTGT CGCTGAAGCA GGCGATCGAA AGCGAGCCGA AGCTTCAGGC GTTTCGCGAC
GAGGACCCGG TCGTCGCCCG CGCCTTCGAC ATCGCGCAGA AGCTCGAGGG CCTGACCCGG
CACGCCTCGA CCCACGCCGC CGGCATCGTG ATCGGCGATC GGCCGCTGTC CGATCTGGTG
CCGATGTATC GCGATCCGAA ATCAGACATG CCGGTCACCC AGTTCAACAT GAAATGGGTC
GAGCCGGCGG GGCTGGTGAA GTTCGACTTC CTCGGCCTCA AGACGCTGAC GGTGCTCGAC
GTCGCGGTGA AGCTCTTGAA GCAGCGCGAC ATCCATATCG ATCTGGCGAC GCTGGGAATC
GAGGACCCGG TCAGCTATCA GATGCTGGCG CGCGGCGACG TGGTCGGCGT GTTCCAGGTT
GAAAGCCAGG GCATGCGGCG TGCGCTGGTG GACATGCGCC CCGACCGCTT CGAGGACATC
ATCGCGCTGG TCGCGCTGTA TCGCCCGGGC CCGATGGCGA ACATTCCGAC CTATTGCGCC
CGCAAACACG GCGACGAGGA GTCGGAATAT CTGCATCCGA TGCTGGAGCC GATCCTGAAG
GAGACCTTCG GCGTCATCAT CTACCAGGAA CAGGTGATGC AGATCGCCCA GGTGATGGCC
GGCTATTCGC TCGGCGAAGC CGACCTGCTG CGCCGCGCGA TGGGCAAGAA GATCCGCGCC
GAGATGGAGA AGCAGCGTGC CATCTTCGTC GAGGGCGCGA CCAAGAACGG TGTGCCGAAG
GACTCCGCCG ACACCATCTT CGATCTGTTG GCGAAATTCG CCGACTACGG CTTCAACAAG
AGCCACGCCG CCGCCTATGC GCTGGTGTCC TATCACACCG CCTACATGAA GGCGCATTAT
CCGGTCGAAT TCATCGCGGC GTCGATGACG CTCGACCTCA ACAACACCGA CAAGCTGTCG
GAATTCCGCG CCGAGGCGCA GCGGCTCGGC ATCAAGGTCG AGGCGCCGTC GGTCAATCGC
TCCGGCCCGA CCTTCGAGGT CGGCGCCAAC ACCATCTACT ACGCGCTCGC CGGCCTGAAA
GGCGTCGGCC TGCAGGCGGT GCAGATGATC GTCGAGGCGC GCGGCGACAA GCCGTTCGCC
TCGCTCGCGG ATTTCGCAGC GCGGGTCAAT CCGCGCGCGA TCAACAAGCG GGTGATCGAA
AGCCTCGCGG CCGCCGGCGC GTTCGATTGT CTCGACAGTA ATCGCGCGCG GGTGTTCGCC
GGCGCCGACG CGATCATCGC CGCCTGCCAG CGCAGCCACG AGGCCGCGAC CTCAGGCCAG
AACGACATGT TCGGCGGGCT CGCCGACGCG CCGCAGGTGG TGCTGCCGCA GATCGAGCCG
TGGCTGCCGG CGGAGCGGCT GCGGCGCGAA TACGACGCGA TCGGCTTCTT CCTGTCCGGC
CACCCGCTCG ACGATTACGC CACCGCGCTG AAGCGGCTGC GGGTGCAGTC CTGGGCGGAA
TTCTGCAAGG CGGTGAAATC CGGCGCCACC GCCGGCAAGG TCGCCGCCAC CGTGGTGTCG
CGGATGGAGC GGCGCACCAA GACCGGCAAC AAGATGGGCA TCATGGGCCT GTCCGACCCG
ACCGGGCATT TCGAGGCGGT GCTGTTCTCC GAGGGGCTCG CGCAATATCG CGACGTGCTG
GAGCCGGGCG CCGCGGTGCT GCTGCAACTC GGCGCCGAAC TGCAGGGCGA GGACGTTCGC
GCCCGCGTGC TGCACGCCGA GCCGCTCGAC GCCGCCGCCG CCAAGACCCA GAAGGGGCTG
CGGATCTTCC TGCGCGACAC CAAGCCGCTC GATTCGATCA CGAAGCGGCT GCAGCCGCCG
GAATCGAAGG TCTCGGGGGG CAATATCGCG CTGGTGCTGA GGCTCGATCC GCACACCGAG
GTCGAATTCG AGTTGCCGGG CCGGTTTCAG GTTTCGCCGC AGATCGCCGG CGCGATCAAG
GCGGTCACCG GCGTGGAACT GGTCGAGACG CTGTAA
 
Protein sequence
MSQAGFVHLH VHSAYSLLKG SMKIARLADL AKADHQPALA LTDTDNMFGA LEFSDKLAGS 
GIQPIVGLEL GIDFGDQDPT SRNAALAAPA RVVLLATRER GYRSLMRLNS RAFLETPVNQ
PPHIKFDWLD GETDDVIALT GGPEGPISLA MLTDPALGRL RCERLAQAFG DRLYVELQRH
NTDVERRAEA GLIDIAYDMG LPLVATNEPH FATADDFEAH DALLCIASGK LIAETDRVQL
TPDHRFKTRA EMAVLFADLP EALASTVEIA QRCAYRPLTR KPILPLFTVG ASVNDAEEAA
AAEAAELRRQ AEQGLADRMR VHGLSQGMTE EDYQKRLAFE LDVITRMKYA GYFLIVSDFI
KWAKAHGIPV GPGRGSGAGS LVAYSLTITD LDPIRFGLLF ERFLNPERVS MPDFDIDFCQ
DRRGEVIEYV QHRYGRDQVA QIITFGTLQA RGVLRDVGRV LQMPYGQVDK LTKLVPQNPA
APVSLKQAIE SEPKLQAFRD EDPVVARAFD IAQKLEGLTR HASTHAAGIV IGDRPLSDLV
PMYRDPKSDM PVTQFNMKWV EPAGLVKFDF LGLKTLTVLD VAVKLLKQRD IHIDLATLGI
EDPVSYQMLA RGDVVGVFQV ESQGMRRALV DMRPDRFEDI IALVALYRPG PMANIPTYCA
RKHGDEESEY LHPMLEPILK ETFGVIIYQE QVMQIAQVMA GYSLGEADLL RRAMGKKIRA
EMEKQRAIFV EGATKNGVPK DSADTIFDLL AKFADYGFNK SHAAAYALVS YHTAYMKAHY
PVEFIAASMT LDLNNTDKLS EFRAEAQRLG IKVEAPSVNR SGPTFEVGAN TIYYALAGLK
GVGLQAVQMI VEARGDKPFA SLADFAARVN PRAINKRVIE SLAAAGAFDC LDSNRARVFA
GADAIIAACQ RSHEAATSGQ NDMFGGLADA PQVVLPQIEP WLPAERLRRE YDAIGFFLSG
HPLDDYATAL KRLRVQSWAE FCKAVKSGAT AGKVAATVVS RMERRTKTGN KMGIMGLSDP
TGHFEAVLFS EGLAQYRDVL EPGAAVLLQL GAELQGEDVR ARVLHAEPLD AAAAKTQKGL
RIFLRDTKPL DSITKRLQPP ESKVSGGNIA LVLRLDPHTE VEFELPGRFQ VSPQIAGAIK
AVTGVELVET L