Gene RPC_4621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4621 
SymbolxseA 
ID3972131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5160828 
End bp5162453 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content69% 
IMG OID637927732 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_534462 
Protein GI90426092 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCAA CGACAGCCGC GCTGCCGAAC TCGCCGGAAT TCACCGTCAC CGAACTCTCC 
TCCGCGCTGA AGCGGACGGT CGAGGACGCC TACGGTCATG TCCGGGTGCG CGGCGAGATT
TCCGGGTTTC GCGGCCCGCA TTCCTCCGGA CACTGCTATT TCGCGCTGAA GGATGAAGGC
GCCAAGATCG AGGCGGTGAT CTGGAAGGGC GTGCATGGCC GGATGCGTTT CAAGCCGCAG
GAAGGCCTCG AGGTCATCGC CACCGGCAAG CTGACCACCT ATCCCGGCTC GTCGAAATAT
CAGATCGTGA TCGAAGCCAT CGAGCCCGCC GGGATCGGCG CGCTGATGGC CTTGATGGAA
GAGCGCAAGC GCAAGCTCGG CGCCGAAGGG CTGTTCGACG AGGCCCGCAA GCAGCTGTTG
CCGTATCTGC CGGAGGTGAT CGGCGTCGTC ACCTCGCCGA CCGGCGCGGT GATCCGCGAC
ATTTTGCATC GGTTGCAGGA CCGCTTCCCG CGCCGCGTGC TGGTGTGGCC GGTCAAGGTG
CAGGGCGACG GCTCGGCCGA ACAAATCGCC GCCGCGATCC ACGGCTTCAA CGCGCTGCCG
GAAGGCGGAT CGATCCCGCG GCCCGATCTG TTGATCGTGG CGCGCGGCGG CGGCTCGCTG
GAGGATCTGT GGTCGTTCAA CGAGGAGATC GTGGTCCGCG CCGCCGCCGA CAGCATGATC
CCGCTGATTT CGGCTGTCGG CCACGAGACC GACATCACGC TGATCGATTT CGCCGCCGAC
AAGCGCGCGC CGACGCCGAC CGCGGCGGCC GAAATGGCGG TGCCGGTGCG CGCCGAATTG
TTCGTCGAGG TCGGCCAGCT CGGCCACCGC GCCATGCTTT GTTTCCAGCG CGGCCAGGAG
CGGCGCCGCA GCGAGCTGCG CGCCGCCACA AGGGCGCTGC CGTCGGCCGC GGCACTGCTG
GCCATTCCGC GACAGCGTCT TGATGGAGCG GCGGCGGCGC TGCCACGCGC GTTGCGCGCC
AACAGCCATG CGCATCACCG CCGCTTCGCA ACCGCCTGCG CCGGGCTGAC GCTGAAGGTG
TTGCGCGCCC AGGTCGCGCA CGCCACGCAG CGGCTCGCCG TCAATGGCGA ACGGCTGAAG
CATTGCGCGC GGGCCGGGCT GCGGCATCGC CGCGATCGGT TCGCGGCGCT CGATGCCCGC
TTCAAGGCGT CGAAGCTGGC CAATGTGCAG GCGCAGCGCA AAGCCTTGGC GCGCGACCGC
GAACGCACGC AGCGGCTCGC CGAGCGGGCA CGCCGCGCGC TGCTGATTGC GTTGCAGCGC
CAGCAGGCCC GGGTGGCGTC GTGCGGACAG TTGCTTGGTG CGCTGTCGTA TCGCGGCGTG
CTGGCACGCG GCTTCGCGCT GGTGCGCGAT GCGCAGGGGC TGGCGGTGCA TGCCGCGGCG
GCAATCGATC CCGGCGCGAG GCTCAGCCTG GAGTTTGCCG ACGGCCGGAT CGGCGCCACC
GCGGATGGCG AGAGCGCGGC TGCGCCGCGT AGTAAGCCCG CGGCAAAGCC GGCGACGGCG
AAGCCGGCCA GCACGACGAC GGCGAAGCGA GTGTCGAAAT CGGTCGATCA GGGCAGTTTG
TTTTAG
 
Protein sequence
MPPTTAALPN SPEFTVTELS SALKRTVEDA YGHVRVRGEI SGFRGPHSSG HCYFALKDEG 
AKIEAVIWKG VHGRMRFKPQ EGLEVIATGK LTTYPGSSKY QIVIEAIEPA GIGALMALME
ERKRKLGAEG LFDEARKQLL PYLPEVIGVV TSPTGAVIRD ILHRLQDRFP RRVLVWPVKV
QGDGSAEQIA AAIHGFNALP EGGSIPRPDL LIVARGGGSL EDLWSFNEEI VVRAAADSMI
PLISAVGHET DITLIDFAAD KRAPTPTAAA EMAVPVRAEL FVEVGQLGHR AMLCFQRGQE
RRRSELRAAT RALPSAAALL AIPRQRLDGA AAALPRALRA NSHAHHRRFA TACAGLTLKV
LRAQVAHATQ RLAVNGERLK HCARAGLRHR RDRFAALDAR FKASKLANVQ AQRKALARDR
ERTQRLAERA RRALLIALQR QQARVASCGQ LLGALSYRGV LARGFALVRD AQGLAVHAAA
AIDPGARLSL EFADGRIGAT ADGESAAAPR SKPAAKPATA KPASTTTAKR VSKSVDQGSL
F