Gene RPB_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1859 
SymbolxseA 
ID3908054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2123791 
End bp2125407 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content70% 
IMG OID637883753 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_485478 
Protein GI86748982 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.020819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAC TGCTCGCCCC CGAAACCCTC GCCAATGTCG GTGAATTCAC CGTCTCCGAG 
CTGTCGCAGG CGCTGAAGCG GACGGTGGAG GACAGCTATG GGCACGTCCG GGTGCGCGGC
GAGATTTCCG GATTCCGCGG CGCGCATTCA TCCGGGCACT GCTATTTCGC CCTGAAGGAC
GAGAGCGCCA AGATCGAGGC GGTGATCTGG AAGGGCGTCG CGGGGCGGAT GCGGTTCAAG
CCGCAGGAAG GCCTCGAGGT CATCGCCACC GGCAAGCTCA CCACCTATCC CGGTTCGTCG
AAATATCAGA TCGTGATCGA GGCGCTGGAG CCCGCCGGCG TCGGCGCGCT GATGGCGCTG
ATGGAAGAGC GCAAGCGCAA GCTCGGCGCC GAAGGGCTGT TCGACGAAGC GCGCAAGCAA
TTGTTGCCGT ATCTGCCCGA GGTGATCGGC GTCGTCACCT CCCCCACCGG CGCGGTGATC
CGCGACATCC TGCATCGGCT CGAAGACCGC TTCCCGCGCC GCGTGCTGGT GTGGCCGGTG
AAGGTGCAGG GCGAAGGCTC GGCCGAACAG GTCGCCGCCG CGATCCACGG CTTCAACGCG
CTGCCCGAAG GCGGCCCGAT CCCGCGCCCC GACCTGCTGA TCGTCGCGCG CGGCGGCGGT
TCGCTGGAGG ATCTGTGGTC GTTCAACGAG GAGATCGTTG TCCGCGCCGC GGCCGAAAGC
ATGATCCCGC TGATCTCTGC GGTCGGCCAC GAAACCGACG TGACGCTGAT CGATTTCGCC
GCAGACAAGC GCGCGCCGAC GCCGACCGCC GCTGCCGAAA TGGCGGTGCC GGTCCGCGCC
GAATTGTTCG TCGAGGTGCA GAGCTACGCG CGGCGGATGA TGCTGTGCTG GTCGCGTGGC
CAGGATTCAC GCCGCAACGA ACTACGCGCT GCCGCCCGCG CGCTGCCCGC TGCGGGCGAA
CTGCTCGCGA TCCCGCGGCA GCGGCTCGAC GGCGCGGCGT CGTCCCTGCC CCGCGCGCTG
CGCGCCAACA CCCACGCCCA TCACCGCCGC TACGCCAAGG CCGCGTCGGG CATCACGCTG
AATGTGCTGC GCGCCCAGGT CAGCCACAGC GCGCAGCGGC TCGGCAGCAC CGGCGAGCGG
CTGAAGCATT GCACCCGCGC CACGCTGCGC CATCGTCGCG ACCGCTTCGA GAGTCTGGCG
ATCCGGCTGC AGGCCTCGAA GCTCGCCAAC GAGCAGGCGC AGCGGATGCG GATCGCGCGC
GAGCGCGAGC GGATGCTGCG GCTCGCCGAG CGCGCCCGGC GGGCGTGGGC GACGCTGCGC
GATCGCCAGC AGGCGCGCCT TGTTCAATCC GGCAAGCTGC TCACCGCCCT GTCGTATCGC
GGCGTGCTGG CACGCGGCTT CGCGCTGGTG CGCGACGCCG ACGGCCATGC GCTCCATGCG
GCGGCGGCGG TCAGTGCGGG GGCGCGGCTC AGCGTCGAAT TCGCCGACGG CCGCGTCGGC
GTCACCGCCG ATGGCGGCGG CGCGACGCGA CCGGACATTG CCAAGCCCGC CACGCCAGCA
GCGAAGCCGG CGCCGAAGCG GGTGTCGAAG CCGATCGATC AGGGCTCGCT GTTTTAG
 
Protein sequence
MARLLAPETL ANVGEFTVSE LSQALKRTVE DSYGHVRVRG EISGFRGAHS SGHCYFALKD 
ESAKIEAVIW KGVAGRMRFK PQEGLEVIAT GKLTTYPGSS KYQIVIEALE PAGVGALMAL
MEERKRKLGA EGLFDEARKQ LLPYLPEVIG VVTSPTGAVI RDILHRLEDR FPRRVLVWPV
KVQGEGSAEQ VAAAIHGFNA LPEGGPIPRP DLLIVARGGG SLEDLWSFNE EIVVRAAAES
MIPLISAVGH ETDVTLIDFA ADKRAPTPTA AAEMAVPVRA ELFVEVQSYA RRMMLCWSRG
QDSRRNELRA AARALPAAGE LLAIPRQRLD GAASSLPRAL RANTHAHHRR YAKAASGITL
NVLRAQVSHS AQRLGSTGER LKHCTRATLR HRRDRFESLA IRLQASKLAN EQAQRMRIAR
ERERMLRLAE RARRAWATLR DRQQARLVQS GKLLTALSYR GVLARGFALV RDADGHALHA
AAAVSAGARL SVEFADGRVG VTADGGGATR PDIAKPATPA AKPAPKRVSK PIDQGSLF