Gene RPD_4105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4105 
SymbolxseA 
ID4024627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4569989 
End bp4571599 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content70% 
IMG OID637964313 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_571225 
Protein GI91978566 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAC TGCTCGCCCC CGAAACCCTC GCCAATGTCG GCGAATTCAC CGTCTCCGAA 
CTGTCGCAGG CGCTGAAGCG GACGGTCGAG GACAGCTATG GCCATGTCCG GGTGCGCGGC
GAAATCTCCG GGTTCCGCGG CGCGCATTCG TCCGGGCATT GCTATTTCGC GCTGAAGGAC
GAGAGCGCCA AGATCGAGGC GGTGATCTGG AAGGGCGTGG CGGGGCGGAT GCGGTTCAAG
CCGCAGGAAG GCCTCGAGGT CATCGCCACC GGCAAGCTCA CCACCTATCC GGGCTCGTCG
AAATATCAGA TCGTGATCGA GGCGCTGGAG CCCGCCGGCG TCGGAGCGCT GATGGCGCTG
ATGGAAGAGC GCAAGAAGAA GCTCGGCGCC GAGGGCCTGT TCGACGAAGC GCGCAAGCAG
CTTTTGCCCT GGCTGCCGGA CGTGATCGGC GTGGTCACCT CGCCGACCGG CGCGGTGATC
CGCGACATTC TCCACCGGCT GGAAGACCGC TTCCCGCGCC GGGTCCTGGT GTGGCCGGTG
AAGGTCCAGG GCGAAGGCTC GGCCGAACAG GTCGCGGCCG CGATCCACGG CTTCAACGCG
CTGCCCGAGG GCGGCCCGAT CCCTCGGCCC GATCTGTTGA TCGTCGCGCG CGGCGGCGGC
TCGCTCGAGG ATCTGTGGTC GTTCAACGAG GAAATCGTGG TGCGCGCCGC GGCGGAAAGC
ATGATCCCGC TGATCTCCGC GGTCGGTCAC GAGACCGACG TGACGCTGAT CGATTTCGCC
GCCGACAAGC GCGCGCCGAC GCCGACCGCC GCCGCCGAAA TGGCGGTGCC GGTGCGTGCC
GAACTGTTCG TCGAGGTGCA GAGCTTTTCG CGGCGGATGA TGCTGTGCTG GACGCGCGGT
CAGGATTCCC GCCGCAACGA ACTCCGCGCC GCCGCCCGCG CTCTGCCGGC CGCAAGCGAA
CTGCTCGCGA TCCCGCGGCA ACGGCTCGAC ACGGCGGCGG CGGGGCTGCC GCGGGCGCTG
CGCGCCAATA CGCATGCGCA TCATCGCCGC TTCGCCAAGG CCGCGGCGGG CATCACCCTC
AACGTTCTGC GGGCGCAGGT CAGCCACAGC GCGCAGCGGC TCGGCATCAC CGGCGAACGG
CTGAAGCATG GCGCCCGCGC CACGCTGCGC CATCGCCGCG ACCGCTTCGA CGGTCTCGCG
ATCCGGCTGC AGGCCTCGAA ACTCGCCAAT GAGCAGGCGC AGCGGATGCA GATCGCGCGC
GAGCGCGAGC GGATGCAGCG GCTCGCCGAG CGCGCGCGGC GTGCGCTGAC GACGCTGCTC
GATCGTCAGC AGGCGCGCCT GACGCAATCC GGCAAATTGC TGACCGCCCT GTCCTATCGC
GGCGTGCTGG CGCGCGGCTT CGCACTGGTG CGCGACGCCG ACGGCCACGC CGTCCATGCC
GCCGCAGCGG TGAGCGCCGG CGCGCAACTC AGCGTCGAAT TCGCCGACGG CCGCGTCAGC
GTCACGGCGG ATGGCGGCCA TGCCGGCGAA CCAGCAAAGC CGACGACGCC AGCGTCCAAA
CCAACGCAGA AGCGCACACC GAAGCCGGTC GATCAGGGGT CGCTGTTCTA G
 
Protein sequence
MARLLAPETL ANVGEFTVSE LSQALKRTVE DSYGHVRVRG EISGFRGAHS SGHCYFALKD 
ESAKIEAVIW KGVAGRMRFK PQEGLEVIAT GKLTTYPGSS KYQIVIEALE PAGVGALMAL
MEERKKKLGA EGLFDEARKQ LLPWLPDVIG VVTSPTGAVI RDILHRLEDR FPRRVLVWPV
KVQGEGSAEQ VAAAIHGFNA LPEGGPIPRP DLLIVARGGG SLEDLWSFNE EIVVRAAAES
MIPLISAVGH ETDVTLIDFA ADKRAPTPTA AAEMAVPVRA ELFVEVQSFS RRMMLCWTRG
QDSRRNELRA AARALPAASE LLAIPRQRLD TAAAGLPRAL RANTHAHHRR FAKAAAGITL
NVLRAQVSHS AQRLGITGER LKHGARATLR HRRDRFDGLA IRLQASKLAN EQAQRMQIAR
ERERMQRLAE RARRALTTLL DRQQARLTQS GKLLTALSYR GVLARGFALV RDADGHAVHA
AAAVSAGAQL SVEFADGRVS VTADGGHAGE PAKPTTPASK PTQKRTPKPV DQGSLF