Gene Phep_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2199 
Symbol 
ID8253305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2529389 
End bp2530783 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content48% 
IMG OID644935848 
ProductExodeoxyribonuclease VII 
Protein accessionYP_003092465 
Protein GI255532093 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.503662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTTC ACTCGTATCC ATCTAACCCT GCAGCCATCA GGCTCTCTGA ACTGGCCACA 
CAGATCAGAC AGGCCATAGA TGGGGTATTC GGCAACCGTA CCTTCTGGGT TATTGCCGAT
ATAAGCAGCC ATACCTATAA ATCCCAAACC AATTACCATT ATTTTGAACT GGTAGAGAAA
GACAAAGGTT CTGCAAAGAT CCTCAGCAAA ATAGCCGGCA GGGCCTGGGG AAATGCATCG
GTTAACATTT CCAATTTTGA ACAGGCAACC GGACAAAAGT TCCAGAACGA CATCAATGTG
CTGGTACAGG TAGCTGTGCA GTACAACCCG GCATTCGGCC TGCAGTTAAA CCTGCTTGAT
ATAGATACCA GCTTTACCCT GGGCCTTTTT GAACAGCAGC GCAAAGAAAC CCTGGCCCGC
CTGCTCCGGG AAAATCCGGA TTTTATACAA AAAGCAGGTG AACAGTATTT CACCCGCAAT
GGCAGCCTGG CCCTAAACCG GGTACTCCAG CGCATAGCTG TCATTTCTTC GGCCACTTCT
GCGGGTTATC AGGATTTTAA ACATACACTG GAAAACAATA CCTTTGGCTA CCGTTTTATA
ATAGATGATT ATTTTACGCC GGTGCAGGGC GAAGCCAATG CCAGACAGTT CCTGGCTAAA
ATTGTAGAGG TATTCCAGTC GGGCAAACCT TATGATACCC TGGTCATCAT CAGGGGCGGA
GGGGCACAGA CCGACTTTCT CATCTTCGAC AATTACGAGC TCAGCCGGGC CATCGCCAAA
TTCCCTGTAC CGGTAATTAC AGGTATCGGG CACCAGAAAA ATGAGACCAT TGCCGATTTA
ATGGCACATA CGGCAACAAA AACGCCCACA AAAGCAGCCG AACTGATCAT TGCACACAAC
AGGGCCTTTG AAGAAAACCT GCTGGGCATG CAAAAAATGA TGCTCATTAA AACCTATCAG
CTCATCAATT ACCATAAGGA CCGGCTCGCA CAACTCAACC AAACCACCAT CAGTACCAGC
AGGGGCCTGC TGCATGAGCA GCAGCTCCAC ATCATCAGCC TCTCGCGGAT GGTACTTGGC
AACCCCAGGA TCATCCTTTC CAACCGGCAA AAAGACCTCA GCAACCTCAT CGGCAATATG
CAATCCTACA ACCGGATGTA TTTTGCCAAT AAAAGAGGGT ATATCGGTCA TTTTCAGTCG
GTAGTGCGAC TCATGAGTCC CCAGAACATC TTAAACAAGG GTTTTGCCAT CCTTAAAGTA
AAGGGCAAAA TCGCCGGCAA TGCCGAAGAT ATTGAGGCAG GCACCGAACT TACCGTACGC
CTGGCGGCTA CAGAAATTAA AACTACAGTA ATTACCAAAT CAGCAATAGA TGGAAACGAC
GAATTTAACC TATGA
 
Protein sequence
MEVHSYPSNP AAIRLSELAT QIRQAIDGVF GNRTFWVIAD ISSHTYKSQT NYHYFELVEK 
DKGSAKILSK IAGRAWGNAS VNISNFEQAT GQKFQNDINV LVQVAVQYNP AFGLQLNLLD
IDTSFTLGLF EQQRKETLAR LLRENPDFIQ KAGEQYFTRN GSLALNRVLQ RIAVISSATS
AGYQDFKHTL ENNTFGYRFI IDDYFTPVQG EANARQFLAK IVEVFQSGKP YDTLVIIRGG
GAQTDFLIFD NYELSRAIAK FPVPVITGIG HQKNETIADL MAHTATKTPT KAAELIIAHN
RAFEENLLGM QKMMLIKTYQ LINYHKDRLA QLNQTTISTS RGLLHEQQLH IISLSRMVLG
NPRIILSNRQ KDLSNLIGNM QSYNRMYFAN KRGYIGHFQS VVRLMSPQNI LNKGFAILKV
KGKIAGNAED IEAGTELTVR LAATEIKTTV ITKSAIDGND EFNL