Gene Paes_2398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2398 
Symbol 
ID6459042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011061 
Strand
Start bp55595 
End bp58642 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content48% 
IMG OID642729126 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002019740 
Protein GI194335174 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.632036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTCA AGGAAAGTGA TATAGAGCAG AAACTAATCG CAAAGCTCGA GGAGCTGAAA 
TATACCTATC GCTCCGACAT CCGCGACAAG GTCACGCTGG AGCAAAACTT CCGAGAGAAG
TTCGAGGCGC TGAACCGTGT CCATTTGACT GACGCTGAAT TTGCGCGCTT GCGTGATGAG
ATCATCAATG CTGATGTGTT TCAGGCAGCT AAGACCTTGC GTGAATACGG GCACCTCCAG
CGGGAAGATG GTACTCCGCT GGACTACATG CTTGTCAACA TCAGGGACTG GTGCAAAAAC
GACTTTGAAG TCATCAACCA GCTACGCATC AACACTGATA ACAGCCATCA CCGCTATGAT
GTCATTCTAT TGATCAATGG CCTGCCTGTT GTGCAGATAG AGTTGAAAAC CATCGGGATC
AATCCACGCC GTGCGATGGA GCAGATTGTT GACTATAAAA ACGATCCCGG TAACGGTTAT
ACCAATACGC TGCTCTGTTT CATGCAACTG TTTATTGTCA GCAACCGAAG CAATACCTAT
TACTTTGCCA ACAACCACAG CCAGCACTTT GCCTTCAATG CTGATGAGCG TTTTTTGCCT
ATCTACCAGC TGGCCAGTGA AGACAATATA AAGATCACCC ATCTGGATGA CTTTGCAGAT
ATTTTCTTTG CCAAATGCAC ATTAGGCCAA TTGATTAGCC GTTATATGGT GCTTGTTGTC
AGCGAGCAAA AGTTGATGAT CATGCGGCCG TATCAAATCT ATGCGGTCAA GGCCATTGTC
GATTGTATCC ACCAGAACCG GGGCAATGGT TATATCTGGC ATACCACTGG AAGTGGCAAG
ACGCTGACTT CATTCAAGGC ATCCACCCTG CTCAAGGATA ATCCTGATAT TGAAAAATGC
CTGTTTGTGG TTGATCGCAA GGACCTCGAT CGGCAAACCC GTGAGGAATT CAACAAGTTT
CAGGAAGGTT GTGTAGAGGA AAATACCAAC ACCGAGACCC TGGTTCGCCG ACTGCTCTCA
GAAGACTACG CTGACAAGGT CATCGTTACC ACCATCCAGA AGCTTGGACT TGCATTGGAT
GAAAACAGCA GGCGCAACCA GCAGCACAAA GAGAAGGGGA AGCTTACGTA TAAAGAGCGA
CTTGAGCCAT TGCGCGACAA GCGTATTGTT TTCATCTTCG ACGAATGTCA CCGTTCCCAG
TTTGGTGAAA ACCACCGGGC CATCAAGACC TTCTTTCCCA AAGCGCAACT CTTTGGTTTT
ACCGGTACCC CTATATTTCA AGACAATGCC TCCTACAAGA AGATCGATGG CACAGTAGGG
TCATACCGTA CCACCAAAGA CATCTTTGAA AAAGAGTTGC ACGCTTATAC TATTACCCAT
GCTATTGACG ATCGAAACGT GCTGCGTTTT CATATCGACT ATTTCAAGCC TGAAACCGCG
CAAACCAGAG CGGAAGAAGC TGGTAAGGTT AAAAAGCAGA ATAAGATAAA GTCTGGTGAG
GCTGTTACAC AGCAGGCTGT GGTCGAGGCG ATTCTGGACA AACATAATGC GGCAACAAAC
CAACGCCGTT TCAATGCCTT GCTTGCTACC GCTTCAATTA ACCATGCGAT TGACTATTAC
GAACTGTTCA AAGAGTTGCA GGCTGAACAT CAGGAACAAG ATCCGACGTT CAAGCCACTG
AATATCGCAT GTGTGTTTTC ACCTCCAGCT GAAGGCAATA AAGACATCAA GCAATTGCAA
GAAGACCTGC CACAGGAAAA GATTGACAAT CAGCAGGACC CGGAAAAAAA GAAATCTGCG
CTTAAAGCTA TCATTGCCGA TTACAACAAG AACTACGGAA CCAACCACAA TATCAACGAA
TTCGATCTTT ACTACCAGGA TGTGCAGAAG CGCATCAAGG ACCAGAAATA TTCCAACAGA
GATTATCCGC ATACAAACAA GATCGATATC ACTATCGTTG TAGACATGCT GCTTACCGGG
TTTGACTCAC AGTATCTCAA TACTCTGTAT GTAGATAAAA ACCTCAAACA CCATGGGCTG
ATTCAGGCGT TTTCACGTAC AAACCGCATA CTTAACGACA CAAAGCCCTA TGGCAATATT
CTCGACTTTC GTGGGCAGCA AACGGCGGTT GATAATGCCA TCGCACTCTT TTCCGGTGAA
GACAACAGCC GTGCCAGGGA GATCTGGCTG GTTGATCCGG CCCCTGAGAT GGTGAAAAAA
CTCGGGAGTG CAGTTACTGA GCTGCAAACC TTCATGGAAA CCCAAGGCTT GCCATGCACG
CCGGAAGCAG TTGCCAATCT CAAGGGCGAC AGCGCCCGAG CCGAGTTCAT CAACCGCTTC
AAGGAGGTCC AGCGCTTCAA AACCCAGCTT GACCAATACA CCGATCTTGA CGAGGAGCAA
CAACAAGCCA TCGAGCAAAT TCTCCCGGAA GAACAAGCGC GAGGCTTCAA GGGAGCGTAT
CTTGAAACAG CCCAGCGGCT CAAAGCTCAG CAAGGCAAGG GTGGCGTTGA GGCCGGTGAT
GAGCAGGCAA GCATCGATCA GCTCGATTTT GAGTTTGTGC TTTTTTCCTC CGCACTGATC
GATTACGATT ACATCATGGG TCTCATTGCC CGATACAGCC AGAATAAACC GGGTAAGCAG
AACATGAGCC GTGAGCAGTT GGTGAACATG CTTTCAGCCA GCAGCAATCT CATGGATGAA
CGCCAAGATA TCATCGACTA CATCAACAGC CTGCAGGCAG GCGAAGGGCT GAGTGAGGAA
GCTATCCGAG ACGGTTATCA GGTTTTCAAA GCACAAAAAG CTGACAGTGA GCTCAGCTCG
ATGGCAGACA AGCACGGTGT AGAGACCGAG TCGCTGAAAA CCTTCGTCGA AGCCATTCTC
GATCGCATGA TCTTCGACGG CGAACAACTC AGCGACCTGT TCGAGCCGCT GGAGCTTGGC
TGGAAAGCAC GTAGCAAGGC CGAGCTGGCA CTCATGGAGG ATCTGGTGCC ATTTTTAAAA
AAACAGGCCC AAGGCCGCGA AATATCAGGA TTAGCAGCAT ATGAGTAA
 
Protein sequence
MTLKESDIEQ KLIAKLEELK YTYRSDIRDK VTLEQNFREK FEALNRVHLT DAEFARLRDE 
IINADVFQAA KTLREYGHLQ REDGTPLDYM LVNIRDWCKN DFEVINQLRI NTDNSHHRYD
VILLINGLPV VQIELKTIGI NPRRAMEQIV DYKNDPGNGY TNTLLCFMQL FIVSNRSNTY
YFANNHSQHF AFNADERFLP IYQLASEDNI KITHLDDFAD IFFAKCTLGQ LISRYMVLVV
SEQKLMIMRP YQIYAVKAIV DCIHQNRGNG YIWHTTGSGK TLTSFKASTL LKDNPDIEKC
LFVVDRKDLD RQTREEFNKF QEGCVEENTN TETLVRRLLS EDYADKVIVT TIQKLGLALD
ENSRRNQQHK EKGKLTYKER LEPLRDKRIV FIFDECHRSQ FGENHRAIKT FFPKAQLFGF
TGTPIFQDNA SYKKIDGTVG SYRTTKDIFE KELHAYTITH AIDDRNVLRF HIDYFKPETA
QTRAEEAGKV KKQNKIKSGE AVTQQAVVEA ILDKHNAATN QRRFNALLAT ASINHAIDYY
ELFKELQAEH QEQDPTFKPL NIACVFSPPA EGNKDIKQLQ EDLPQEKIDN QQDPEKKKSA
LKAIIADYNK NYGTNHNINE FDLYYQDVQK RIKDQKYSNR DYPHTNKIDI TIVVDMLLTG
FDSQYLNTLY VDKNLKHHGL IQAFSRTNRI LNDTKPYGNI LDFRGQQTAV DNAIALFSGE
DNSRAREIWL VDPAPEMVKK LGSAVTELQT FMETQGLPCT PEAVANLKGD SARAEFINRF
KEVQRFKTQL DQYTDLDEEQ QQAIEQILPE EQARGFKGAY LETAQRLKAQ QGKGGVEAGD
EQASIDQLDF EFVLFSSALI DYDYIMGLIA RYSQNKPGKQ NMSREQLVNM LSASSNLMDE
RQDIIDYINS LQAGEGLSEE AIRDGYQVFK AQKADSELSS MADKHGVETE SLKTFVEAIL
DRMIFDGEQL SDLFEPLELG WKARSKAELA LMEDLVPFLK KQAQGREISG LAAYE