Gene Veis_3553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_3553 
Symbol 
ID4691476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp3927676 
End bp3930630 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content52% 
IMG OID639851308 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_998289 
Protein GI121610482 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAA ACCAGATTGA GCAAGACCTC ATTGCCAAGC TACGCGAGCT TAAATACACC 
TATCGGTCGG AGATTTGTGA TAGAGCCTCG CTGGAGGCAA ACTTCCGCGA AAAATTTCAA
AAACTGAATC GTGTTCAGTT GAGCGACAGC GAATTTGATC GACTGAAGGA CGACCTGGTT
ACGGCAGATG TTTTCGCGGC ATCCCGGCGC CTGCGTGAGC CCGACAACTT CGAGCGCGAC
GATGGCACGC CGCTGTTCTA CACCTTGGTC AACACCAAGG ACTGGTGCAA AAACACCTTC
GAGGTAGTCA ATCAGCTCCG CATGAACACC GACAACAGCC ACCACCGCTA TGATGTGATT
CTGCTTATCA ATGGCGTGCC GGTAGTGCAA ATCGAGTTGA AAGCCCTGGG CATCAGCCCG
CGCCGGGCCA TGCAACAGAT TGTCGATTAC AAGAACGACC CAGGCAATGG CTACGGCAAA
AGTCTGCTTT GTTTCCTGCA ACTGTTCATC GTCAGCAATA GCACCAATAC CGGGTACTTT
GCCAACAACA ACAGTCGCCA CTTCAGCTTT AATGCCGACG AGCGCTTTTT ACCGTTCTAT
CAATTTGCTG ACAAAAATAA CAAGAAAATC ACCCATCTGG ACCAGTTCGC TGAAACGTTT
CTCGCCAAAT GCACCTTGGG CCAGATGATC AGTCGCTATA TGGTGCTGGT CGCCAGCGAA
CAGAAGTTGC TGATGATGCG CCCCTACCAG ATCTATGCTG TCGAGGCCAT TGTGGAATGC
ATCGACCAGA ACAGTGGCAA CGGTTACATC TGGCACACCA CGGGCAGCGG TAAAACGCTC
ACCTCCTTCA AGGCATCCAC CCTGCTCAAG GACAAGCGGG ATATAGACAA ATGCCTGTTC
GTGGTGGACA GAAAAGACCT CGATCGGCAA ACGCGCGAAG AATTCAACAA ATTTCAGGAA
GGCTGCGTCG AAGAAAACAC CAACACCGGA ACCCTGGTGC GGCGGCTGCT TTCCGATGAC
TATGCCGACA AGGTCATCGT CACCACCATT CAAAAACTTG GCTTGGCGCT GGATGACAGC
AAAAACTACA AGGAACGGCT GGCACCGCTA CGCAACAAGC GTGTCGTATT CATCTTCGAC
GAATGCCACC GCTCGCAATT CGGCGAGAAC CATGAGGCCA TCAAGGAATT TTTCCCCAAG
GCTCAATTGT TCGGCTTTAC CGGAACGCCC ATCTTCCGCG ACAACGCCAT TTACCAGCAA
GTTGAAGGCC AGCAGGCATC TTACAAAACC ACGAAAGACA TCTTTCAGCG GGAATTGCAC
GCATACACCA TCACCCACGC CATCGCCGAT TGCAACGTAC TTTGCTTCCA TATCGACCAT
TACAAGCCCC AAGGCAAGAA CAAGCCCAAG CCGGGCGAGA GTGTGCAGAA AAAGAAAGTC
ATCGAAACCA TTTTGAACAA ACACGACGCT GCCACCGCCG GTCGCAAGTT CAATGCCATG
TTGGCCACGG CTTCGATCAA CGACGCCATT GAATATCATG CCCTGTTCAA AGACATTCAA
GCCAAGAAGC AGGCCGCAGA CGCCTATTTT CAGCCACTGA ATATCGCTTG CGTGTTTTCG
CCGCCGGCCG AAGGCAACAA AGACGTGCAG CAGATTCAGG AAGACCTGCC GCAAGAAAAA
GCTGACAACG ATGAAGCATC GGACCAGAAA AAAGCCGCGC TCAAGGTCAT CATTGCAGAT
TACAACGCAG GCTTTGGCAC CAACCACAGC ATTGGCGAAT TCGATCGTTA CTATCAGGAT
GTGCAAAAGC GCATCAAGGA CCAGCAATAC CCCAACCAGG ACTTGCCCCA CAAGCAAAAG
ATTGACGTGA CGATCGTGGT GGATATGCTG CTCACCGGTT TTGATTCCAA ATACCTCAAC
ACCCTCTATG TCGACAAGAA TCTCAAGTAC CATGGGCTGA TTCAGGCGTT CTCACGCACC
AACCGCATGC TGAACGATAC CAAGCCATAC GGCAATATCC TGGACTTTCG CCAGCAACGG
GAGGCCGTCG ATGAGGCTAT CATCCTGTTC TCCGGCGAAA GCCTCGACCG CGCCAAAGAA
ATCTGGCTGG TAGATGCTGC CCCCACCGTG ATTGGCAAAC TCGATAGCGC TGTCACCGAG
TTACAGAAAT TTATGCAAAG CCAAGGACTT TCCTACGAGC CGGAGGATGT GAACAATCTC
AAGGGCGACG CAGCGCATGT CCAGTTCATC AAGCTGTTCA AAGAAGTGCA GCGCTTAAAA
ACCCAGCTTG ACCAATACAC CGATCTGAGC GTCGAGCACA AACAAAGCAT CGAAGCCCTG
CTGCCCGAAG GCACCTTGCG CGCCTTCAAG GGCATGTATC TGGAAACCGC CCAGCGCCTG
AAGAAAAAAC AGGGCAGCGA TGTGGCCAGT GACGAAGTGC GGCAACTCGA TTTTGAGTTC
GTGTTGTTTG CCTCTGCGAT GATTGACTAC GACTACATCA TGGGCCTGAT TGCCCGCTAC
TCGCAGCAAA CACCGGATAA GCGCGAAATG TCCCGCCCCC AATTGATTGG TCTGATCCAG
TCCGATGCCA AATTCATCGA CGAGCGTGAT GACATCGTTG CCTATATCGA CACCCTGCAA
GTGGGCAAGG GATTGAGTGA AGAAGACATT CGCCACGGTT TCGAGCGCTT CAAGGCAGAA
AAGAGTGCGA GAGAATTGGT CGGGATTGCC GAGAAGAACG GATTGGACAG CACTGCCCTG
CAAACCTTTG TGGAAGGCAT TCTGCGCCGC ATGATTTTCG ATGGGGAACA ATTGACTGAC
CTGCTGGCCT CGCTGGAGTT GGGTTGGAAA ATCAGGCGGC AAAAAGAACT GGCGTTGATG
GAAGACCTGA TTCCGTTGTT GCACAAACGC GCTCAAGGGC GGGAGATTTC AGGGTTGGCG
GCGTATGAGC AATAG
 
Protein sequence
MTENQIEQDL IAKLRELKYT YRSEICDRAS LEANFREKFQ KLNRVQLSDS EFDRLKDDLV 
TADVFAASRR LREPDNFERD DGTPLFYTLV NTKDWCKNTF EVVNQLRMNT DNSHHRYDVI
LLINGVPVVQ IELKALGISP RRAMQQIVDY KNDPGNGYGK SLLCFLQLFI VSNSTNTGYF
ANNNSRHFSF NADERFLPFY QFADKNNKKI THLDQFAETF LAKCTLGQMI SRYMVLVASE
QKLLMMRPYQ IYAVEAIVEC IDQNSGNGYI WHTTGSGKTL TSFKASTLLK DKRDIDKCLF
VVDRKDLDRQ TREEFNKFQE GCVEENTNTG TLVRRLLSDD YADKVIVTTI QKLGLALDDS
KNYKERLAPL RNKRVVFIFD ECHRSQFGEN HEAIKEFFPK AQLFGFTGTP IFRDNAIYQQ
VEGQQASYKT TKDIFQRELH AYTITHAIAD CNVLCFHIDH YKPQGKNKPK PGESVQKKKV
IETILNKHDA ATAGRKFNAM LATASINDAI EYHALFKDIQ AKKQAADAYF QPLNIACVFS
PPAEGNKDVQ QIQEDLPQEK ADNDEASDQK KAALKVIIAD YNAGFGTNHS IGEFDRYYQD
VQKRIKDQQY PNQDLPHKQK IDVTIVVDML LTGFDSKYLN TLYVDKNLKY HGLIQAFSRT
NRMLNDTKPY GNILDFRQQR EAVDEAIILF SGESLDRAKE IWLVDAAPTV IGKLDSAVTE
LQKFMQSQGL SYEPEDVNNL KGDAAHVQFI KLFKEVQRLK TQLDQYTDLS VEHKQSIEAL
LPEGTLRAFK GMYLETAQRL KKKQGSDVAS DEVRQLDFEF VLFASAMIDY DYIMGLIARY
SQQTPDKREM SRPQLIGLIQ SDAKFIDERD DIVAYIDTLQ VGKGLSEEDI RHGFERFKAE
KSARELVGIA EKNGLDSTAL QTFVEGILRR MIFDGEQLTD LLASLELGWK IRRQKELALM
EDLIPLLHKR AQGREISGLA AYEQ