Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3013 |
Symbol | |
ID | 4693804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 3360938 |
End bp | 3364150 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639850771 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_997764 |
Protein GI | 121609957 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.36517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTTT TATCTGAAGC CGCTATTGAG CTGGCACTGC TAGAGCAATT GCGGGGGCTG GGCTACCGCA TCGAGCAGGA AAAAGATATC GGCCCCGATG GTCCGCGTCC GGAACGCGAA AGCCACGATG AGGTCGTGCT CAAAAAGCGC TTCGCGGATG CTGTGGCGCG GCTCAATCCT GGCCTGCCGC TGGAGGCGCG GCAGGTTGCC ATGCGCCGGC TGATGCAGTC GGAGTTGCCA ACGCTGCTCG AAGAAAACCG CCGCCTGCAC AAACTGATGA CCGATGGGGT GGAGGTCGAG TACTACGCCG AGGATGGCAC GCTGACCACG GGCACGGTCG CGCTCATTGA CTTCAAGCGG CCAGAGCAGA ACGATTGGCT GGCGGTCAGT CAATTCGTCG TGATTGCCGG GCAATATCAG CGCCGCCCGG ATGTGGTGGT GTTTGTGAAT GGCCTGCCGC TGGTGGTGAT CGAGTTGAAG GCGCTAGGCA ATGCGACTTT GCTCGCGGCC TTCCAGCAGT TGCAGAACTA TAAGCAGCAG ATTGCGGCAT TGTTCAATAC CAATGCGCTG CTGGTGACTT CGGATGGTGT CCATGCCCGT GTTGGTTCGC TGTCTGCCGA TCAGGAGCGC TTTATGCCTT GGCGCACCAC GGATGGCACG GCGATTACGC CAAAGGGCAA ACCGGAGTTG CCAACCTTGA TCGAGGGCGT GTTTGCGCAT CGCCGCCTGC TCGACCTGTT GCGCCACTTT ACGGTGTTTG ATGAAACGGG TTCAGGCTTG GTCAAGATTA TTGCTGGCTA TCATCAGTTC CACGCCGTCA TGCGTGCGGT CGATTCAACC TTGCGCGCAT CGAGCCAATG GCAGGGGGTG CAAGAAGACC CACGCGATTA CGGCTGGTCC AGTGTCAAAA CCCAGGCCAA AGGTGACAGA AGGGCCGGCG TGATCTGGCA TACCCAAGGC TCAGGCAAAA GCCTGTTGAT GGTGTTCTAC GCCAGGCAAC TGGCACAGCA TGTGGCCATG GCGAATCCAA CCCTGGTCGT GCTGACCGAC CGCAACGACC TTGACGATCA ACTTTTTGCC ACTTTCTCCA GGTGCCGCGA TTTCATTCGG CAAACACCGG TGCAGGCCAA CAGCCGGGAG GATTTGCAAA AACTGTTGAT GCGCGCATCC GGCGGTGTGA TTTTTACCAC CTTGCAAAAG TTTGCCGAGT CAAGTCAGCC GCTGACCGAG CGTCACAATG TGGTGGTGAT TGCTGATGAG GCGCACCGCA GCCAATATGG TTTTCGCGCC AAACTCGATG CCAGGACGGG CGGGGTTTCC TACGGCTTCG CCAAATACCT GCGCGATGCC CTGCCTCATG CCTCATTCAT CGGCTTTACC GGCACGCCGA TCGAAGCCAG TGGTGTGAAC ACCCGGGCGG TGTTTGGCAA TTACATCGAT ATCTATGACA TCAGCCGTGC CGTCGAGGAT GGCGCGACGG TGCTCATCTA TTATGAATCA CGGCTTGCGC GCATCGAACT CGATGGGGAT GAGAGGCACA AAATCGATGC CGAAGTGGAT GCGATGATCG AGGACGACTC CGAAGCCGAA CAAGAGCGCT TCAAGAAAGA ATGGTCAACC GTGGAAGCCC TGGTGGGCAG CGATAAACGC CTCGCGCTGA TTGCGCAAGA CATGGTGACG CACTTCGAAG CGCGCATGGC CGCACTGGAT GGCAAGGCAA TGGCGGTGTG CATGAGTCGC CATGTCTGCG TGAGACTGTA CGAGGAAATC GTGAAACTGC GCCCCGGCTG GCACAGCGCC GACGACAAGG CGGGCGCGCT CAAAATCGTG ATGACGGGTG CCGCCAGCGA CCCCGCAGAA TGGCAACAGC ATGTCGGTAA CAAGGCGCGC CGCGATGTGC TGGCCAAGCG CGCTCGCGAC ATCAGGGACC CGCTCAAACT GGTGATTGTG TGCGATATGT GGCTCACCGG TTTTGATGCG CCCTGCATGC ACACGATGTA TGTGGACAAG CCCATGCAAG GGCATGGCCT GATGCAGGCG ATTGCGCGGG TCAATCGCGT GTTCCGCGAC AAACCCGCCG GATTGATCGT GGACTATATC GGCATCGCCC ATTATCTTAA ATCGGCGCTG CGACAATACT CGAAAAGCGA CCGGAAAAAT ACCGGTATCG ATGAATCCAA GGCCGTCGCG ATGATGATGG AGAAGTACGA AATCGTGCGC AACATGTACC ACGGTTTCGA TTACACGTTG GATGGAACAG CGCAAGAACG CCTGGTCATG ATGGCCGGCG CCATCGAGTG GATTCTCGAC ATGCAGCAAA AATCGGCGGC CAGGGAGAGC AGCCGGGAAG GCAAGAAGAC TGCGCATCGC CGCTATCAGG ATGCAGTGCT GGCGCTGTCC AAGGCTTTTT CCCTGGTGTC TGCTTCTGAC GAAGCGCGCA CCATTCGGGA AGAAGTCGGC TTTTTCCAGG CGATTCGCGA TGCACTGGTG AAAAGCAGTG GCGCAATGGG CGTGATGGGG CAAGAACGCG AATTGGCCAT TCGGCAAATC GTGAGCCGCG CGGTGATTTC TACCGAAATC GTCGACATTC TGGCCGCAGT CGGCATCAAG AGCCCGGATA TTTCCATCCT GTCCGACCAA TTTCTGACCG AAATCGCGCA GATGGAGCGC AAGAATCTCG CGCTGGAGGC CTTGCGCAAG TTGATCAATG ACGACATCCG TTCACGCAGC AAAGCCAACG TGGTGCAAAG CAAAGCCTTC TCGGAACGGC TGGAAGAGGC CATGGCGCGC TACCACGCCA ATGCCATCAC GACCGCCGAA GTGCTGCAAG AGTTGATCAC CCTGGCCAAG GACATTCGCG CCGAGCGCCA GCGCGGGGAA GAGTCCGGTT TGTCGGATGA GGAGATTGCT TTCTATGACG CGTTAGCCGA AAACGATAGC GCGCTGCAGA TGATGGGCGA TGACAAACTG CGGCTGATTG CCCACGAATT GCTGGTCAGC CTGCGCAACA ACATATCGGT GGATTGGGCG CACCGCGAAT CCGCCCGCGC GCAAATGCGC ATGCTGGTCA AGCGCATTCT GCGCAAATAT GGCTATCCGC CGGATTTGCA GGACGAGGCC GTGCGAACGG TATTGCGACA GGCTGAGGTG TCGATCAGGA GCATGTCCGG CAATGCGCTC GATGAAAATC TGAAGGGCTT CCGCCTGAAA TAA
|
Protein sequence | MAFLSEAAIE LALLEQLRGL GYRIEQEKDI GPDGPRPERE SHDEVVLKKR FADAVARLNP GLPLEARQVA MRRLMQSELP TLLEENRRLH KLMTDGVEVE YYAEDGTLTT GTVALIDFKR PEQNDWLAVS QFVVIAGQYQ RRPDVVVFVN GLPLVVIELK ALGNATLLAA FQQLQNYKQQ IAALFNTNAL LVTSDGVHAR VGSLSADQER FMPWRTTDGT AITPKGKPEL PTLIEGVFAH RRLLDLLRHF TVFDETGSGL VKIIAGYHQF HAVMRAVDST LRASSQWQGV QEDPRDYGWS SVKTQAKGDR RAGVIWHTQG SGKSLLMVFY ARQLAQHVAM ANPTLVVLTD RNDLDDQLFA TFSRCRDFIR QTPVQANSRE DLQKLLMRAS GGVIFTTLQK FAESSQPLTE RHNVVVIADE AHRSQYGFRA KLDARTGGVS YGFAKYLRDA LPHASFIGFT GTPIEASGVN TRAVFGNYID IYDISRAVED GATVLIYYES RLARIELDGD ERHKIDAEVD AMIEDDSEAE QERFKKEWST VEALVGSDKR LALIAQDMVT HFEARMAALD GKAMAVCMSR HVCVRLYEEI VKLRPGWHSA DDKAGALKIV MTGAASDPAE WQQHVGNKAR RDVLAKRARD IRDPLKLVIV CDMWLTGFDA PCMHTMYVDK PMQGHGLMQA IARVNRVFRD KPAGLIVDYI GIAHYLKSAL RQYSKSDRKN TGIDESKAVA MMMEKYEIVR NMYHGFDYTL DGTAQERLVM MAGAIEWILD MQQKSAARES SREGKKTAHR RYQDAVLALS KAFSLVSASD EARTIREEVG FFQAIRDALV KSSGAMGVMG QERELAIRQI VSRAVISTEI VDILAAVGIK SPDISILSDQ FLTEIAQMER KNLALEALRK LINDDIRSRS KANVVQSKAF SERLEEAMAR YHANAITTAE VLQELITLAK DIRAERQRGE ESGLSDEEIA FYDALAENDS ALQMMGDDKL RLIAHELLVS LRNNISVDWA HRESARAQMR MLVKRILRKY GYPPDLQDEA VRTVLRQAEV SIRSMSGNAL DENLKGFRLK
|
| |