Gene Acid345_3755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3755 
Symbol 
ID4069330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4435635 
End bp4438724 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content54% 
IMG OID637985777 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_592829 
Protein GI94970781 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0174443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTACTG TCTCCGCGTT CACCGAATCT GTGGTCGAAG AGGCGGCGCT TGCTTGGTTG 
GAAGGGCTGG GCTACACGGT ACTTTCCGGC CCAAGCATTT CTGCTGGAGA AATTGCCGCC
GAGCGGATCG AGGCTGATTA CTCCGACGTT ATTCTTGAGC GGCGCGTTAG ACAGACACTG
CAACGCCTCA ATCCTAATCT CCCACCAGAA GCGTTAGACG ACGCGTTCCG CAGGCTCACT
CACGACAATG AAACGTCTTT GATCGCTAAT AACCATGCCT TTCATCAGAT TTTGGTGAAT
GGCGTTACCG TTGAGTACAA CCGAGCCGAT GGTTCCATCG GCGGAGCCTT GGTGCGCGTT
ATTGACTTTG AGGCTGCCGA AAACAACGAC TGGCTCGCGG TGAACCAGTT CACGATCGTG
GAGGGACAGT ACACGCGGCG ACCTGACGTG CTGATATTTA TCAATGGTCT CCCGATCGCC
TTGCTGGAAC TCAAGAACGC TGCCGACGAA AATGCCACCA TTTGGACCGC GTTCAAGCAG
ATTCAGACGT ACAAATCCCA AATTCCATCG TTGTTCGCCT ACAACGAACT GCTCGTGATC
TCCGATGGTA TGGAGGCACG AGTCGGATCG CTCACTGCTG ATACAGAGCG GTTCATGCCT
TGGCGCACAA TCGGAGGCGA AGAGCTTGCC CCGACATCTC TGCCGCAGCT TCAGGTGTTA
ATCGAGGGGC TTTTCGAAAG GCGCCGTTTA CTTGACTTCA TACGTCACTT TGTCGTCTTC
GAAGACGAAG GACGCGGGGT GTTGATCAAG AAGATTGCTG GATATCATCA GTTCCACGCC
GTCAATAGGG CGTTAAGAGA GACGCTGTCC GCATCGGCGG AAGGCGGCGA TCGCAGAGTG
GGAGTGGTGT GGCACACACA GGGAAGCGGC AAGAGTCTCA CGATGGCTTT TTATGCCGGG
CGCCTTGTAC TTGAGCCGAC TCTTCAAAAT CCCACCATCG TCGTGATAAC AGATCGTAAC
GACTTGGACG ATCAACTTTA CGGGACATTC TCTCGCTGCC ATGAGTTGTT GAGACAGGCT
CCGCTACAAG CCCTGGATCG AGAGCACTTG CGGTCACTGG TTCAAGTGTC GTCCGGAGGC
GTGATCTTCA CAACAATCCA GAAATTCATG CCGGCGGAGG GCGACAAATA CCCGCAGCTT
ACAGATCGTC GCAACGTGAT CGTCGTCGCT GACGAGGCAC ATCGTAGTCA ATACGACTTT
ATAGATGGGT TTGCGCGGCA CATGAGGGAT GCCTTACCCC ACGCCTCATT TATTGGCTTC
ACAGGTACAC CCATCGAAAA GACCGATGTA AATACGCGCT CCGTCTTCGG CGATTACATC
AGCATCTACG ACATCCAGCG AGCCGTCGAA GATGGGGCGA CTGTGCCGAT CTACTACGAG
AGCCGTCTGG CGAAGCTCGA ACTCCCTGCT TCAGAAAAGC CCAAGGTCGA TACGGCTTTC
GAGGAAGCCA CAGAAGGCGA AGAGGTCGAT CGCAAAGAAA AGCTCAGGAC GAAATGGGCC
GCTGTGGAAG CCGTTGTGGG AACCGAGAAG CGGCTCAATA CGGTTGCTAA AGACCTCGTA
AAGCACTTCG AAGAGCGGCT AGAGGTCATG GACGGCAAAG CAATGATTGT CTGTATGAGC
CGCCGAATCT GCGTTGACCT GTACAACGAG ATTATGAAAC TTCGTCCAAC GTGGCACAGC
GAGAACGATG ATGCCGGAGT GCTCAAGATC GTGATGACGG GGTCAGCAAC GGACGGGCCC
GAGTGGCAAC AGCACATCAG GGACAAGCGA CGGCGTGAAG CGTTAGGTAA GAGATTTAAG
AATGCTAAAG ACCCATTCAA GATCGTGATC GTAAGGGATA TGTGGCTTAC AGGATTCGAC
GTTCCATCGT TACACACGAT GTACGCAGAC AAGCCGATGC GAGCGCATGG GCTGATGCAG
GCGATTGCCC GAGTCAACCG CGTGTTCAAA GACAAGCCCG GCGGACTGGT TGTGGACTAC
CTCGGCATCG CGCACGAACT CAAGCTTGCG CTGGCCACGT ACACAGAAAG CGGTGGAAAG
GGCCAGACGG CGCTGCCGCA GGAAGAGGCT GTCGCAAGGA TGATCGAGAA ATATGAAATC
TGTCGGGGAT TATTCCACGG GTTTGACTGG TCAATGTGGC TCACCGGGAC CTCCGCGCAG
AAGCTCGCGA TTCTTCCTAA CGCCCAGGAA CACGTCCTGA AACAGGAGGA CGGAAAGAAC
AGACTGTTGA AGGCGGTGTC AGATTTGTCA AAAGCATTCG CATTGGCTGT GCCCCACGAT
GCAGCTTTGA AGATTCGTGA CGATGTAGGC TTTTTCCAAG CAATCCGAAG CGTCTTGGCC
AAAACCGCTT CGCAACAGTA TCGTCCCGAG CAAGAGCTCG ATAACGCAAT TCGCCAAATC
ATATCCAGAG CGATGTCCTC TGGCGAAGTG GTGGATATTT TCGCAGCTGC GGGCCTTAAG
AAGCCCGATA TATCGATTCT TTCCGAAGAG TTCCTGAGAG AGGTCAGCAA CATCCCACAG
AAGAATCTCG CAGTAGAGCT GCTACGCAAG CTGCTCAATG ACGAGATCAA AAGTCGATCG
CGGCGCAACG TCGTGAAGGG CAAGCTCTTC TCCGCAATGT TGGAGAGCTC GATTCATAAG
TATCAGAACC GGGCTATTCA GGCCGCTCAA GTCATTGAGG AGTTAATCGG ACTCGCGAAG
GAGATGCGAG AAGAGTCGGC AAAAGGCGAG GAACTGGGCC TAAGCGAAGA CGAGGTGGCT
TTCTACGATG CACTCGAAGT CAATGACAGC GCCGTCAAAG TGCTTGGTGA ACCAACTTTG
AAGACTATTG CGCGCGAGCT TGTGGACACC GTTCGCCGAA ACACGACGAT CGATTGGGCC
GTGCGCGAGA ACATCCGTGC GAACCTGCGT GTGCTGGTGA AACGCATCCT GAGGAAGTAT
GGATACCCGC CAGACAAGCA GGAGAAGGCG ACCCTGACTG TGCTGGAGCA GGCGGAAGCG
TTGTCAGCGA ATTGGGCGGT GACTACGTAA
 
Protein sequence
MTTVSAFTES VVEEAALAWL EGLGYTVLSG PSISAGEIAA ERIEADYSDV ILERRVRQTL 
QRLNPNLPPE ALDDAFRRLT HDNETSLIAN NHAFHQILVN GVTVEYNRAD GSIGGALVRV
IDFEAAENND WLAVNQFTIV EGQYTRRPDV LIFINGLPIA LLELKNAADE NATIWTAFKQ
IQTYKSQIPS LFAYNELLVI SDGMEARVGS LTADTERFMP WRTIGGEELA PTSLPQLQVL
IEGLFERRRL LDFIRHFVVF EDEGRGVLIK KIAGYHQFHA VNRALRETLS ASAEGGDRRV
GVVWHTQGSG KSLTMAFYAG RLVLEPTLQN PTIVVITDRN DLDDQLYGTF SRCHELLRQA
PLQALDREHL RSLVQVSSGG VIFTTIQKFM PAEGDKYPQL TDRRNVIVVA DEAHRSQYDF
IDGFARHMRD ALPHASFIGF TGTPIEKTDV NTRSVFGDYI SIYDIQRAVE DGATVPIYYE
SRLAKLELPA SEKPKVDTAF EEATEGEEVD RKEKLRTKWA AVEAVVGTEK RLNTVAKDLV
KHFEERLEVM DGKAMIVCMS RRICVDLYNE IMKLRPTWHS ENDDAGVLKI VMTGSATDGP
EWQQHIRDKR RREALGKRFK NAKDPFKIVI VRDMWLTGFD VPSLHTMYAD KPMRAHGLMQ
AIARVNRVFK DKPGGLVVDY LGIAHELKLA LATYTESGGK GQTALPQEEA VARMIEKYEI
CRGLFHGFDW SMWLTGTSAQ KLAILPNAQE HVLKQEDGKN RLLKAVSDLS KAFALAVPHD
AALKIRDDVG FFQAIRSVLA KTASQQYRPE QELDNAIRQI ISRAMSSGEV VDIFAAAGLK
KPDISILSEE FLREVSNIPQ KNLAVELLRK LLNDEIKSRS RRNVVKGKLF SAMLESSIHK
YQNRAIQAAQ VIEELIGLAK EMREESAKGE ELGLSEDEVA FYDALEVNDS AVKVLGEPTL
KTIARELVDT VRRNTTIDWA VRENIRANLR VLVKRILRKY GYPPDKQEKA TLTVLEQAEA
LSANWAVTT