Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4267 |
Symbol | |
ID | 4073194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5069782 |
End bp | 5072481 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637986299 |
Product | endonuclease |
Protein accession | YP_593341 |
Protein GI | 94971293 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCG ACAACGCCCG GCTGATTGAG AACGCGGTTC TGGTATCTCT TCCAGATCAA CACGCTCCGA AAGAAGAGGC GATCCTTGAA CTCGCGACAA GGCTGAGGCA GGCGTTCCCG CTGAGCGACG ACGACTTTGC GTCGTTGATC AAACGATTAC ACGCAAAGTT AGCAATCACA ATGGATACAG GCGTCGTGTT GCTCGGGGAG GAGGAACACA CACCGTGGTT GAGTTCCCGC AAGGCCGTCA TCGATCCGTT CTATTGGCAG CGCTTTCTTC AGCTGCTTCA AAGAAAGGAC TGGCCGCCGA AAGTCTTAAG CACTCTGAAT TCAGTGACGG ACAATATTTT GGATTTGCTT GGAGACCCTG CTAAGCCCGG ATCTTGGAAG CGTCGTGGTT TGGTCATCGG CGACGTCCAA TCTGGGAAGA CAGCCACGTA CACCGCCTTA TCCTGCAAAG CGGGTGATGC TGGATATCGC CTGATTGTCT TATTGACAGG AACACTAGAA AGCTTGCGAC GCCAAACTCA GGAGCGACTC GATGAGGGTT TTGTCGGCTT CGACAGTTCT GGAATCTTGC GAAAGATCAG GAACAATCGC GCAGTAGGAG TAGGGACCTT AGATGCCCGC AGATCCGCTG GGGTCTTCAC ATCGCGAGAC CGCGACTTCA GTAAGACTTT GGTCAACTCG CTCGGCATCA GGATCAATTC AATCAAAGAG CCGGTTCTCG TTGTAGTGAA GAAGAACCGT AAGATCCTCG AGAATCTTGA GAAGTGGCTG ACGGAGTACA ACGCAGGGGA TGACGGCAAG ATTGATGTTC CCCTTCTCTT AATCGACGAT GAAGCGGATA GCGCATCGGT TAACACTAAT CCACTGTCGA CAGATCCCAC GGAAATCAAC AAACGAATCC GAGCGCTGTT GGCTTTGTTT AAGAGGTCGA GCTATATCGG ATTCACCGCA ACACCTTTCG CTAACATTTT CATAAATCCT GATAGCGAAA ACGATATGCT GGGCGACGAT CTATTCCCCA GGGATTTCAT CTACACCCTC GACCCACCGA CCAACTATGT TGGCCCTGTT GTTATGTTCG GGGATGAACC GCGGGACGGC ATTCTCGAGC CGATTTCCGA TGCCGAAAGC GTCTTTCCTT CTCGACATAA GTGCTCATGG CCAATCAACG ATTTGCCTCA GAGCCTTCGC GATGCGGTTA CTTCCTTTGT GATTGCCAAT ACAATTCGTG ATTTGCGCGG AGACAGCGCG ACGCATCGCT CGATGCTGGT GAACGTGAGT CGATTTACCG CTGTTCAGGA CCAAGTTGCA GTTTTGATCA ATTCCGATCT CAATCGCATT CAGCAGGACA TTAGGAATTA CAGTCAATTA GATCCTGCAA TAGCTTTACG GAATAAGACC ATTTCCGAGA TCCACCAGGT TTGGAGAAGC AGCTACAACA CCAAGGAATT CGCTTGGGAA GGCGTACAGC GGGCATTGCT TGCATCCGCA CTGCCAATTG TCGTCAAGGC CGTGAACCAG CGAACCGGCG CTGCGAGCTT GGACTACGCC AGCAATCGAG AGAACGGTCT GCGTGTGATC GCTATTGGGG GAAACAGCCT GTCTCGTGGG CTGACGCTGG AGGGCTTGAG TACCAGCTAC TTTTATCGCA ATTCCCAAAT GTATGACACC CTGCTCCAAA TGGGGCGTTG GTTTGGCTAT CGCGACAACT ATTCAGATCT CTGCAAAGTA TGGCTCTCAG AAGACGCAAT CCAGTGGTAT TCGCATATCA CGGCCGCAAC CGAGGAACTG CGGTTTGAAG TGAAGCGCAT GCGGAGAATG AACGCGACTC CGCGCGAATT CGGTCTCAAG GTGAGAGCAC ATCCTGATTC TTTGATTGTG ACCGCGCAGA ACAAGATGAG ACTCGCGCAC ACGATAGAAC GAGTGATCTC CATCAGCACC GAAGCTATAG AATCGACGCG GCTTAAGAGC AGCAGGGTCA TCATCTCGGC CAACAAACAG GTTGTTGCGA ATGCTATCGC CAATTTCGAG AGAGCTGGAA TTGCTTGCGA GAGCTCGGAA TGGAACAACC CCATCTGGCG AGAAGTTCCC AAGGAACTTG TAAGCGCCTT GATCAGAAAT TTCGAGGTTC ACCCGCTCAA TGTTGCGTTC CAAAGTGAGG ATCTCGCCGA CTACTTTACA AACACGACAG AACCGAAGTT GCAAAAGTGG GATGTCGTCC TTCCGAACGG CGGTGAGCCG GAAATTATCT TCGTCCGAAC TCGAGTCCGC CCTGCAAAGC GCTTTGTGCT GCCACGTGAC AATGGGATTC TTGTGTCTGG GAGAAATATG CGGGTAGGCT CCCGAGGGAT CGAACGAGAG GGTCTGCCGA GTGGGATCGT CAGGGAAATC AATGACCAAG CAAAACTCAC AAAAAAGAAT GTGTCGGATC ATGCTTTCCG CGAACGTCGC CCACGCCCCC TTCTTTTGAT CCATGTACTC GCGCCGTACA CAAGGGACGG GAACGGCGTT GAGGTTCCAT TCGATACCGG TGGAGAAGAA CTCATTGCGT TGGGACTGAG TATGCCGAAG TTCGATGATA GCGATGTCGC AAAGAGAGTT AAGTACAGAG TGAATCTCGT CGAGTGGAGA GCAATGCTGG AAGAGTCATT GGACGATGAT TTACCAGAGA ACGATGATGA CGCAGCTTGA
|
Protein sequence | MSIDNARLIE NAVLVSLPDQ HAPKEEAILE LATRLRQAFP LSDDDFASLI KRLHAKLAIT MDTGVVLLGE EEHTPWLSSR KAVIDPFYWQ RFLQLLQRKD WPPKVLSTLN SVTDNILDLL GDPAKPGSWK RRGLVIGDVQ SGKTATYTAL SCKAGDAGYR LIVLLTGTLE SLRRQTQERL DEGFVGFDSS GILRKIRNNR AVGVGTLDAR RSAGVFTSRD RDFSKTLVNS LGIRINSIKE PVLVVVKKNR KILENLEKWL TEYNAGDDGK IDVPLLLIDD EADSASVNTN PLSTDPTEIN KRIRALLALF KRSSYIGFTA TPFANIFINP DSENDMLGDD LFPRDFIYTL DPPTNYVGPV VMFGDEPRDG ILEPISDAES VFPSRHKCSW PINDLPQSLR DAVTSFVIAN TIRDLRGDSA THRSMLVNVS RFTAVQDQVA VLINSDLNRI QQDIRNYSQL DPAIALRNKT ISEIHQVWRS SYNTKEFAWE GVQRALLASA LPIVVKAVNQ RTGAASLDYA SNRENGLRVI AIGGNSLSRG LTLEGLSTSY FYRNSQMYDT LLQMGRWFGY RDNYSDLCKV WLSEDAIQWY SHITAATEEL RFEVKRMRRM NATPREFGLK VRAHPDSLIV TAQNKMRLAH TIERVISIST EAIESTRLKS SRVIISANKQ VVANAIANFE RAGIACESSE WNNPIWREVP KELVSALIRN FEVHPLNVAF QSEDLADYFT NTTEPKLQKW DVVLPNGGEP EIIFVRTRVR PAKRFVLPRD NGILVSGRNM RVGSRGIERE GLPSGIVREI NDQAKLTKKN VSDHAFRERR PRPLLLIHVL APYTRDGNGV EVPFDTGGEE LIALGLSMPK FDDSDVAKRV KYRVNLVEWR AMLEESLDDD LPENDDDAA
|
| |