Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3793 |
Symbol | |
ID | 4071077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4479648 |
End bp | 4481531 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985816 |
Product | heparinase II/III-like |
Protein accession | YP_592867 |
Protein GI | 94970819 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTTGGT ATTTCGCACG TATGCGCCAG ATGGGGCCGG CGGAGGTGCT TTATCGCACT CGCCAGGCCG CCAACGTACC CATTGATTAT TTGGGTAAAG GCCAACGCGC GCAATTTCAG GCTCCCCACG GGGAATGGTG GTCGCCGAGG AAATACCCTG TCACGTTCAA CCGGGCCGGC GCCTCGTTCG AAAAGATACG AATCTTCGAT CTCGAGTTCC CTGCGGATTT CGAGTTCGAC TGGCATCGTG ATTACCGCAA CTCCAAGTCG GTAGAGCGCA AATTCTCGCG CAGCCTGGAT ATCCGCGATC CCGAAGTAGT TGGCGACATT AAGTACATTT GGGAGATCAA CCGTCACCAG CATCTGAGTG CGCTCGCATA CTCGTCCCGT CCGGACGCGT CGGACATCGT TGCGCGCTCG CTATCGGACT GGCTCGATAG CAATCCGTAT ATGGAGGGTG TGAACTGGAC GAGCAGCCTG GAATTCGGTT TACGGCTCAT CTCCTGGGCT GCCATTTTTC CGACAATGCG AGAACACTTC GCACGCAATA AAGCACTCCG CGAGAAGTTG GCGACTTCAG CCTACTTGCA TATGAAGGCG ATTCGGCGGC ATTTGAGCCG TTACTCTTCC GCGAACAACC ACCTGATTGG GGAACTCGCT GGATTGTATG TGGGTGCGAC CTGCTTCCCC TGGTGGAATG AATGTGACAC CTGGCGAGAC TTCGCTCGGA GAGAACTGGA ACGTGAGATT CTCGCGCAGT TCACTCCGGA AGGTGTAAAT CGCGAGCAAG CGATGTCGTA CCAGTTCTTC ACGCTCGAAA TGTTGCTCTT TGCCGGATTG GTCGCACGTA ACTCTGGCGA CGCCTTTGAA GGGGCGTACT ACGAGCGCCT CCGCAAGGGT TTAGATTATG TTCTTCTCGT CGCGACGAAG AGTGGCGACC TGCCATGGTT TGGTGACTCG GATGATGCCC GCGGATTCTT GTTTTCTCCG AATGAATCAG AATTGCAAGC AGTCATGAGT TTGGGGGCTG GGCTTTTCGA TGATGAACGA TACCTGTCCT TTGCGCCCCG CGGGACGGTT GCAAGCAAGG CATTGCTCGG CCCAGCAACA GAACCGATCG TCCGCCGCGC GAGTACGCCG CAAGTCGGAA CTGCGGAACT CCTGCGTGAA GGCGGTATCG CCGTAATACA GGCCGATGAC TGGAAAGTCG TAATGGACGT GGGTCCGCTG GGATTCACAA CCATCGCGGC GCACGGTCAT GCAGATGCCC TTTCGTTATT GCTGGCAGTG AAAGACCGGT ACGTTCTTGT CGACCCCGGT ACCTATGCGT ACCATTCCCA TCCGGAGTGG CGTGCGTATT TTCGAGGCAC TGCAGCGCAC AACACCGCGC GAGTCGATGG CCAAGATCAA TCTGTCATGC GCGGAAGATT CCTGTGGGAC CAGAAGGCGA ACGTGAAGGT AAAGCACTTC GCCGAAGAGG CCACCGAGAT CTGCATTGCA GCGGAACACG ATGGTTACAC CCGGCTCTCT GATCCGGTGG TACATCGTCG CGCAGTAACG GTGAAGAAGC AGGAACGGAT GATCGAAGTT GAAGACGCCT TCGAATGTAA GGGCGATCAC TCCATCGAAC TATATTGGCA CCTCGTTGAA TCGCTGACGC CGGAGGCGTT GCCGGGCGGC AGTATTCGCG CCGAGGGCGA TGGGGTGAAA TTGGACTTCA CTTTCGAAGG ACAGCAAGGC GACATTGCGA TCATCCACGG TTCGGAATCG CCTATTCTGG GATGGCGATC GACGGAATTT AACGTAAAAC ATCCGACTTC CACGGTGCGC CGCTCGCTGC GTATTCGCGG AACGCAATCC ATTAAGACAC GAATTCAGTT TTGA
|
Protein sequence | MTWYFARMRQ MGPAEVLYRT RQAANVPIDY LGKGQRAQFQ APHGEWWSPR KYPVTFNRAG ASFEKIRIFD LEFPADFEFD WHRDYRNSKS VERKFSRSLD IRDPEVVGDI KYIWEINRHQ HLSALAYSSR PDASDIVARS LSDWLDSNPY MEGVNWTSSL EFGLRLISWA AIFPTMREHF ARNKALREKL ATSAYLHMKA IRRHLSRYSS ANNHLIGELA GLYVGATCFP WWNECDTWRD FARRELEREI LAQFTPEGVN REQAMSYQFF TLEMLLFAGL VARNSGDAFE GAYYERLRKG LDYVLLVATK SGDLPWFGDS DDARGFLFSP NESELQAVMS LGAGLFDDER YLSFAPRGTV ASKALLGPAT EPIVRRASTP QVGTAELLRE GGIAVIQADD WKVVMDVGPL GFTTIAAHGH ADALSLLLAV KDRYVLVDPG TYAYHSHPEW RAYFRGTAAH NTARVDGQDQ SVMRGRFLWD QKANVKVKHF AEEATEICIA AEHDGYTRLS DPVVHRRAVT VKKQERMIEV EDAFECKGDH SIELYWHLVE SLTPEALPGG SIRAEGDGVK LDFTFEGQQG DIAIIHGSES PILGWRSTEF NVKHPTSTVR RSLRIRGTQS IKTRIQF
|
| |