Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3761 |
Symbol | |
ID | 4069336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4444125 |
End bp | 4445795 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637985783 |
Product | hypothetical protein |
Protein accession | YP_592835 |
Protein GI | 94970787 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1637] Predicted nuclease of the RecB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCCT TCTCCGAAGC CGAGCTGCGC GACTTTCTCG CTGACAACAT AACGAGGATT GAGCCGGGAC TGACGCTTCT CGATAAAGAA AAGTACATCC CGAACGCAAT CGGCACCCGG GGGTTCATAG ATTTGCTCGC TCAGGACGAA CGCGGCCATT TCGTACTCAT TGAGTTGAAG CGCAGTGATG CTGCGTCTCG CGAAGCCATT CACGAAGTAC ACAAGTACGT TGAAGGTGTT AAACGTCACT TGGGAGCGAG AGACGACGAG ATCCGTGTAA TTGTCGCCTC TACGGAATGG CGCGAGCTAC TTCTTCCCTT TTCGCAATTT CTCGAATCAA CCCGAATCAC GGCGGAAGGC ATTCGGTTGA TTGTAGGCGA GAGTGGTCTC CCTTCTCTCA CGGCTGAAAA GGTGGAGCCA GTTCGCGTCA GTCGCGGTCG ATTCTTGGCT CCTTGGCACG AACTGAACCA CTACACATCT GAGGACAGCC TCGCAAAGGG CATTCTCGAC TACGAGCATT CATGTCGCCT AAAGGGCGTT GACGATTATG TTCTGGTCGT CTTTGAGGCA AGTCCCGACT GGTATCCGCT GGCTCAGGAA GAGTTTCGGG CCTCCATGAT TCAAATGCAG GAGCAATTTG GAGTGCATGA CCCTGCCGAG ATAGATGAGA TGGTGGCCAA ACTGCCGAAC TTTCGTTTCG CTTTGTACTT CGCATCCCAG GTGCTCGGAC GAGAGTACTG CCTCGAAGTA CTGCGGCAGA ACTCCGAGGA CATGGAAGAG CATGAAGAGA TCATCGATGG CATGGAAGAA GAAGAAGCGC TGCAATATCT GAACGACGCG GTCCACAACT TGGCACCCAA AAAGCACAGA GACGGCTTTG AAATTGGCTA TCCCGGTAAG TTTCAAACTC GATTCCGTGT TGGGAACCTC TGGATTCTGA AGCGGATTCA GCGTTACGGA ATGTTTCAGA GGAACACATT GCTCTCCGAG GAAGAGATTC TGGAGGAGCT AGCCGGAAGT GAAGGAGTTA CGGGGCAGCG CTTCAAGCGG CAAATTACGA TCAACAACAA GAGCCACATC GCATCCGCTA AAGATGGTCT CCGAGAGTGC CTGGAACACA ACCCCGTTTG GCTTGCCCAC ACCCTGAAAA TCATCGACGA AATTGAGAAG GATTATCCTG AGCTGGAGGC CAGCATCGAC GTGTTCAACC CTTCGACTGG GTTGATGACG ATATATTTCG CAGCAACCCG GCCCGAGCCG TTTGCATTCA TTCCACTTTT CACCATCGCA GTGCGGGACG GCGCGGCGAC CATCAGAGCT TACCTTGGCT GTTTAGAAGG GGTCAGTTCA CCGATGAGCT TTCAATCGTT GATTGACAAA TACTATGATG GCCAGATTGG AGTGTTGCTG CTGTCGGTAA CGTGGGGAGG CTATGAGCAG AGGGACACAG ACATTCTCGA AGATTGCGGC CTCTTCTATC GTTCGTTCCG CGTAGATGAC ATCGGATCGA CGAACGCCTT TAGCAGTTTG AGAGACGAAC GGTGGCGGCC GGTACCCCAG TTTTTTCCAC CTGAGAAGTT CTCCGAACAC CTCGATCATC ATGGTGAATT TGTCATGGAA CTTCTTCGCG AGATTGGTTC TCGTGACAAG GGCAGCTACT TCGAGAGCTA G
|
Protein sequence | MPPFSEAELR DFLADNITRI EPGLTLLDKE KYIPNAIGTR GFIDLLAQDE RGHFVLIELK RSDAASREAI HEVHKYVEGV KRHLGARDDE IRVIVASTEW RELLLPFSQF LESTRITAEG IRLIVGESGL PSLTAEKVEP VRVSRGRFLA PWHELNHYTS EDSLAKGILD YEHSCRLKGV DDYVLVVFEA SPDWYPLAQE EFRASMIQMQ EQFGVHDPAE IDEMVAKLPN FRFALYFASQ VLGREYCLEV LRQNSEDMEE HEEIIDGMEE EEALQYLNDA VHNLAPKKHR DGFEIGYPGK FQTRFRVGNL WILKRIQRYG MFQRNTLLSE EEILEELAGS EGVTGQRFKR QITINNKSHI ASAKDGLREC LEHNPVWLAH TLKIIDEIEK DYPELEASID VFNPSTGLMT IYFAATRPEP FAFIPLFTIA VRDGAATIRA YLGCLEGVSS PMSFQSLIDK YYDGQIGVLL LSVTWGGYEQ RDTDILEDCG LFYRSFRVDD IGSTNAFSSL RDERWRPVPQ FFPPEKFSEH LDHHGEFVME LLREIGSRDK GSYFES
|
| |