Gene Information |
Locus tag | Acid345_1624 |
Symbol | |
ID | 4072550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1968707 |
End bp | 1971871 |
Gene Length | 3165 bp |
Protein Length | 1054 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983633 |
Product | hypothetical protein |
Protein accession | YP_590700 |
Protein GI | 94968652 |
COG category | |
COG ID | |
TIGRFAM ID | |
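The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1968707
end_bp = 1971871

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (3165 bp) and Protein Length (1054 aa).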
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.6016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGATTCGTT TGCGCAAGCA CTTTTCGGTC CTGTTGTTCT CACTCCTCTC TTCACTCGTC ATCGCGCAAC AACTCCCGGA TATGTCTCAA GCATGGCACT GGCGTCCCAT TGGCCCACTC CGTGGCGGAC GCACGCGTTC TGTCGCCGGT ATCCCGAACC AACCGAATGT CTTTTACATC GGGGTGGTGA ATGGAGGGGT ATGGCGCACG AATGATTATG GCCGCACCTG GGACTCAATC TTCGATAGCC AGCCAACGCA ATCGATCGGG GCAATCGCGG TGGCGCCGAG CGACCCGAAG GTGATTTACG TTGCCAGCGG GGAAGGCCTT CATCGCCCCG ATTTATCGGT AGGCGATGGA ATTTACAAAT CAACCGATGC TGGCAAAACT TGGACGCACC TCGGCCTGCG CGACGGTCAG CAGATTCCCG CACTTGCCGT CGATCCTCGC GATCCGAACA AACTGCTCGT GGCCGTCGCC GGACATCCCT ACGGCCCGAA CCCCGAGCGT GGCGTCTTCC GCTCCACCGA TGGCGGCCAG ACTTTTCAGA AAGTCCTCTA CAAAGATGAA AATGTCGGCG CATCTGACGT GCAGATCGAT CCGAGTAATC CCGACATCGT TTACGCAGGT CTTTGGGAGT CTCGCGAAGG GCCGTGGGAA AACGGCCAAT GGAACGGCAC CGGCGGCGGC ATCTACAAAT CAACTGACGG CGGCCAGACT TGGAACCAAC TTGCAGGTGG CCTGCCGGAT GGGATCATCC AAGTCTACGT TGCGATCTCG CCAAGCTCTC CGAACCGCCT CTATGCTTCG GTCGCTACAA AAGCCGGAGT TCACATATAT GGATCGAAAG ACGCAGGCAC AACATGGACA ACCGTGACCG ACGACGCGCG TCCCGAACAA CGAATCGGCG GTGGTGATCT TCCGGTGCCA AAGGTTGATC CACGCGATCC TGAAACGCTT TACATGACAA GCACCGTTAC ATGGAAATCT ACCGATAGTG GCAAGACTTG GATCGGTTTC CGCGGCGCGC CCGGCGGCGA TGACTACCAG AATATCTGGA TCAATCCCAA CGACCCTAAG ATCATCTGCA TCGTCAGCGA TCAGGGCGCA ATCGTCACGG TGAATGGCGG CGAGTCGTGG AGCTCCTGGT ACAACCAGCC CACTGCGCAG ATGTACCACG TGAATACCGA CAACGCTTTC CCGTACCGCG TTTGCAGCGG TCAGCAGGAG AGTGGCTCGG CGTGCGTCTC CAGTCGCGGC GACGACGGCC AGATCACCTT TCGCGAGTGG CACCCCGTCG CAGCCGAAGA GTACGGCTAC GCTGTCCCCG ATCCACTTGA TCCTGACATC GTAATTGGCG GCAAACTCAC TCGCTACGAC CGGCGCACCG GTCAGGCACA GAACATCTCA CCACGACCTT TACGCGGTCC CGACTTCCGC GTCGTCCGCA CGGAGCCGAT CGTCTTCGAT CCGAAGGACC CGCACATTCT CTACTTCGCT GCAAATGCAC TTTGGAAGAC GACGGACTAC GGCAAGCACT GGACTCAAGC CAGCCCCGAC CTCACGCGCA AAAATTTCTC GCTTCCGGCG AACATCGGAA AATTCAGCGA TCAGCCAACG GCGAAGGCAA AGCAACGCGG CGTGATTTAC GCCGTAGCCA TCTCACCTCT CGACACCAAG CGCATCTGGG CGGGCACCGA TGACGGCCTC CTGCACATCA CCGCCGATGG GGGCGCCCAC TGGACCGACG TTACCGGCAA CACACTCACT CCGTTCGAGA AAGTTTCCGT GCTCGAAGCC 
AGCCATTTCG ACGCACAAAC CGCTTACGCA GCCATCAACA CGCTGCGCCT CGATATTCTC AAACCGAAGA TCCTGCGCAC GCACGATGGC GGAAAAACTT GGGCGAATGT TCGAGAAGGT ATTCCCGACG GCGAAACTGT CAATGCTGTC CGCGAAGATC CCAAGCGCAA AGGTCTCCTC TTCGCCGCTA CCGAAAAGTC TGTTTACGCT TCGTTTGATG ACGGCGACCA CTGGCAATCT CTGCGTCTGA ATTTACCAGC CAGTTCCGTA CGTGATATTC AAATCCATGG CGACGATCTC GTCGCTGGAA CTCACGGACG CGGCTTCTGG ATTCTCGACA ACATTTCCGC GCTGCGCGAG ATTAAGCCGC AAGCATTCGA CCATCCAGTA CTCTTCCAGC CGCAGACGGC CATCCGCGTC CGCTGGAATA TGAACACCGA CACTCCGCTG CCGCCCGATG AGCCACGCTT GCCCAATCCT CCCGATGGCG CAATCATCGA CTACTTCTTG CCCGCAGGAT TCCATGGCGA AGTGAAGCTT GAAATCCACG ACGCCGCCGG AAAGGTTCTT CGCACATTCT CCAGCAACGA TCCCGTGCCG GCGGATGATC CGAAGCTGGC GATTCCCCGC TACTGGCCGC GCCCGCCACA ACCCCTTGAA GGGACACCGG GCATGCATCG CTTCTTGTGG GACATGCACC TTGCTGCCAT CCCCGGGATT CATGCAGAGT ACCCAATCGC TGCCGTGCCG CACGACACCG CCCCGGCGCC AACTTCGCCG TGGGTGCAGC CTGGCCGCTA CATGGTGGTA CTGCTCGCAA ATGGATCGCG AGCCGACATG CCGCTCACGA TCGAAATGGA CCCTCGGGTC AAAACAGCCA CTGCGGATCT CGCGCAGCAA TTCAACGCAT CGCATCAGCT CTATGAAGAC GCGAAGTTGA TCAGCGACGC CGCGGCCCAC GCGCAAGCGA TTCGCGAGCA ACTCGATCAA CTCCATTCCA AAGGCGGTGC AGCCGCCACT GACATCGAAG CCTTCAACAA GAAGATCGCC GAGATCGCCG GCGAAGAAGA AGATTTCGGC CCGCGTCGTC CTGGCACCGC TGAGACTCTA AGCAGCGTTC GGACTGGCGC TCTCTTCCTG ATGACGATGA TGCAAGACGC TGACGCCGCA CCGACTGAGG CAATGCTTAC CAAAGCCACT GAAATCCACA CGGCGACGCC AAAAGTCATC GAACGATGGA AGCAGTTCGT GCAGCAGGAG CTACCGAAGT TCAATGATCG TCTGAAGCAA GACAACCTCA GCCCGTTGAA TCCGCAGGCC AAGGTCCGCG ACGCAGAAGT CGAACTCCGT GGCAACGAAG AATAG
|
Protein sequence | MIRLRKHFSV LLFSLLSSLV IAQQLPDMSQ AWHWRPIGPL RGGRTRSVAG IPNQPNVFYI GVVNGGVWRT NDYGRTWDSI FDSQPTQSIG AIAVAPSDPK VIYVASGEGL HRPDLSVGDG IYKSTDAGKT WTHLGLRDGQ QIPALAVDPR DPNKLLVAVA GHPYGPNPER GVFRSTDGGQ TFQKVLYKDE NVGASDVQID PSNPDIVYAG LWESREGPWE NGQWNGTGGG IYKSTDGGQT WNQLAGGLPD GIIQVYVAIS PSSPNRLYAS VATKAGVHIY GSKDAGTTWT TVTDDARPEQ RIGGGDLPVP KVDPRDPETL YMTSTVTWKS TDSGKTWIGF RGAPGGDDYQ NIWINPNDPK IICIVSDQGA IVTVNGGESW SSWYNQPTAQ MYHVNTDNAF PYRVCSGQQE SGSACVSSRG DDGQITFREW HPVAAEEYGY AVPDPLDPDI VIGGKLTRYD RRTGQAQNIS PRPLRGPDFR VVRTEPIVFD PKDPHILYFA ANALWKTTDY GKHWTQASPD LTRKNFSLPA NIGKFSDQPT AKAKQRGVIY AVAISPLDTK RIWAGTDDGL LHITADGGAH WTDVTGNTLT PFEKVSVLEA SHFDAQTAYA AINTLRLDIL KPKILRTHDG GKTWANVREG IPDGETVNAV REDPKRKGLL FAATEKSVYA SFDDGDHWQS LRLNLPASSV RDIQIHGDDL VAGTHGRGFW ILDNISALRE IKPQAFDHPV LFQPQTAIRV RWNMNTDTPL PPDEPRLPNP PDGAIIDYFL PAGFHGEVKL EIHDAAGKVL RTFSSNDPVP ADDPKLAIPR YWPRPPQPLE GTPGMHRFLW DMHLAAIPGI HAEYPIAAVP HDTAPAPTSP WVQPGRYMVV LLANGSRADM PLTIEMDPRV KTATADLAQQ FNASHQLYED AKLISDAAAH AQAIREQLDQ LHSKGGAAAT DIEAFNKKIA EIAGEEEDFG PRRPGTAETL SSVRTGALFL MTMMQDADAA PTEAMLTKAT EIHTATPKVI ERWKQFVQQE LPKFNDRLKQ DNLSPLNPQA KVRDAEVELR GNEE
|
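The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an alternative start codon that is read as methionine at the initiation position. The sketch below translates only the first 18 nucleotides of the gene sequence. The codon table is deliberately partial (just the codons in that excerpt), the start-codon set is a subset of those allowed by table 11, and the function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons that occur in the 18-nt excerpt.
CODON_TO_AA = {"TTG": "L", "ATT": "I", "CGT": "R", "CGC": "R", "AAG": "K"}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short in-frame DNA prefix that begins at the start codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            # An alternative start codon is read as methionine at position 1.
            residues.append("M")
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First 18 nucleotides of the gene sequence, spaces removed.
first_18_nt = "TTGATTCGTTTGCGCAAG"
print(translate_prefix(first_18_nt))  # MIRLRK
```

The result matches the first six residues of the listed protein sequence. Note that the second TTG, at codon position 4, is translated as leucine because it is not in the initiation position.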