Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0471 |
Symbol | |
ID | 4069466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 582758 |
End bp | 585130 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982475 |
Product | hypothetical protein |
Protein accession | YP_589550 |
Protein GI | 94967502 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0674457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAACGT TCAGGCTGAC AGTCGCAGTC GTGTTGGCAT TTGTGTCGGG GGCGACGGTG TTGTCCCCGG CGCAACATGA GCCGCCACGC GCAAAGATTG TGAGCCCGCA CGGCGCGCTG GGTCTCGCGT GCGAGAACTG CCATACCTAC ACCGCCTGGC GTCCTCTACG TGCTGTTCCG GAATTTAATC ACGACAAAAC GAAGTATCCG CTGCGCGGCA CGCACGTGAA CGTGAGTTGC CGGCAGTGCC ATACGAGCCT GGTGTTCTCG AACGTCGGCA TGAAGTGTTC CGACTGCCAC GCGGACTTCC ACCGCCGTCA GATGGGTGCC AATTGCGAAT CCTGCCACAC GGTTAAGGGT TGGAAAGTGG GAATCCAGGC AATTCAAAAC CATCAAAACC GATTTCCGCT CGTGGGTGCA CATGCGACCA CACAATGTGA AGACTGCCAC GTGGGAGCCG CTTCAGGAAA ATTTGCGGGT CTGAGCACGG ACTGCTACTC CTGTCACTCC AAGCAGTTTG CAACGCCAGT GCTCGATCAT CGCTCCAGCG GATTTCCCGT TACGTGCGAG AGTTGCCACA CCATGGATAC ATGGCTCGGG GCGAAGTTCG ATCACCTAAA ATTCACGGGT TTCGCATTGA CGGGGATGCA CGCAAAGTTG GATTGCACTG CTTGCCACCT CAACGGCAAG TTCAGCGGAA CGCCGGCGAG CTGCTACGGC TGCCACACCA AGGAGTACAA CGGCACGACA AACCCCAGCC ACGTTAACGC TGGTTTTCCG CGCGACTGTG GTATGTGCCA CAGCACAAGC AGTTGGCTGA GAGCCACCTT TGACCACAAT AAGACGAAGT TCCCACTCAC CGGTGGACAC AAAACCGTGA AGTGCGAAAG CTGTCACATC GGCGGCAACT TCAAATCGTT ACCCACGGAC TGCAGCAGCT GCCATCTCTC CCTTTTCAAG ACGACCACCA ATCCCAGCCA CACCAAGGCA GGATTCCCAA CGGATTGCAG CATCTGCCAC ACCACCGCGA ACTGGACGAG CGCCAGCTTC GATCACGGTA AGTACACCAA GTTCCCGCTG ACCGGAACCC ACCAGACTCT AAAGTGCGTG GACTGCCACG TTGGCGGAAA TTACACAGGA ACTCCCGCGT CGTGTTCCGG ATGCCATATG AAGGACTACA CCGGAGCGAA AACGCCGAAC CATGCAGCCG CAGGATTTCC TACGCAGTGC CAGATGTGTC ATAGCACGAC CGCGTGGAAG CCATCGAGTT TCGATCACAG CAAGTCGAAA TTTCCTCTAA CCGGCGCGCA TTCTTCGGTG CAATGTGCGA GCTGTCACGT AGGCGGCAAC TACACCACGC TGCCGACCGA CTGCGTGGGA TGCCATCTCT CGCAGTTCAA GTCTGTGAAG GACCCGAACC ACGTCACGCT TGGATGGCCG ACCGACTGCA CGATCTGTCA CACCACAGCG ACTTGGGCCG ACGCCCATTT CGACCACACG ACGTATACGA AGTATCCGCT CAGCGGTAAG CATGCGACTG TGGCGTGCCT AAGTTGCCAT GTTGGTGGCA AGTACGCGGG CACACCCGCA GATTGCGCGT CGTGCCACAT CAAGGACTAC AACGGTACCA CCGACCCAAA CCACAAGGCG GCGGGTTTCC CCACCGATTG CTCCATCTGT CACGCGACCG CGGGATGGAA GCCGGCGACT TTCGATCACA ATAAAACCAA GTTCCCACTG ACCGGGCAAC ACACCAAAGT CGACTGCATT GCGTGCCACA AGAATGGCGT GTATGCGGGA TTGCCGACAA CGTGCGTGTC GTGTCACCTC AATGACTTTA ACAAAACGAC CAGCCCCAAT CACAAAACGA GTGGCTTCCC GACTACGTGC GAAGTTTGCC ATTCCACCAA TGGCTGGATC CCAGCGAGTT TCGACCACAG TAAGACAAGC TTCCCGCTCA CTGGACAGCA CACAAAGATC AAATGTGACG ACTGCCACAA GGGGAGCTAC AACGGTAGTT TGCCCAAGGA TTGCTATAGC TGTCATAAGA CCGATTACAA CGCTACTAGC AATCCGAATC ACAAAGCGGC TATGTTCCCG ACGACGTGCA ACACCTGCCA CAACACGACA ACGTGGCTGG GCGCGGTATT CAACCACACG TGGTTCCCGA TCTACTCCGG CAGCCACGCC GGCCGTTGGA CAAGCTGCGG CGACTGCCAT ACCAATTCAG CGAATTACGC AGTGTTCTCC TGCATCACCT GCCACCAACA TTCGCAAGCA AACACGGATC CTCATCACAA GGACGTACGC GGGTATTCAT ACGGCCCGAC GACCTGCTAT AGCTGCCATC CCACAGGTAC GGCGGGCGAC TGA
|
Protein sequence | MTTFRLTVAV VLAFVSGATV LSPAQHEPPR AKIVSPHGAL GLACENCHTY TAWRPLRAVP EFNHDKTKYP LRGTHVNVSC RQCHTSLVFS NVGMKCSDCH ADFHRRQMGA NCESCHTVKG WKVGIQAIQN HQNRFPLVGA HATTQCEDCH VGAASGKFAG LSTDCYSCHS KQFATPVLDH RSSGFPVTCE SCHTMDTWLG AKFDHLKFTG FALTGMHAKL DCTACHLNGK FSGTPASCYG CHTKEYNGTT NPSHVNAGFP RDCGMCHSTS SWLRATFDHN KTKFPLTGGH KTVKCESCHI GGNFKSLPTD CSSCHLSLFK TTTNPSHTKA GFPTDCSICH TTANWTSASF DHGKYTKFPL TGTHQTLKCV DCHVGGNYTG TPASCSGCHM KDYTGAKTPN HAAAGFPTQC QMCHSTTAWK PSSFDHSKSK FPLTGAHSSV QCASCHVGGN YTTLPTDCVG CHLSQFKSVK DPNHVTLGWP TDCTICHTTA TWADAHFDHT TYTKYPLSGK HATVACLSCH VGGKYAGTPA DCASCHIKDY NGTTDPNHKA AGFPTDCSIC HATAGWKPAT FDHNKTKFPL TGQHTKVDCI ACHKNGVYAG LPTTCVSCHL NDFNKTTSPN HKTSGFPTTC EVCHSTNGWI PASFDHSKTS FPLTGQHTKI KCDDCHKGSY NGSLPKDCYS CHKTDYNATS NPNHKAAMFP TTCNTCHNTT TWLGAVFNHT WFPIYSGSHA GRWTSCGDCH TNSANYAVFS CITCHQHSQA NTDPHHKDVR GYSYGPTTCY SCHPTGTAGD
|
| |