Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3904 |
Symbol | |
ID | 4072241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4618545 |
End bp | 4620260 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637985930 |
Product | Sel1 |
Protein accession | YP_592978 |
Protein GI | 94970930 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0190521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00495215 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATCTGCC CGAAGTGCCA GTCTGAAAAT CCTGAATTGA ACCGCTTCTG TGGAGCGTGC GGAAGCCGTT TGCAGGAGAC CGCTCCGGTA GATCAGGCCC GTCCGAGCAA CGGCAACGAT GCGAACAAAC CCGGAGTTCC GGGAGTCCGT CGTCCGTTCA TCGTTTCAAC CACAGCCATG GCGGACGTCA CAGCGCAGAT GCTGAACACG CGCGTCGTAC TGCCGTCACC ATCGGTTGCA AGCGCAACGC ATTCTGGATA CGTGGCTCCG CTGAAGCCGC GCGTGGAACA CATTGCGCAC GGCGAAGACC TGCCGGAGCC CGAAATCGCG CCGGTTGATC CCAATGATGA ACCGATGTTC GCCGAGGGCT CCGTAGAACA GCAAGAAGGT ACGGTTCACG AATTCGATCT GGATTCCCCG GAAGAGAAGG AAGCCGAAGA ATGGCTGGAG CGGACGGTCG CCGAGCACGA AGCGCACATG CCCCCGCCGC GAACAGAGAC CCCGGGCTCC ATCTTGAATC TGAGCGCGCC GGTTGAATCG GTGGCGTCTG AGCGGCTCGA AGAGCCGCCC GTAGAGCAAG AGCCGGTTCG CAATTCGTTT CTTCAATTCG ACCCGCCGTC GGAATCGACT GGCGGCAGCG TATCCGGGCC ATCGTTCCTG GGATTGGATG AGCCACCGTC GCAGGACTAT CTTCTCGAGG AATCTGGATC CCACACGGGA AGGAATCTCG TGTTGGTGGC GATCGTGGCC ATCGTCGCCG CGATGGGCTA CCTCGAGTGG CGTGCGAGTA GCCGTGGTGA ATCCACTAAT CCGGTGGATG TACTGCACTT GAAACTCCCG AAGAAGAAGG GGCAGGGTCC GGCCGAGGTT GCAACGTCGA CGACGACCTC GCCGGGCGGC TCTTCGACTA CCGAGAGCGC CAACAATTCC GGCAAACCTG ATCTGATAGC GGAGCCGAAC CAGCCAGCGG CACAGAGTAG CGCTGCCGCG GGGAATTCTC AGACTTCCGC TACGCCGGCG CCGAACGCGA ATCCAGGAAC TGCGGAAGCG AATCCGCCTT CGACCGCCGC GGCAACGACG AATGCGGCAG GTACCTCTTC TCCGCAGCCC GCTGCAGCCG CAACGAAATC GACACCCCCA CCGGTCGAGA AACAGACGAC CGAAGTCGCA AAGAATACGC CGCCGCCCGC GAAGAAACCC GAGCCTCTAC CGCAGTCTGA CGCGGCCACC GCGAAGCCGA CCGCCAGTAA GCCGGCGGCA GCGATCGCAA GCAAGCCCCC GACGCCTGCT GCCCAGACGC AAGAAACTGA TCCGACTCTG AATGCCGGCG GTGCCGAGCT GCAGAAAGGC AAAGCTGCAG GCGCAACCGA CGATGGCCGC ATGTGGCTCT GGAAAGCTGT GGCGAAGGGC AACGGCGAAG CTCCTGTACT TCTGGCGGAC ATGTATCTGC AAGGCAGAGG CGTCCCGAAA GATTGCGAGC AGGCGATGCT GCTGCTCAAC GCTGCCGCGA AGAAGGCGAA TCCTCGCGCA CGTTCGAGAC TTGGCTCGTT GTATGCCACT GGCGAGTGCG TTTCCCAGGA TCGGGTGCAG GCTTATAAGT GGATGACCTC GGCACTCGCC GCGAATCCGG GAAGTGATTG GATCGAAAAG AACCGCCAGC AACTTCTGAG CCAGATGACG GCGTCGGAGC GCAAGCGCGC CGCTGCAATT CAGTAG
|
Protein sequence | MICPKCQSEN PELNRFCGAC GSRLQETAPV DQARPSNGND ANKPGVPGVR RPFIVSTTAM ADVTAQMLNT RVVLPSPSVA SATHSGYVAP LKPRVEHIAH GEDLPEPEIA PVDPNDEPMF AEGSVEQQEG TVHEFDLDSP EEKEAEEWLE RTVAEHEAHM PPPRTETPGS ILNLSAPVES VASERLEEPP VEQEPVRNSF LQFDPPSEST GGSVSGPSFL GLDEPPSQDY LLEESGSHTG RNLVLVAIVA IVAAMGYLEW RASSRGESTN PVDVLHLKLP KKKGQGPAEV ATSTTTSPGG SSTTESANNS GKPDLIAEPN QPAAQSSAAA GNSQTSATPA PNANPGTAEA NPPSTAAATT NAAGTSSPQP AAAATKSTPP PVEKQTTEVA KNTPPPAKKP EPLPQSDAAT AKPTASKPAA AIASKPPTPA AQTQETDPTL NAGGAELQKG KAAGATDDGR MWLWKAVAKG NGEAPVLLAD MYLQGRGVPK DCEQAMLLLN AAAKKANPRA RSRLGSLYAT GECVSQDRVQ AYKWMTSALA ANPGSDWIEK NRQQLLSQMT ASERKRAAAI Q
|
| |