Gene Information |
Locus tag | Acid345_4260 |
Symbol | |
ID | 4073187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5059440 |
End bp | 5060615 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637986292 |
Product | hypothetical protein |
Protein accession | YP_593334 |
Protein GI | 94971286 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
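The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 1176 bp) and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5059440, 5060615)
print(length)                  # 1176
print(protein_length(length))  # 391
```

Both results agree with the Gene Length and Protein Length fields. The strand does not affect the length calculation.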
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGT ATCAGATAGC CATTTTCGCC GATGGTGCCG ACGGATATGC GGCCACTTTG CGGGGGACGC TACAGCGCTG TATTGCAGAG CTTGGCGTTC CAGCTGGTAT GGTGTCTTTT CTCGACGAGG CATCCGTTGC GACACGTGAT CACAAATCTC CGACAGTGGG CGTTTTCTTC GGCTTGACTC CGCACCCTAT CACCAATCCA ACCCTATCGA CTCTGATTGA GGAAGCTGCC CTGGTTCTAC CGGTCGTTCC TACTCTGGAC CGCTTCAGTG AATTTGTTCC CGATGATCTG CGCCCCATCA ACGGCATGGC ATTGCGACCA GAGGATTCGG CAATGGAAAG AATCTCCTCG GTGCTTCTCG AAGGTCTTGG TCTACTTCGC AAGAGCCGTC GGTTATTTAT CAGCTACCGT CGCGTCGAAA CGCAAGGGAT TGCGATTCAA CTTTATGAGC AGCTCGACGC CAATGGTTTT GATGTCTTCC TCGATTCTCA CAGCATCCGA CCCGGAGAGC CGTTTCAAGA AGTGCTATGG CATCGCTTGG CTGACACCGA TGTTGTAGTT CTCCTCGATT CGCCAGGGTT TCTGAGCAGC CGATGGACCG AAGAAGAACT TGCTCGAGCG AACTCCACCA ACCTACAAAT CTTACAGCTG CTTTGGCCTG AGAGCGCAAT GACGTCGGCT GCGGCTTTCA GCAAGCCCTT TGGCCTTGCT GACGGTGATT TTGACGTCGC GACGAATCAG CTCGGAGCTA CGGCGCGGTT ACGCGACGAA TGCCTGCGAA GAGTGACTAT TGAAGTCGAG TCTCTGCGAG CCCGCGCTCT CGCTGCGCGC CATGCGTACC TCGTGGAGGA GTTCTTCTCC GAAGCGAGAG CAGCCGGACA CAATCCACAA GTGCAGCCTG ATCGGTTCAT TCTGTTGGAG ACGAAAGGCG GCAATCGCTA CATCACGGTA CCGACCGTGG GAGTACCAGA CGCGGTTCGC TATCAAGAGG TGGAAGATGC GATGGCGCGA GATCCCAAGC ACCACCAGGA CATCATCCTG CTCTACGATG AACGCGGTAT CCGAGACAAG TGGATGAAGC ACTTGGCATG GCTTGATCGA CAAACCTTGC CGGTGAAAAG CCTTCAGGTT GCGAAGGCCC AGTCTTGGTT GGGAGGTCTG ACGTAA
|
Protein sequence | MSLYQIAIFA DGADGYAATL RGTLQRCIAE LGVPAGMVSF LDEASVATRD HKSPTVGVFF GLTPHPITNP TLSTLIEEAA LVLPVVPTLD RFSEFVPDDL RPINGMALRP EDSAMERISS VLLEGLGLLR KSRRLFISYR RVETQGIAIQ LYEQLDANGF DVFLDSHSIR PGEPFQEVLW HRLADTDVVV LLDSPGFLSS RWTEEELARA NSTNLQILQL LWPESAMTSA AAFSKPFGLA DGDFDVATNQ LGATARLRDE CLRRVTIEVE SLRARALAAR HAYLVEEFFS EARAAGHNPQ VQPDRFILLE TKGGNRYITV PTVGVPDAVR YQEVEDAMAR DPKHHQDIIL LYDERGIRDK WMKHLAWLDR QTLPVKSLQV AKAQSWLGGL T
|
| |
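The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the space-grouped sequence could be cleaned, translated with translation table 11, and checked for GC content. The helper names are hypothetical, no external library is used, and the codon table is built from the amino-acid string for table 11 in TCAG codon order. Alternative start codons are not handled because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
prefix = "ATGAGTCTGT ATCAGATAGC CATTTTCGCC"
print(translate(prefix))  # MSLYQIAIFA
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.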