Gene Information |
Locus tag | Acid345_0100 |
Symbol | |
ID | 4069475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 104112 |
End bp | 105413 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982100 |
Product | hypothetical protein |
Protein accession | YP_589179 |
Protein GI | 94967131 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
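The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the single stop codon of a complete, unspliced CDS. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 104112
END_BP = 105413
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```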
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.807355 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.588322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCTATTC GCTCTCTCGC GCTACGCGCT CTTACACTCT GGTGCTTGAT CGTGCTGCTG TGCGCGCCCG GCGCCTACGC GTACTCGGTT CTTACTCACC AGGCAATCAT TGATCTGGCA TGGGACGATT CGATTCGCCC ATTTCTCTTG AGCCGGTATC CGAACGCGAC AGCAGAACAA CTTCAGGTTG CGCATGCCTA TGCCTACGGT GGGTGCGCGA TCCAGGACAT GGGGTACTAC CCATTCGGGC ACACCTTCTT CAGTGATCTC ACTCACTATG TGCGTGCCGG AGATTTTGTC GCCAGCCTGT TCCGGAACGC GCAGAACTTG AATGATTTGG CTTTCGCCGC GGGCGCGCTT TCTCACTACC TCGGCGATTC TTTCGGCCAC TCCATCGCCA CGAACCAGGC CACACCCATC GAGTTCCCAG ACCTGGGCGC GCGGTATGGG ACTGTGGTGA CGTACGAGCA GGACCCGCAT GCGCACGTTC GCACTGAGTT CGGCTTCGAT ATCGAGCAAG TCTCGAAGCA GCGGTTCGCG CCGCACTCGT ACCTCGTACA CATCGGGCTG CTGATACCAC GTCCTTTGCT GGAGAAGGCG TTCTTCGAGA CTTACGGCAT GCCGCTCCAC ACCCTGCTCG GCGAAGAGGG GCCGTCGATG CGGAGCTACC GGTCGGCGGT ACGCAGTTTC ATACCATTCT TCGCGCGCGG CGAGGTGGTG CTGCATCGGC ACGAGTTTCT GCAGGAGCAG CCGAGTCCGG AGTTCTCCAC GTATTCCGAG GAATCGGAGC ATGCCGACTT TCGAAACCAC TCGCCGCAGG GGTACCGGAA CCCTGGATTT GTCGGGCATC TGTCGGCGGC GATCGTCTGG ATCGTGCCTA AGAGGGGTCC GGCTGCAATG CTGGCAATCA AAATCCCCAG CCACGAGTCG CAGGAGCTTT ATGCGAAGAG CATGACTACG ACACTGGAGC ATCTGCACAA GCACCTCGGG GACCTCGGGC ACGGAGAGGT CACGACCTTT GCGCTGGCAG ACCGCGATCT TGACACCGGA GCGAGAACAA AGCCAGGTGG CTACGCGCGC ACGGACGCGA CCTACGCCAA GCTCCTGCAC GACGTGGTGA CCCGGCCGCA AATGACGATC CCACTGGGAT TGAAAGAAGA TGTTCTCGCA TACTATGCGG ACCTGAATGC GCCGATTACG ACCAAACAGA ACCCGAAGCA ATGGGCGCAG GTGCAGCAGG AACTGGAGCA TTTCCGGACC ATGAAAAGCA CCAGCCAGGT TCTGATTCCG AGCGAGCCGT AA
|
Protein sequence | MPIRSLALRA LTLWCLIVLL CAPGAYAYSV LTHQAIIDLA WDDSIRPFLL SRYPNATAEQ LQVAHAYAYG GCAIQDMGYY PFGHTFFSDL THYVRAGDFV ASLFRNAQNL NDLAFAAGAL SHYLGDSFGH SIATNQATPI EFPDLGARYG TVVTYEQDPH AHVRTEFGFD IEQVSKQRFA PHSYLVHIGL LIPRPLLEKA FFETYGMPLH TLLGEEGPSM RSYRSAVRSF IPFFARGEVV LHRHEFLQEQ PSPEFSTYSE ESEHADFRNH SPQGYRNPGF VGHLSAAIVW IVPKRGPAAM LAIKIPSHES QELYAKSMTT TLEHLHKHLG DLGHGEVTTF ALADRDLDTG ARTKPGGYAR TDATYAKLLH DVVTRPQMTI PLGLKEDVLA YYADLNAPIT TKQNPKQWAQ VQQELEHFRT MKSTSQVLIP SEP
|
| |
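The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and sanity-checked follows. It assumes the codon sets of translation table 11, in which TTG (the first codon of this gene) is an accepted start codon read as methionine at initiation. The helper names are hypothetical, and the demo uses only the first block of the gene sequence rather than the full record.

```python
# Start and stop codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends are table-11 start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Demo on the first ten-base block of the gene sequence only.
first_block = clean_sequence("TTGCCTATTC")
print(gc_percent(first_block))  # 40.0
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` should give a value comparable to the GC content reported in the record, although the database may round or compute it differently.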