Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4280 |
Symbol | |
ID | 4071853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5086516 |
End bp | 5087682 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637986313 |
Product | hypothetical protein |
Protein accession | YP_593354 |
Protein GI | 94971306 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATA AGAACAGCGA AATGAAACTG GATCAGTTCG TGAACTACGT CAACGATGAC AAGATCAATT TGATTCCTCC CTTCCAGCGC TACCACGTGT GGGGTGTGTC AGCGCGGAGG AAACTCCTCA CGAATATTGT CCAAGGTCGC CCAATACCGG CAATTTTCCT CTACCGCGAT GCGGCTGGAG ACAAGTACGC ATACAACATA CTAGACGGCA AGCAGCGACT TGAGAGCATC ATCATGTTTA TCGGCGACAA ACACGCTTCG TTGAAAGTCA ATAATCCGAA GAATTACTTT GCAGAACGCA AGTACAAGGA CGCTGTCGGA TTCAAGATTG ACCTCGGCGG AAAGAAGAAA CAAGGTCTTG AGGAGTTGGG TGAGGATTTA GTTAGAGACC TGCGGGAATA TGTAATTCCA ACAATCGAAA TTACACTCGA CCAAGATAAT CCTAGCGCAC TCGATGAAAT CATCAATTTA TTCGTGGATA TCAATTCTAC CGGCGAACCG GTGAAGCGGT TCGCCATTGT CCGCGCAATG TCAAAGGATA GGCTTCTCAA CAGTGTACGA ATGCTCATTT CGCGCAAGGA GCAACGTCAG AATGATTGGC TTTATTTCCC TAAGAAGAAC GAATTTACCC AGGTTCTGCA AACGATGCAA ATCATCATCG GTATGAGGGA CAGAAACTCA AAAGCAGACC GAATGTGGGA GATGCTCGTT GAATTCGCCA TGTTCTTGAG AACCAAGGAG CACAGAAATC CGGTAGATAT TTTGAAGGGA TTCATCCGCG CAGCCGCCAT GAAAAGCACA AACCCCCCGC TGACGACAGC AGAAACCACT AAACTGAGTC AACTCTTCTC CTTCATTCGA AAGCTATATG TGAGTGATGC CAATTTCCGT GCATCTCGTC TTGCCGTGAA CCAAATTCAT TTCTATACGA TGGTGACAAC CATCATCGGC GAAGACCTGC TCACAGAAAA CTCGAATGAA GGCCTAGCAG AAAAACTAAA GAGCTTCGCG GCTATCTTGG AGGGCCGACG GGTGACGTCT CGTCAGCTAT CAGCAAGGAT CAGGCAATAT CAAGAACTTT CAGAAAAGCA GACCACCCAC GTTGGTCGAA GGGAGTCAAG GCAGATAATT TTTAAAGAGG TTTTGGATGC CTTGTAA
|
Protein sequence | MRYKNSEMKL DQFVNYVNDD KINLIPPFQR YHVWGVSARR KLLTNIVQGR PIPAIFLYRD AAGDKYAYNI LDGKQRLESI IMFIGDKHAS LKVNNPKNYF AERKYKDAVG FKIDLGGKKK QGLEELGEDL VRDLREYVIP TIEITLDQDN PSALDEIINL FVDINSTGEP VKRFAIVRAM SKDRLLNSVR MLISRKEQRQ NDWLYFPKKN EFTQVLQTMQ IIIGMRDRNS KADRMWEMLV EFAMFLRTKE HRNPVDILKG FIRAAAMKST NPPLTTAETT KLSQLFSFIR KLYVSDANFR ASRLAVNQIH FYTMVTTIIG EDLLTENSNE GLAEKLKSFA AILEGRRVTS RQLSARIRQY QELSEKQTTH VGRRESRQII FKEVLDAL
|
| |