Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0557 |
Symbol | |
ID | 4073046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 685027 |
End bp | 686007 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982562 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_589636 |
Protein GI | 94967588 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.245938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0550645 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAGT TCTTCAGCTT CGTGCTTCTC CTCGTCCTCG CCGTCGCGGG ATGGCTTGCC TGGGCGCTCT ACCTGCCCGT CGCGCCGTCA GAGCCGAAAT TCGTGCTGCT GCGTCCCGGC TGGACAACGC GCCATATCGC CCGCGAGCTG AAGGACAACG GCATCATTCG CTCCGACAAA GCGTTCCTCT TCATGCATAT CCTGCGCGGC GAGCGCAGCC TGAAGGCCGG CGAATATAAA TTCGATAGCC CGGCGAATGC GCTGAACGTC CGTGATCGCC TCACCCGCGG CGACATCTAC GTTCGCCAGG TCACGGTCCC CGAGGGCTAC AACATGTTCG ACATCGCACA GGCGGTCGAA CAGGCGGGCC TCGGCACTGC CGCCGAATTC CTCAACGCGG CACGCCAGGA TTTGTTCCTG CTCAAAGATG TCGATCCGAC AGCGAAATCC CTCGAAGGCT ATCTTTTCCC GGACACCTAT TCGTTCACGC GCACCATGTC ATCGCATGAC ATGGCCACCG CCATGGTGCA TCGCTTCAAG CAGGAGGCGA AGGCACTCAA TCTCGACAGC GATGTCCATC GCGTGGTGAC GATGGCCTCG ATCGTGGAAA AAGAAACCGC AGTTCCCGAC GAGCGCCCCC AGGTCGCCAG CGTCTATTAC AACCGGCTCG ACAAGAACAT GACGCTCGCT GCCGACCCGT CGGTGATCTA CGCCGCGCTC CTCAATAACC GCTACCGCGG CACCATCTAC CAGTCCGACC TGCAGTACGA CTCGCCCTAC AACACCTATA AGTACGCCGG GCTTCCACCG GGGCCAATCG CCAATCCAGG CCGCGCGGCG CTCGCCGCAG CCATGCATCC GGCGCAAACG CAGTACCTCT ATTTCGTCGC CGACGCGCAG GGCCATCACC GTTTCGCAGC AACCCTCGAC GAGCACAATC GCAACGTGCT GGCCTACCGG CGGGCAATAG CGGCGAAATA A
|
Protein sequence | MRKFFSFVLL LVLAVAGWLA WALYLPVAPS EPKFVLLRPG WTTRHIAREL KDNGIIRSDK AFLFMHILRG ERSLKAGEYK FDSPANALNV RDRLTRGDIY VRQVTVPEGY NMFDIAQAVE QAGLGTAAEF LNAARQDLFL LKDVDPTAKS LEGYLFPDTY SFTRTMSSHD MATAMVHRFK QEAKALNLDS DVHRVVTMAS IVEKETAVPD ERPQVASVYY NRLDKNMTLA ADPSVIYAAL LNNRYRGTIY QSDLQYDSPY NTYKYAGLPP GPIANPGRAA LAAAMHPAQT QYLYFVADAQ GHHRFAATLD EHNRNVLAYR RAIAAK
|
| |