Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0566 |
Symbol | |
ID | 4073055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 692983 |
End bp | 694167 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982571 |
Product | hypothetical protein |
Protein accession | YP_589645 |
Protein GI | 94967597 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.182463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0679598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACCC AACTTCGCGC TGGAATCCTC TTCCCGCCCT GTTTACCATG CTCCCATCTT GCTCGAAACG CAATTTGCCG CATCCAAACG AATGGAGAGC GAGTGATCGA TTTGCTGGAC CGCCGCGCAG TCCGCGTACT GACCACCGTG CTGGTTTTTG CCGCTGTTTT GGCGTTCATT TACGTTGCGC ACAAGACGCT CATTCTCTTT CTCTTCGCGC TTCTCTTCGC TTACTTGCTG GAACCCGTCG TCAGCCGCCT TGCCGGATGG ATGCGCAACT CGCGCGGCCT CGCAATCCTC ATTGTTTACG TTTTGCTCTT CGTCGCCGTG ACCATTGCCG GTGCACTGAT CGGGCCACGC ATCTTCTCCG AAGGCCAAAA ACTTGGCCAG CAACTCCCGG ATCTCTACGA CAAAGTCGCC TCTGGCAACA TCGCCTTCAC GCTCGGCTCC CGCCATGGCT GGAGCGCCGA AACATCCCAA CGTCTGAAGA CTCTGCTCGT CAGCCACCAG GATGAAATCG TAAGCGCCAT CAGCAGCGCC GGCACCCGGA CCGCCGCGAT GCTCACCAAC ATTGGCTGGA TCATCATCAT CCCCATCCTC GGCGCATTCC TGCTTAAAGA TAAGCGTGAC CTGCGCCTCT CGCTGCAGAA CATCGTGGGC GAGCCGCGCA AAAGAGAGTT TTTTGGTCAG CTCATCACTG ATGTTGACGC CATGCTCTCG CAATTCGTCC GCGCCCAACT CCTGCTCGCC ATCATCTCCG GACTGGTCTA TACCGCGGCA TTATCGGTGC TCCAGGTTCC TTATGCCTAC ATCCTCGGCG CCGTCGGCGG CCTTCTGGAG TTTGTCCCGC TGGTCGGCCC CGCCATCGCG GCGGTGGGCA TCGTAGGTGT CTGCTTCGGC ACCCCAAACT TCCACCACAC CCTCTGGGTC GTCGTCTTTC TCGGCGTCTG GCGCCTCCTT CAGGACTATG TGATCTCCCC GCGCCTGCTC GGCGGCAAAG TCGAACTCCA CCCTCTGCTC ACCATCTTCG GCGTTCTCGC TGGAGGCGAA GTCGCGGGTG TACTCGGGAT CTACCTCTCG GTTCCTGTGA TGGCCACGAT CCGCATCCTA TTCATTCACT GGCATCGATA CCGCGCCAGC GCCGAACTCG CGTCAGATAC CGCGCCCGTT CTCGTAGAAA AATAA
|
Protein sequence | MRTQLRAGIL FPPCLPCSHL ARNAICRIQT NGERVIDLLD RRAVRVLTTV LVFAAVLAFI YVAHKTLILF LFALLFAYLL EPVVSRLAGW MRNSRGLAIL IVYVLLFVAV TIAGALIGPR IFSEGQKLGQ QLPDLYDKVA SGNIAFTLGS RHGWSAETSQ RLKTLLVSHQ DEIVSAISSA GTRTAAMLTN IGWIIIIPIL GAFLLKDKRD LRLSLQNIVG EPRKREFFGQ LITDVDAMLS QFVRAQLLLA IISGLVYTAA LSVLQVPYAY ILGAVGGLLE FVPLVGPAIA AVGIVGVCFG TPNFHHTLWV VVFLGVWRLL QDYVISPRLL GGKVELHPLL TIFGVLAGGE VAGVLGIYLS VPVMATIRIL FIHWHRYRAS AELASDTAPV LVEK
|
| |