Gene Information |
Locus tag | Acid345_1882 |
Symbol | |
ID | 4073041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2260349 |
End bp | 2261500 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983891 |
Product | YVTN beta-propeller repeat-containing protein |
Protein accession | YP_590957 |
Protein GI | 94968909 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
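
The start, end, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2260349, 2261500)
print(length_bp)                  # 1152
print(protein_length(length_bp))  # 383
```

Under these assumptions the computed values agree with the listed Gene Length (1152 bp) and Protein Length (383 aa).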
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0454626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCAGG TGCCCCGCTT CAGATTCCAA GCTGCGCTCG GCGCGGCGAT GTTGCTGTGC CTGGCCGGCT GTCGACCTGA AAGCAATGTC TCGCCCAATG ATCACGAGTT CGCCTACGTG AGCAACGGTA AGAGCGGCAC CGTCAGCGTG ATAGACGCCC GCTATTTTCG TCCGGTCACT ACGCTCATCG TTGGCAAAGA GCCAACCGGC ATCGCGATCA ACCCGGTAAA TAACGAAGTG TATGTGGTGA ATGCCGGGTC CGATTCGGTG TCGGTGATTG ACTCCGCGAA GAATGAGGTT CTGGCAACGA TCGGGGTTCA CAAGGCGCCG TATTTCATCT CCGTCAGCGC CGATGGGAAA CGTGGCTACG TGGCGAATTC CGGGTCCGGC AATTTGTCGG TGCTCGACCT GGAGAAACAT TCGGAGATCG CGACCGTCGC TGTGGGCTCA GCGCCAGGTG AGGCTGCGGT TACGCCGGAC GGGAAGACGG TGGTGGTCTC GAACCGTACA GCCAATTCGG TTTCGCTCAT TGACGCCGCA AAAAGCAGCG TGCGGACGAC GCTTTCGGTT TGCAAGGAGC CACAGGACAT CGTGGTGATG AAGAATTCGG CGAAGGCGTT TGTGGTCTGT CCGCCGACGA ACCAGGTAGC GGCTATCGAT CTGAAGCGCG ACGTGGTGCT CGCGCTATTG GACGTTGGGG GGTCGCCGAT TCAGCTCACG ATAAAGCCGG ATGGCGGCGA AATCTTCGTC AGCAACTTTG GCTCGAACAA TATTTCCTCG ATCGAGGCGT ACACCAACGA AGTCGGCAAC TCGTTTCCTA TCGGCGACAA ACCCACGATG GCCGTGACCA CCAGCGACAA TTCCCTGCTG TACGTGAGCG ACTTCGGCTC CGATTCGGTG TCGGTGTACA GCATTGACGA CGGCAAGGCG ATTGATTCGA TCCAGGTCGG CAGCCGTCCG GATGCGTTAG CTTTTACCAG CGACGAGTCG CACCTGCTGG TGGTCAACAG CGGCTCGGGC GACGTGGCTG TGGTGCGCAC CGTAAAGACC AAGCAGAACT CGAAGATCAG CCGCGAACGC TCGCTCGTGA CGATGATCCC GGTTGGCGCA CAGCCCAATG CAATCGCCAT TAAGAGCGTA ACAACGAAAT AG
Protein sequence | MMQVPRFRFQ AALGAAMLLC LAGCRPESNV SPNDHEFAYV SNGKSGTVSV IDARYFRPVT TLIVGKEPTG IAINPVNNEV YVVNAGSDSV SVIDSAKNEV LATIGVHKAP YFISVSADGK RGYVANSGSG NLSVLDLEKH SEIATVAVGS APGEAAVTPD GKTVVVSNRT ANSVSLIDAA KSSVRTTLSV CKEPQDIVVM KNSAKAFVVC PPTNQVAAID LKRDVVLALL DVGGSPIQLT IKPDGGEIFV SNFGSNNISS IEAYTNEVGN SFPIGDKPTM AVTTSDNSLL YVSDFGSDSV SVYSIDDGKA IDSIQVGSRP DALAFTSDES HLLVVNSGSG DVAVVRTVKT KQNSKISRER SLVTMIPVGA QPNAIAIKSV TTK
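
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG, although the gene lies on the minus strand) and relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGATGCAGG TGCCCCGCTT CAGATTCCAA"))  # MMQVPRFRFQ
print(translate("ACAACGAAAT AG"))                      # TTK
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence; passing the full gene sequence to `translate` would be the natural way to check the remainder.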