Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2816 |
Symbol | |
ID | 4071819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3340513 |
End bp | 3341673 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984834 |
Product | NHL repeat-containing protein |
Protein accession | YP_591891 |
Protein GI | 94969843 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.306413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGGCG CGGAGAATGT TTCGCAGCAG AACGGCCTGC AACTCGCGGA TGCTCCTCGG TCTTTGTTTC CACTTGTCGC GCGTGGCGGT GGCAGCGAGA TTCGGTACAT TGGCGCCATC TTCGCCGATG GAAAGTTTCG TGTGGGAGCG TTTGCAAGCG CGAACGCTCA GGAGCACGCG AACTTGTACG ACGGGACGAC TCCCCTTTCG CGGCCTGCAG ACGTTCCCCC GGCCGTCGAA CTTCACCCGC GCGAGCAGGC TCGGGAGAAC CTTTCTGCAA AGAACACGGC GCAGAAATCA GCTGCAGGGC GATCGGAAAT TGTTGGGTTG CTTGACGAGG TTGTGACCGC GGTTTACGGC CGCGAGCAGA GCCTGATGGG GCCTACGTCT GTAACCACGG ACCGCGACGG ACGCGTGATC GTCTCTGATC CGGCTGCGCT ATCGGTACAT GTGTTCGATC CGATGCGACC TTTCCGCATC CTGGCTGGGA CGCAATACCG ACTCCAACAT ATCGGCCCAG TTGCCACGGA TGCGGTGCGA AACATCTATG TCGCAGATCC GGTGCAGGGC GTGGTGGTGG AATTTGATCG CGAGGGCAGG TATCTCGGCG AGATCGGGAG GCTCGGCGAG GGTGAAGGCA TCTTTCACGA GCCAGTCGCA ATGGCGGTTG ATGTTCACCA TTTCTTGTAC GTTGCGGATG CAGAGCGCGA CATGGTTCTG ATGGTGAACA GCGAGGGTAA AATCCTGCGA CGTGCGGGCG GACGGCGCAA AGAGCTGGGC GTGAGTCTCG AACATCCGAG CGCGCTGGTG CTGAAGCATG ACCAGTTGTT CGTACTGGAC GCGAATGACA CGCGCGTGCA GGTGTTCGAT TCGCAGCTTC GACGGCGCAT GACATTTGAT ACCGGGTTAG GACCAGGGCA CAGAACACTG GATCTAGACA CTGCCGGAAA TATCTATGTG AGCGATGGGC GCACGATTTA CATATTTGAC GGAGAGGGAC ACCGGAAGGG CGAGTTTGGC CGGAAGGGCA GTCTTCGCGG GGAAATTAGC AGTGTAGCGG GGCTGTGGAT CGACGAGAAC GATCGCATGT ATGTGACGGA TAAAGAGAAT CGGCGCGTGG CGGTGTTTCA GATCAAGATG TTGGAAGACA CGGCACATTG A
|
Protein sequence | MIGAENVSQQ NGLQLADAPR SLFPLVARGG GSEIRYIGAI FADGKFRVGA FASANAQEHA NLYDGTTPLS RPADVPPAVE LHPREQAREN LSAKNTAQKS AAGRSEIVGL LDEVVTAVYG REQSLMGPTS VTTDRDGRVI VSDPAALSVH VFDPMRPFRI LAGTQYRLQH IGPVATDAVR NIYVADPVQG VVVEFDREGR YLGEIGRLGE GEGIFHEPVA MAVDVHHFLY VADAERDMVL MVNSEGKILR RAGGRRKELG VSLEHPSALV LKHDQLFVLD ANDTRVQVFD SQLRRRMTFD TGLGPGHRTL DLDTAGNIYV SDGRTIYIFD GEGHRKGEFG RKGSLRGEIS SVAGLWIDEN DRMYVTDKEN RRVAVFQIKM LEDTAH
|
| |