Gene Acid345_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1831 
Symbol 
ID4072892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2211376 
End bp2212401 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content58% 
IMG OID637983840 
Productribonuclease BN, putative 
Protein accessionYP_590906 
Protein GI94968858 
COG category[S] Function unknown 
COG ID[COG1295] Predicted membrane protein 
TIGRFAM ID[TIGR00765] YihY family protein (not ribonuclease BN) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.630422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAC GGGCCGAATC AAAAGAGCTC GCGGTTTTGA GTCCAAAGAG TCCAGCTGCT 
CACCAAGTCG AACCCGAAGC ACCAGCAAAA GATGCGCCGA AACCTCCGCG GCACCGAAAG
CTAACGCTGC GCGGGTGGAA GAACGTTCTT CGACGGAGCG CGGTCGACAT CGATAACAAC
CACATTATGG CGTTCGCCGG ATCGTTGGCG TACTACTTCG TGCTCTCGTT GTTTCCGGCG
CTAATCGCGT TGGCTGCGGT TGTTTCGCTG CTACCGATGC CAGACTTGTT CCAGAACATC
ATGTCGGTGT TCGGGCGTGT GCTGCCAGAC GAGAGTATGA AGTTGGTGTC GAAGGTGGTT
GCCGACGTCA TTCGTCCCCA CAGCGGCCGC CTGCTCTCGT TCGGATTGAT CGGCAGCCTT
TGGACAGCGT CCAGCGGCTT CAGCGGAATG ATTGAGTCTC TGAACGTCGC TTACGGCGTT
CCAGAGACTC GAGCCTGGTG GAAGACACGC TTGTTGGCGA TCGGCTTGAC GCTCCTCGTC
GGCGGCATGC TCACGGTCTC GATTCTCTGT ATGACCGTTG GCCCACATTT TCTGGAAATC
TTTGCGGACA AAATCGGATT CGGCCCGATG TTCCTGCTCA CCTGGAAGAT CGTCCGCTGG
CCAATCGCGT TTGCACTCGT TGTACTGTCG ATCGAAGCAA TTTACTTCCT GGCGCCGAAT
GTGCGGCAAC ACTTCATGCA TACCCTGACC GGCGCACTGA TCGCGGTGGG GGCGTGGGTC
GTGCTTTCGT TGGCGCTGGG TGTTTACTTT GGGAAGTTCG CGCACTTCAA CAAGACCTAC
GGGGTGCTCG GCGCTGCCAT CGGATTGCTG ACCTGGCTCT ACTGGACGTC GCTGGCGATC
CTGGTCGGAG GGGAAGTGAA TTCGGAGATC ATCCAGGAAA CAGGGGATGG CAAGCTGCCC
CTCAAGCAGC CGCCGCCGGA CAAGGTGAAG CCTGTGCCGG CAGATGCGGC GCAGTTGGCG
GCCTAA
 
Protein sequence
MSERAESKEL AVLSPKSPAA HQVEPEAPAK DAPKPPRHRK LTLRGWKNVL RRSAVDIDNN 
HIMAFAGSLA YYFVLSLFPA LIALAAVVSL LPMPDLFQNI MSVFGRVLPD ESMKLVSKVV
ADVIRPHSGR LLSFGLIGSL WTASSGFSGM IESLNVAYGV PETRAWWKTR LLAIGLTLLV
GGMLTVSILC MTVGPHFLEI FADKIGFGPM FLLTWKIVRW PIAFALVVLS IEAIYFLAPN
VRQHFMHTLT GALIAVGAWV VLSLALGVYF GKFAHFNKTY GVLGAAIGLL TWLYWTSLAI
LVGGEVNSEI IQETGDGKLP LKQPPPDKVK PVPADAAQLA A