Gene Acid345_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1539 
Symbol 
ID4072930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1881980 
End bp1882858 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content58% 
IMG OID637983548 
ProductMerR family transcriptional regulator 
Protein accessionYP_590615 
Protein GI94968567 
COG category[K] Transcription
[S] Function unknown 
COG ID[COG0789] Predicted transcriptional regulators
[COG1917] Uncharacterized conserved protein, contains double-stranded beta-helix domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0172454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.655759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCACGA AAAATTCAAT TCGTTCCGTT CACAAGAACG GCGGTCGCGG CGTAGCGATC 
GTGCCCGCGG ATGCCCCGGC GAACGCGAAC GACGCGCCCC TGCTGAAGAT CGGTGAAGTG
GCCGACACCG TGGGTATCTC CGCCTCAGTC ATTCGCTCGT GGGAAAAGCT CGGACTCATC
GCGCCGCAAC GTACCGGCAG TAAATACCGG CTCTATTCGC AGGAAGATGT ACGTCTGCTT
CAGCGCGCTC GCTATTTAAG CAGGGTTCGC GGCATGAACG CGCCGGCGAT CATCGAATTG
ATGCGCACCA AGGGCGAGAT CAAGGCTAGC CGCAACGGAT CGGCCGACAA CGTTGGACTC
CGCTTGCGAT TAGTGCGCAC TTCGCGTGGT CTCTCACTGG CTCAGGTCGC TGAAGCGGTA
GGAATCTCGG TTGGGTTCTT AAGCGCGATT GAACGCTCGG CCATGAGCGC ATCAGTTGCG
ACCCTCCGCA AACTCGCGAA GTTCTACAAG CTCAACATCC TCGATCTCTT CGACCAAAGT
GGTGGTCGTC AGTATCTTGT ACGCCCTGCA GAACGCCGCC AACTCGATGC CAGCGACGGT
GTGCGCATGG AATTGCTTGC GTGGGGCAAT CCCGTCATGG AGCCGCATCT GTTCCACGTC
GCGCCTGGCG CCGGTAGCGG CGACGAGGAG TACTCGCACG AAGGCGAAGA GTTTTTGTTT
GTGCTTACCG GCCTGCTCAA CCTGGTAGTG GATGGCAAAG AGTACCGCCT GCAGAAAGGC
GACAGCTTCT ACTTCGAGAG CTCCACGCCA CATCGCTGGT CGAATACAGG ATCGAAAGAA
GCCGTAGTCT TGTGGGTAAA TACGCCGCCG ACGTTCTAG
 
Protein sequence
MATKNSIRSV HKNGGRGVAI VPADAPANAN DAPLLKIGEV ADTVGISASV IRSWEKLGLI 
APQRTGSKYR LYSQEDVRLL QRARYLSRVR GMNAPAIIEL MRTKGEIKAS RNGSADNVGL
RLRLVRTSRG LSLAQVAEAV GISVGFLSAI ERSAMSASVA TLRKLAKFYK LNILDLFDQS
GGRQYLVRPA ERRQLDASDG VRMELLAWGN PVMEPHLFHV APGAGSGDEE YSHEGEEFLF
VLTGLLNLVV DGKEYRLQKG DSFYFESSTP HRWSNTGSKE AVVLWVNTPP TF