Gene Acid345_1274 details


Gene Information

Locus tag: Acid345_1274
Symbol:
ID: 4073244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1548168
End bp: 1549868
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 59%
IMG OID: 637983283
Product: regulator of RNA terminal phosphate cyclase
Protein accession: YP_590350
Protein GI: 94968302
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG4650] Sigma54-dependent transcription regulator containing an AAA-type ATPase domain and a DNA-binding domain
TIGRFAM ID:
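
The coordinate and length fields in this table agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1548168
END_BP = 1549868

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1701
print(protein_length(length_bp))  # 566
```

Both results match the Gene Length and Protein Length fields.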


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.48322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTATCC GGAAGTTGCT GATTGAGTGT ACTTTATGCG TTCGCGGTGG GAATTTATCG 
TGCTCACTAT TCGCTTATAT TCTAAGATTT ATACATATGA AGAACCGTGA GCGCCGTCCG
ACGGTCGTCG TGGGGCTGCT TGGAACCACG CTCGACCTTG GGAAACATCC AGACCGCTGG
CAGAACTGGC GACCATCGGT GGCAATTTGT CGGCAGCCGG ATCTCATTGT CGACCGCTTT
GAGTTATTGC ATGGGAGGCG CGAGCAATCG CTGGCCGAGT TGGTGCGAGC GGATATCGGG
ACGGCGTCGC CAGAGACTGA GGTTGTGCTG CACAACGTCG AGTTCAGCGA TCCGTGGGAC
TTCGAGCAGG TCTACGAGAA GCTGTTCTCG TTTGCGCGCG GCTACACGTT CGACGATGGA
GAGGACTACC TTGTGCACAT CACGACCGGC ACTCACGTCG CGCAGATTTG CCTGTTTCTG
CTGACGGAGT CGCGGCATAT TCCGGGCAAG TTGCTGCAGT CGTCACCGGA CCGCGAACGT
GGTGGCGCAG GTACGGTGAA GGTCATCGAC CTCGATCTGT CGAAGTACGA TCGCATCGCA
TCGCGATTTC GGCAGGAGCA GCAGGAGGGC GCGTCGTTCC TGAAGGGCGG GATTGAGACG
CGGAACGAGG GATTCAACCG GTTGATCGAG CGGATTGAGA AGGTGGCGAA TGCGACGCGT
GATCCGATCC TGCTCACTGG GCCGACCGGC GCGGGAAAGT CACGATTGGC GCGGAAGATC
TTCGAACTAA AGAAGCAGCG TCATGCGGTG GCGGCGAACT TTGTGGAGGT GAACTGCGCG
ACGCTGCGTG GCGATGGCGC GATGTCGGCG CTGTTTGGGC ACAGCAAAGG TGCGTTTACC
GGGGCGGTGA CCGAACGCCT GGGGTTGTTG CGCTCGGCAA ACGGCGGCGT GTTGTTCCTG
GATGAAATCG GAGAGCTGGG ACTCGATGAG CAGGCGATGC TGTTGCGCGC CATTGAAGAG
AAGGTGTTTC TGCCTCTCGG GTCCGACAGG GAAGTGAAGA GCGATTTCCA ATTGATTGCG
GGGACCAACC GCGACCTGTT GCAAGCGGTC CGGGACGGCA AGTTCCGCGA GGACCTGCTC
GCGCGTATTA ATCTTTGGAT GTTTCAACTG CCCGGTCTCG CCGACCGGGT GGAGGACATT
GAGCCGAACC TTGCGTTTGA ACTGGAGCAA TTCGCGGCGA GAAACGGCGA TCGAGTGGCG
TTCAACAAGG AGGCGTATCA GCACTTCCTG CAATTCGCGC GATCGTCGGA AGCGAAGTGG
TCGGGAAATT TCCGCGATCT GAATGCAGCC GTGGTGCGAA TGGCGACGCT ATCGGCGGGC
GGGCGGATTT CGATTGAGAT TGTGAGGGAA GAGATTCGGC GGCTTGCTGA CTCATGGTCT
CGCAGCGAGG GAGGGCATCC GAGCGATCTT TCTCTTGTCG TCGCCCCGGA AGTTCTCGTG
CAGATGGATT TGTTCGATCA GTTGCAGCTT GCCCAGGTCG TCGAAGTTTG TCGGGCCGCG
CGATCGCTGT CCGTCGCGGG GCGGCGTCTC TTTGGCGTTT CGAGGACCAG GAAGGCAACG
GGGAATGATG CTGATCGTCT CAGCAAATAC CTGGGACGGT TTGGATTGTC GTGGAACGAA
GTGCGCGCAG GGTCCCGCTA G
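
The GC content field can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown, in blocks of ten bases separated by spaces and line breaks. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a sample, so the printed value is not expected to match the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Drop all whitespace from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# First line of the gene sequence, used only as a sample input.
first_line = "GTGGGTATCC GGAAGTTGCT GATTGAGTGT ACTTTATGCG TTCGCGGTGG GAATTTATCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC share of this sample line only
```

Passing the full 1701 bp sequence through the same two functions gives a value to compare against the GC content field.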
 
Protein sequence
MGIRKLLIEC TLCVRGGNLS CSLFAYILRF IHMKNRERRP TVVVGLLGTT LDLGKHPDRW 
QNWRPSVAIC RQPDLIVDRF ELLHGRREQS LAELVRADIG TASPETEVVL HNVEFSDPWD
FEQVYEKLFS FARGYTFDDG EDYLVHITTG THVAQICLFL LTESRHIPGK LLQSSPDRER
GGAGTVKVID LDLSKYDRIA SRFRQEQQEG ASFLKGGIET RNEGFNRLIE RIEKVANATR
DPILLTGPTG AGKSRLARKI FELKKQRHAV AANFVEVNCA TLRGDGAMSA LFGHSKGAFT
GAVTERLGLL RSANGGVLFL DEIGELGLDE QAMLLRAIEE KVFLPLGSDR EVKSDFQLIA
GTNRDLLQAV RDGKFREDLL ARINLWMFQL PGLADRVEDI EPNLAFELEQ FAARNGDRVA
FNKEAYQHFL QFARSSEAKW SGNFRDLNAA VVRMATLSAG GRISIEIVRE EIRRLADSWS
RSEGGHPSDL SLVVAPEVLV QMDLFDQLQL AQVVEVCRAA RSLSVAGRRL FGVSRTRKAT
GNDADRLSKY LGRFGLSWNE VRAGSR
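
The protein sequence follows from the gene sequence under translation table 11. That table uses the standard codon assignments but accepts alternative start codons such as GTG, and an initiating GTG is read as methionine rather than valine. Below is a minimal sketch with hypothetical names, using only the first 30 bases of the gene as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    The first codon is always reported as M, on the assumption that it is
    a valid start codon for the table in use.
    """
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)

print(translate("GTGGGTATCC GGAAGTTGCT GATTGAGTGT"))  # MGIRKLLIEC
```

The output matches the first ten residues of the protein sequence. The function does not check that the first codon is a permitted start codon, so it is only suitable for sequences already annotated as coding.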