Gene Acid345_0533 details


Gene Information

Locus tag: Acid345_0533
Symbol:
ID: 4069953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 657088
End bp: 658173
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 59%
IMG OID: 637982538
Product: LacI family transcription regulator
Protein accession: YP_589612
Protein GI: 94967564
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
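
The lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. Neither is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 657088
end_bp = 658173

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1086 361"
```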


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.985534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.47776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCGGC GTAAAACGAT GCCCCCTCCG CGGACATCGT CGGACCATAA AAAACCGATC 
AGTCTGAAGC GTCTCGCCGA GCATCTCGGG CTCTCTTCTG CCACTGTTTC CATCGTCATT
AACCGCAAGC CCCTCTCCGA CATGATCCCG GAAGAAACCA AGACCCGGAT ATGGGAAGCG
GCGAGCCGTT TCAACTATCG GCCCAACATC ATCGCTCGCT CTTTGCGCCA GCAGCGGACC
TACTCCATCG GGGTTTTGCT GCCGGAGTTT AGCGACGGTT ATTCCGCTTT GGTGCTAAGC
GGCATTGAAG ACTACCTTTT AGGGAAGGGG TATGCATGGC TGGCGGCAAG CCATCGTCAC
AAAGATGAAT TGATCCGCGA ATACCCGCAC CTGCTTTACA CCCGCGCGGT CGAGGGTTTG
ATAACCATCG ACACCCCCTA TGACGAGCAT CTGCCGTTCC CCGTCGTGTC TGTTTCCGGA
CACCAGACCA TCGAGGGCGT GACCAATATC GTGCTCAACC ACGATCGCTC CGCGGAGCTT
GCAATCGGTC ATCTCCACGA GCTCGGCCAT CGACGCATCG CCTTCATCAA AGGACAATCC
TTCAGCTCCG ATACCCAGGT CCGCTGGGAT TCGATCCGCA AGGCTTGCCG GAGCTTCGGC
ATTACCGTTG ACCCGCAACT CGTGGCACAG CTCGAAGGTG TGTCTCCTTC GCCGGAGCCG
GGATACCAAG CCGCGAAACG CATCCTCGCC AACAAAGTCG ACTTCACCGC GCTGTTCAGT
TTTAACGACG TCTCCGCCAT CGGCGCCATC CGCGCATTGC AGGAAGCCGA CCTCCATGTT
CCAGAAAGCG TATCCGTTGT CGGCTTCGAC GACATCGCCG TCGCGGCCTA CCACATCCCG
GCATTGACCA CCATCCGCCA GCCACTGGGT CACATGGGTT CACTCGCCGC CGAAACGCTG
GTCGAGCGCA TCGCTGCGCG CGGGAACGAA GGACCAGCAC TGCTCGAGGT CGAACCCGAA
CTCGTCGTAC GCGAATCGAC CGCACCTCTT TCTACCGCCA AGGCCGTCCC TTCAGGCAAG
GGATGA
 
Protein sequence
MPRRKTMPPP RTSSDHKKPI SLKRLAEHLG LSSATVSIVI NRKPLSDMIP EETKTRIWEA 
ASRFNYRPNI IARSLRQQRT YSIGVLLPEF SDGYSALVLS GIEDYLLGKG YAWLAASHRH
KDELIREYPH LLYTRAVEGL ITIDTPYDEH LPFPVVSVSG HQTIEGVTNI VLNHDRSAEL
AIGHLHELGH RRIAFIKGQS FSSDTQVRWD SIRKACRSFG ITVDPQLVAQ LEGVSPSPEP
GYQAAKRILA NKVDFTALFS FNDVSAIGAI RALQEADLHV PESVSVVGFD DIAVAAYHIP
ALTTIRQPLG HMGSLAAETL VERIAARGNE GPALLEVEPE LVVRESTAPL STAKAVPSGK
G
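
The gene and protein sequences can be cross-checked against each other. The following is a minimal Python sketch, not the method used by the database: it builds the codon table shared by the standard code and translation table 11, translates in frame from the first base, and stops at the first stop codon. It ignores the alternative start codons that table 11 permits, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the usual TCAG ordering (same elongation codons as table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from position 0 until the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the listing.
first_line = "ATGCCTCGGC GTAAAACGAT GCCCCCTCCG CGGACATCGT CGGACCATAA AAAACCGATC"
print(translate(first_line))  # prints "MPRRKTMPPPRTSSDHKKPI"
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.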