Gene Acid345_0899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0899 
Symbol 
ID4069110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1122129 
End bp1123214 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content55% 
IMG OID637982906 
ProductLacI family transcription regulator 
Protein accessionYP_589976 
Protein GI94967928 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.290552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATG TTGGCATGGC CGCGCGGAAA CCCGGCAAGA AAATACATGC GCAGCAAGAT 
GTCCCTCGCC GCGTGACATT GAAGTTCCTC GCCGAATATC TTCAGCTCTC TACTACAACA
GTCTCGGTGG TGTTGAGCGA CTCTCCGCTC GCGATGACGA TCGCCCAGAA AACCAAGGAG
AGAATCTGGG CCGCAGTTGA GAAGTTCCAG TACCGCCCCA ACATGTTTGC GCAGTATCTG
CATTCCAAGC GCACCTTCAG CGTAGCCGTT CTCGTGCCGG ATATCGGAGA TGAATACTCG
TCGTCACTCA TTAGCGGCAT CGAGCGGCGA CTGTCTGAAG CAGGGTACAA ATATATCGTT
GCGAGCCATC GCGGTGCTCC GAAAGAAATC GAGACATCCC CGGAAACTCT CATGGATAGG
GCAGTCGAGG GCATGATTTT CATCAATACC CCCCTCCAGA AGAGACTTCC AATTCCCGTT
GTCGCTGTTT CTGACATCAC GACGGCACCG GGCGTGTCGA GGATTGTAAT CGACAATGAC
CGTGCAATTT GGCTCGGGCT TTCACATCTC AAGCAGCTCG GTCACAAGCG GATCGCATTC
TTCAAGGGGC CGGACCACAA CGGCGACACC GAAATGCGAT GGAAGGCCGT CCTCGAGAAT
TCCGAGAAAT TCGGATTGGA AGTCGAGCGC GAATTGACCG TCCAACTCGG AACATATCCA
GAAGTGAATG AATCAACAGT GTCCCACCAC GGGTACGCCG CTGCGATGAC GCTGCTCAAG
CGAACGCGGA GCTTCACTGC TTTAATGGCG TTCAATGACG GTTCCGCAAT CGGAGCGATT
CGCGCTTTCC AGGATGCAGG ACTCTCGGTG CCCAACGCCG TCTCGGTGAT CGGGATCGAT
GACGTTCCTC TGGGTGAGTT TATCTACCCA CGCCTTACCA CAGTCCGACA GCCTCTTGAA
CAAATGGGTC AGCTCGCCGC TTCAACACTC CTCGACCGGA TCAACGGAAT GACGGTGCTT
GAGGAGACAA AGGTCCTTCC CGAGTTGATT GTGCGAGAAT CCACTGCTCC GCAGCGCTAC
AGATAA
 
Protein sequence
MYDVGMAARK PGKKIHAQQD VPRRVTLKFL AEYLQLSTTT VSVVLSDSPL AMTIAQKTKE 
RIWAAVEKFQ YRPNMFAQYL HSKRTFSVAV LVPDIGDEYS SSLISGIERR LSEAGYKYIV
ASHRGAPKEI ETSPETLMDR AVEGMIFINT PLQKRLPIPV VAVSDITTAP GVSRIVIDND
RAIWLGLSHL KQLGHKRIAF FKGPDHNGDT EMRWKAVLEN SEKFGLEVER ELTVQLGTYP
EVNESTVSHH GYAAAMTLLK RTRSFTALMA FNDGSAIGAI RAFQDAGLSV PNAVSVIGID
DVPLGEFIYP RLTTVRQPLE QMGQLAASTL LDRINGMTVL EETKVLPELI VRESTAPQRY
R