Gene Acid345_2386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2386 
Symbol 
ID4071384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2819767 
End bp2820777 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content59% 
IMG OID637984402 
ProductLacI family transcription regulator 
Protein accessionYP_591461 
Protein GI94969413 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.737452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.472555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCTGC GCGACGTTGC GGACTATTTG GGTCTGTCGT CCACGACGGT ATCACTGGTG 
CTCAATAACT CGCCGGTAGC GAAGACGCTG TCAGAAGAGA CCCGTGAGCG CGTGCGTAAG
GCCGCGGAGA AGTTGAGTTA CAAGCCCAAC TACTTCGCGC GGGCACTCAA CCAGAAACGC
AACTACCAGA TCGGCATACT GGTCCCTGAC TTCGGGGAAG GTTACAACAC CAGCTTCATG
ACCAACATCG AGCGCGAGCT CGTGGAACGC GGATATCTCT ACTTCGTTTC GAGCCACCAT
TGGAACCCGG AAGCGATCGA TTTGCGTTTG CGCAGCTTTG TGGAGCGAGG CGTGGAGGGC
GTAATCCTCA TCAACACGCC GCTCGCAACG CTTCCGGACG TGCCGTTGGT CGTGGTGGGA
AGTCAGAAGT TGAAATTCCG GAGCACGCAG ATTTCTCTCG ACAACGAAGC AGGGGTGAAC
GCGGCACTGC GGCATCTCTA TGCGCTCGGA CATCGGCACA TTGCATTCGT GAAGGGGCAT
GAAGGATCTG TGGACGCGGA GCCGCGATGG AGGGCATTTG TCGACGGCTG TTGCGAACTC
GGGCTGAGGA TCGATTCTAA GGCGGTGGTG CAGTTGCATC GCATCGACGA CGGACTGGAT
CCGATCGCAG AGGGCTACAA GGCGGCCGAG ACGCTGCTTG CCTCGGGAGC GCGATTCACG
GCGGTGGTTG CATTCAACGA TATGTCCGCG ATCGGCGCCA TGCGCAAGTT CAAGGATGCC
GGGATTGACG TGCCAGGCAG GATCTCCATC GTCGGGTTCG ACAATGTCCC GATTGCTGGC
TTGGTTGATC CACCACTTAC GACCATCAGC CAGCCAATTG AAGAGATGGC ACGAGTCGCA
ACGGCCGAGG TCATCGCGCA GATCGAGACA AGCGGAAGCT TCCGCCCAAA GCAGGTTGTG
GTGGAACCTG AACTCGTGGT GCGTCGCTCG ACGACCGCAC TTATCGCCTA A
 
Protein sequence
MRLRDVADYL GLSSTTVSLV LNNSPVAKTL SEETRERVRK AAEKLSYKPN YFARALNQKR 
NYQIGILVPD FGEGYNTSFM TNIERELVER GYLYFVSSHH WNPEAIDLRL RSFVERGVEG
VILINTPLAT LPDVPLVVVG SQKLKFRSTQ ISLDNEAGVN AALRHLYALG HRHIAFVKGH
EGSVDAEPRW RAFVDGCCEL GLRIDSKAVV QLHRIDDGLD PIAEGYKAAE TLLASGARFT
AVVAFNDMSA IGAMRKFKDA GIDVPGRISI VGFDNVPIAG LVDPPLTTIS QPIEEMARVA
TAEVIAQIET SGSFRPKQVV VEPELVVRRS TTALIA