Gene Acid345_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1631 
Symbol 
ID4072518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1979150 
End bp1980295 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content57% 
IMG OID637983640 
ProductLacI family transcription regulator 
Protein accessionYP_590707 
Protein GI94968659 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.213128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.697087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTTCTG GCGCTGAATC GACTATCACT TTGATTCCGA TGAACGCAAA AAAGGGCAAA 
AAGCACGCAG CCACCATTCG CGACGTTGCG GCCGAGAGCG GATTCTCTCC GGCAACAGTT
TCCATTGTTC TCAACAATGC GCCTTTGTCG CGATATATCC AGGCCACGAC CAAGGAACGA
ATCGTGAAGG CCGCGCGCAA GTTGGGCTAT CAACCTAACC AACTGGCGCG TTCCCTACGA
AGCCGGCGGA ACAACACCAT CGGCGTAGTC GTCTTCGATC TGACGGACCC CTTCTGCACG
CCGATCATGC GTGGCATCGA GAACACGCTG TACCAATCTT CCTACGTGTC CCTACTCGCG
GATGCCCACA ACGAACTCAG CCGTTTCGAG AAGTATCTCG AGATGCTGCT GGAGCGGCGC
GTCGAAGGGA TGATCGTCAT TGCGAACTGG ACCCTGGTGG ACATCGCGCT TCTCGCGGAC
CTCGAAAAGC GCAATATCCC TACGGTCGTC ATTGGCCGGG AACTGCAGAA CGAGACGATC
AATTCGATCA TGGTGGATAA CGAGGCGGGG GGCCGTATCG CGTTGCAGCA CCTGCATTCT
CTGCAGCATC GCAAAATCGC CTTTGTTCGC GGACCAAAAT CCGTGGTGGA CAGTCCGCTC
CGCTGGCAGG GAATCACCGA CTACGCGCAA TCGGTTGGAT TGCCAGTCGA TCCAAAACTG
GTGATCGAGT TACCCGATCG GAAGGAACCG AACTCCAGTT TCGAGGGTGG GTATCGCGCC
GTCGAGGAAC TCATTAAGCG AAAGCGTCCG TTCACCGCTG TTCTGGCTTT CGACGACATG
ACCGCACTCG GCGTGATGCG CGGCCTTGCA GAGCGTGGTA TATCGGTTCC CGATCAATGT
TCCGTCATTG GATTCGACGA CGTGGCGCCG GCTGCATTCT CGAACCCAGC GCTGACAACC
GTACGCCAGC CCATGGAAGG AATGGGTTCG ACCGCGGTGG AGATCGTGGT GGAGTCCTTG
AGTTCCGATT TGCGGCCGGG AGAAATTTCC GTCGTACACC GGAAGATATC TCCGGAACTG
GTAGTGCGGG CGTCCACGCG GGCAGCTTCC CTTACAAAGA CCGCATTTCC TCCCAGTGCC
GATTAA
 
Protein sequence
MLSGAESTIT LIPMNAKKGK KHAATIRDVA AESGFSPATV SIVLNNAPLS RYIQATTKER 
IVKAARKLGY QPNQLARSLR SRRNNTIGVV VFDLTDPFCT PIMRGIENTL YQSSYVSLLA
DAHNELSRFE KYLEMLLERR VEGMIVIANW TLVDIALLAD LEKRNIPTVV IGRELQNETI
NSIMVDNEAG GRIALQHLHS LQHRKIAFVR GPKSVVDSPL RWQGITDYAQ SVGLPVDPKL
VIELPDRKEP NSSFEGGYRA VEELIKRKRP FTAVLAFDDM TALGVMRGLA ERGISVPDQC
SVIGFDDVAP AAFSNPALTT VRQPMEGMGS TAVEIVVESL SSDLRPGEIS VVHRKISPEL
VVRASTRAAS LTKTAFPPSA D