Gene Acid345_1686 details


Gene Information

Locus tag: Acid345_1686
Symbol:
ID: 4069354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2043950
End bp: 2045020
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 56%
IMG OID: 637983694
Product: LacI family transcription regulator
Protein accession: YP_590761
Protein GI: 94968713
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
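
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2043950, 2045020)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length and Protein Length values listed in the record.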


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.155219
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTAAAAC GGACGAAAGC GGCAGGCGAC AATCGCGGCA ACGGCACGGC TGAAAATCGA 
AAGCCGAACA TCACGATTGC TGATTTGGCT GCGCACCTGA AGCTGACGAA AGGTACCATC
TCCGCGGTCC TCAACAACTC CCCGTATTCG AAGTCGATTC CGCAGCACAC CAAGGACCGC
ATTCTCGCTG CTGCTGCTGA ACTCAACTAC CAGCCAAACT TCCTTGCTCG CTCGCTCCGA
CAAAAGCGGA GCTACAGCAT CGGTGTTGTT GCCGAAGAGA TCGGCGATCC CTACAGCAGC
GTCATTATTA GCGGTATTGA GTCTGTTCTC AGCCGGATGA AGTACATCTT TCTAACGGTC
GCGCATCGAC ACGACCCGAA TCTACTTCAG CAGTACTTCG ATATCCTCCG CACCCGTGGT
GTGGAAGGGA TCATCGCGAT CGACACCCGG ATCGAGTCCT CTCCTGAACT CCCGCTCGTC
GCCGTGCCTG GCTATTCGAA ATTTGACGGC GTGCATAACA TTGTTCTGAA CCATCGAACT
GCGGCCAAAG TGGCGCTTGA ACACTTGGTT GGTCACGGCC ATCGACGAAT CGCAATCTTG
CGCGGCCAGA TCCTCAGTTC AGATTCGGCA GAGCGTTGGC ATTCGATCCA AAAGGTCGCA
CAAGAGATGT CGATCAAAAT CGATCAGGAC CTTGTGGTGC AGTTGAGTGG CGACCACGCT
TCGCCGCAAC CTGGTTTCGA GGCCATTCAC GAACTGCAAG CTCGCCACGC CAAGTACACG
GCGGTATTCG CATACAACGA CATGGCCGCG ATCGGAGCGA TCCAGGCGCT GAAGAAATTC
GGCCTGCAGG TTCCGAGCGA CGTATCAGTG GTCGGATTCG ACGATGTGCG TGAAGCGACT
TTCTACTCGC CATCTCTTAC GACGGTACGC CAACCTTTGC GCAAAATGGG CGAGACGGCC
GCGGAAACCC TCGTCGGTCG AATTGAAGGC AAGACGGATC TGCCGGCGCA CGTGGAAGTG
GAGCCGGAAT TTGTGATTCG GCAGTCCACC GGTGCAGCAC GTTCCCTCTA G
 
Protein sequence
MVKRTKAAGD NRGNGTAENR KPNITIADLA AHLKLTKGTI SAVLNNSPYS KSIPQHTKDR 
ILAAAAELNY QPNFLARSLR QKRSYSIGVV AEEIGDPYSS VIISGIESVL SRMKYIFLTV
AHRHDPNLLQ QYFDILRTRG VEGIIAIDTR IESSPELPLV AVPGYSKFDG VHNIVLNHRT
AAKVALEHLV GHGHRRIAIL RGQILSSDSA ERWHSIQKVA QEMSIKIDQD LVVQLSGDHA
SPQPGFEAIH ELQARHAKYT AVFAYNDMAA IGAIQALKKF GLQVPSDVSV VGFDDVREAT
FYSPSLTTVR QPLRKMGETA AETLVGRIEG KTDLPAHVEV EPEFVIRQST GAARSL
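
The gene sequence begins with TTG, yet the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates this under stated assumptions: the helper names are hypothetical, the input is a complete forward-strand CDS grouped in blocks as shown above, and the set of start codons is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order; table 11 uses the
# same assignments as the standard code and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for translation table 11 (per the NCBI listing).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading the start codon as M."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in TABLE11_STARTS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGTAAAAC GGACGAAAGC GGCAGGCGAC"))  # MVKRTKAAGD
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.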