Gene Acid345_3727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3727 
Symbol 
ID4069303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4404309 
End bp4405337 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content51% 
IMG OID637985750 
ProductFis family transcriptional regulator 
Protein accessionYP_592802 
Protein GI94970754 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAATCGG ACAACGAGGT TCGATCAAAT GGGCAGCGCC CGGTCGAACG CAGATTTGAA 
CAAATCATAG GCAAAAGTCC CGCGTTGGAA TCAGCATTTG CGGACGTGGA ACAGGTAGCC
CCGACTGATT CCACTGTGCT GATCCTGGGG GAGACCGGGA CAGGTAAGGA GTTGATTGCT
CGAGCTATTC ACAATATCAG TCCTCGCTGC GGACGTCCGT TTGTAAAGCT GAATTGCGCT
GCTATTCCAT TCGACCTTTT AGAAAGTGAG CTTTTCGGCC ACGAAAAAGG TGCTTTCACC
GGAGCGATCG CGCAGAAGAT TGGACGTTTC GACATGGCAA ACACGGGGAC GCTCTTCTTG
GATGAAATAG GAGACATTCC CCTGGCGTTA CAGCCTAAGC TCCTGCGTGT GCTACAGGAG
CAGGAATTCG AGAGGCTAGG AAGTTGCCGT ACCCATCGCG TTGATGTGCG CTTAGTCGCT
GCCACTCATC GCAATTTGAT AGAAATGGTT AAACGGGCTG AATTCCGCAG TGATCTCTAT
TATCGTCTGA ATGTTTTTCC TGTCACTCTC CCAGCTCTCA GGGAACGCAA GGAGGACATC
CCCCTACTTG TCGCTCATTT TGTTAAGGTT TTTGGGCAAC GCATGGGCAA ACGAATCCTT
AATGTCCCGC AGAGCACGAT GGACGCTTTA ACGGAATATT CTTGGCCCGG CAACATTCGA
GAGCTACAAA ATCTGGTGGA ACGTGCCGTG ATTCGATCGA ACGACGGGTT GCTTCCGAAT
CCGCTACCTC AATCTGCGAA TTATCAACAC AACCAGTTCA CTCAGGGCGT GTTCGCTGAT
TCTCAGCGGG AAGTGATATT GAAAATGTTA GATACGTGCG GTTGGATTCT CGGGGGGTCC
CGTGGAGCAG CCAGTCGGTT GGGACTAAAA CGAACCACCC TAATCGCAAA AATGAAAAAG
CTCGGGATCT CAAGGCCCCT CTCCGAAAGC GATAGACCTC AACTGACTGA ACAGCAAGAG
AGCGATTAG
 
Protein sequence
MESDNEVRSN GQRPVERRFE QIIGKSPALE SAFADVEQVA PTDSTVLILG ETGTGKELIA 
RAIHNISPRC GRPFVKLNCA AIPFDLLESE LFGHEKGAFT GAIAQKIGRF DMANTGTLFL
DEIGDIPLAL QPKLLRVLQE QEFERLGSCR THRVDVRLVA ATHRNLIEMV KRAEFRSDLY
YRLNVFPVTL PALRERKEDI PLLVAHFVKV FGQRMGKRIL NVPQSTMDAL TEYSWPGNIR
ELQNLVERAV IRSNDGLLPN PLPQSANYQH NQFTQGVFAD SQREVILKML DTCGWILGGS
RGAASRLGLK RTTLIAKMKK LGISRPLSES DRPQLTEQQE SD