Gene Acid345_3750 details


Gene Information

Locus tag: Acid345_3750
Symbol:
ID: 4069325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4425167
End bp: 4426306
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 55%
IMG OID: 637985772
Product: Fis family transcriptional regulator
Protein accession: YP_592824
Protein GI: 94970776
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
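
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4425167, 4426306)
print(length)                  # 1140
print(protein_length(length))  # 379
```

Both results agree with the Gene Length and Protein Length fields of the record.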


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.634996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGA ACCTTTCTGG CATCTCTCGA TCAGGCGAGA TCTTCGGCAG CATTCTCCTA 
ATCGATCCTG CGCTTGCTCG TCCAGCGCTC GTCCAACACC AACGAGTTGG AGAAATCCAC
TTGCGAACTT CCCCTCGCTT CAACGTGACC AGCAGTGAAA TACCTAGCTT CGAGGGAATT
GTCGGCTCCA GTTCGTCCTT GAGCAGAGCT TTGGATCGGG TGATGACCGT CGCACCGACG
GATGCTACTG TTTTAATCCA CGGCGAGACC GGGACTGGTA AAGAATTGAT CGCGCAGGCA
GTGCACCGTC TTGGTCGGCG TCGTAATGGC CGCTTCGTAC GATTCAATTG CGCCGCGATT
CCTCTCGGCT TGCTCGAAAG TGAACTCTTT GGCCATGAAA AAGGAGCGTT CACCGGCGCC
GTCGCCCGCA AGATCGGCCG CTTTGAACTC GCTAACAACG GAACGCTGTT CCTCGATGAA
ATTGGTGACA TCCCTCTCGA GTTGCAGGCA AAGTTGTTGC GTGTGCTTCA GGAACGGGAG
TTTGAACGAT TGGGCAGCAA TCAGACCCTG CATGTTAACG TACGCCTAAT CGCCGCCAGT
CACCGAGATC TTCGCCAAAT GGTACGCGAA GGCAAGTTTC GGGAGGATCT TTTCTACCGG
CTGAACATCT TCCCTATTAC GGTTCCAGCG TTGCGTGAAC GTCGTGACGA TATTCCCGCC
CTCATTCGGT ATTTCGCTGA GGATTGCGTT CGCCGCTTAG ACCGTCGGGT CAATCTCGTT
CCCCTCGAAA CAGTACGAGC TCTGACCGAA TATGACTGGC CTGGCAACAT TCGCGAGCTC
CAGAATTTCA TGGAACGATC AGTGATCCTA TCGCAAGGTG TCGAACTACA AGCCCCTCTC
GACGATCTCC GCTGGTCAAA ACCGGTGAAT GGGCCGGAGA CTCAAACGTT ATCCCAAGCA
GAATACGGGC ATATCCTAAG CGTCTTAAAA ACGACGAACT GGGTAGTCGG CGGCCCAGCG
GGTGCTGCCC TGAAATTGGG GTTGAAGCGT ACCACATTGA TCGGGAAGAT GAGAAAGCTT
GGCCTCTCGC GTTCGCGCGA AGCGAGCGTA CAGCATAGCG GCACAGTCAC GAATCGCTGA
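
The GC content field can be recomputed from a sequence printed in this layout. This is a minimal sketch that assumes GC content means the fraction of G and C bases over all bases, with the spaces between 10-base blocks being display formatting only; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above (60 bases), used as sample input.
first_line = (
    "ATGAGCGGGA ACCTTTCTGG CATCTCTCGA "
    "TCAGGCGAGA TCTTCGGCAG CATTCTCCTA"
)
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence instead of `first_line` gives the value to compare against the GC content field.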
 
Protein sequence
MSGNLSGISR SGEIFGSILL IDPALARPAL VQHQRVGEIH LRTSPRFNVT SSEIPSFEGI 
VGSSSSLSRA LDRVMTVAPT DATVLIHGET GTGKELIAQA VHRLGRRRNG RFVRFNCAAI
PLGLLESELF GHEKGAFTGA VARKIGRFEL ANNGTLFLDE IGDIPLELQA KLLRVLQERE
FERLGSNQTL HVNVRLIAAS HRDLRQMVRE GKFREDLFYR LNIFPITVPA LRERRDDIPA
LIRYFAEDCV RRLDRRVNLV PLETVRALTE YDWPGNIREL QNFMERSVIL SQGVELQAPL
DDLRWSKPVN GPETQTLSQA EYGHILSVLK TTNWVVGGPA GAALKLGLKR TTLIGKMRKL
GLSRSREASV QHSGTVTNR
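
The relationship between the gene sequence and the protein sequence can be sketched with a small codon-table translator. This is an illustrative sketch, not the database's own method: it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, dropping display whitespace, up to the first stop codon."""
    bases = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGGGA ACCTTTCTGG CATCTCTCGA"))  # MSGNLSGISR
```

The output matches the first ten residues of the protein sequence listed above.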