Gene Acid345_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3356 
Symbol 
ID4071274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3979943 
End bp3981397 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content60% 
IMG OID637985378 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_592431 
Protein GI94970383 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAA TTGCGGTCTC CCGCACGAGC AACAGTGCCC GCGAGCTGAG CGCCGCGGGC 
GTAGTGCTCA TCATTGACGA CGAAGCCGCG ATCCGCGAAT CGCTGCAGAC CCTGCTCGAA
CTCGAAGGCT ACCAGGTGGA CACCGCCGTG GATGGTGGCG ACGGCCTAAT GCAGATGGCG
GCGCATCCCT ACGACCTCGT TCTGCTCGAC TTCGCGCTCC CAGATCGCAA TGGCATCGAA
ATCCTGAAAG AGATCCGCAG CCGCGATACC GAAATTTCCG TGATCATGAT CACCGCCTAC
GGCACCGTGG AAAACGCAGT GAACGCGATG CAGGCGGGCG CCACCAACTT CATCCAGAAG
CCGTGGGACA ACGAGAAGCT GCTCGCCGAC GTAGGCGCTG CCATCGCACG CCGCCGCGTT
GAAGAAGAAA ACCTGCAACT GAAGCGCGCC CTGAAACAAC GCTACAACTT CGAAAACATT
ATCGGCAAGA GCGAGCCGAT GCTGCGCATC TTCGACCTCG TGACCCAGAT TGCGCCCTCG
CGCTCCACCG TACTTCTGCA AGGCGAGAGC GGCACTGGCA AAGAACTGAT CGCGAAAGCG
ATCCACATGA ACTCCACGCG CAAGGACCGC GCATTCGTGC CCATCAACAC CGGTTCGATG
CCGACCGACC TGTTGGAATC CACGCTGTTC GGGCACGTAA AGGGCGCATT CACCTCGGCC
ATCGCCAGCA AGAAGGGCCT GTTCGAAGTC GCCGATGGCG GCACGCTCTT CCTCGACGAA
ATCGGCACCA TGAGCATGGA GACGCAGGCC AAGATCCTGC GCGTGCTCCA GGACCGCAAG
TTTATGCACC TCGGCGGCGT GCAGGAGATC CAGGTAGACG TGCGCATCAT CGCCGCAACC
AACGTGGACC TGCGCCAGCT CGTAAAAGAA GGGAAGTTCC GTGAGGACCT CTTCTACCGC
CTGAACGTCA TCACTATCGA CCTGCCGCCG TTGCGCCAGC GCCGCCTGGA TGTCCCGCTC
CTCTGCGAAC ACTTCATCAA GAAGTTCTGC GAAGAAAACG CCAAGCCGCT CATGCGCATG
ACCCCCGAAG CCTTGCGCCC GCTGCTCGAT TACAACTGGC AAGGCAACGT TCGTGAGTTG
GAAAATGTCA TCGAACGTGC AGTGGTTCTC GCCAACGGTC CATCGATCAC CATCGATCTT
TTGCCCGACA ACGTCGTCGG CCGCGGCTCA AGCCTCTCGT TCGTCGAGAC CCGTCCTGAC
GCCTCGCTCT TCGAGATCAT GGAAGACTGC GAACGCCGGA TCATTGTGGA CATGCTGGAA
AAAGTCGGCT GGAACCAGAC CGAAGCCGCC GAGAAGTTTC GCATCCCGCT CTCCACGCTG
AACCAGAAGA TCAAGCGCCT CTCGATCGAG ATCAAGAAGA AGGCCTCGCG CGAGGCGCAG
CCGCAGGGAG CGTAG
 
Protein sequence
MPEIAVSRTS NSARELSAAG VVLIIDDEAA IRESLQTLLE LEGYQVDTAV DGGDGLMQMA 
AHPYDLVLLD FALPDRNGIE ILKEIRSRDT EISVIMITAY GTVENAVNAM QAGATNFIQK
PWDNEKLLAD VGAAIARRRV EEENLQLKRA LKQRYNFENI IGKSEPMLRI FDLVTQIAPS
RSTVLLQGES GTGKELIAKA IHMNSTRKDR AFVPINTGSM PTDLLESTLF GHVKGAFTSA
IASKKGLFEV ADGGTLFLDE IGTMSMETQA KILRVLQDRK FMHLGGVQEI QVDVRIIAAT
NVDLRQLVKE GKFREDLFYR LNVITIDLPP LRQRRLDVPL LCEHFIKKFC EENAKPLMRM
TPEALRPLLD YNWQGNVREL ENVIERAVVL ANGPSITIDL LPDNVVGRGS SLSFVETRPD
ASLFEIMEDC ERRIIVDMLE KVGWNQTEAA EKFRIPLSTL NQKIKRLSIE IKKKASREAQ
PQGA