Gene Acid345_3439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3439 
Symbol 
ID4070323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4057118 
End bp4058872 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content60% 
IMG OID637985461 
Productserine phosphatase 
Protein accessionYP_592514 
Protein GI94970466 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1956] GAF domain-containing protein
[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTT CCCAACAACA TCCCTCCATA GACGCGTTTC TGCTGGAGGT GGCCGATGTC 
GTCAACACGA CACTGGACCT CGACACGCTG CTTACGCGCG TGGCCGGATT GGTGCGCAAG
ATCATCAACT ACGACATCTT TGCCATCCTG CTGGTGAACG AAAAGGCGCA AGAACTGCGG
ATCCGCTTCC AGGTCGGGCA TCCGGCGGAG GCGATCGAGC GCACCCGGAT CAAGATTGGC
CAGGGTATTA CCGGAGCTGC GGCACAGACC CGCGAGCCGG TGCTGGTGAA TGACGTCTCC
AAGCACCCGG AATACATTAC CTCGGTCTCC GATGTACGCT CCGAACTGGC GATCCCGATG
ATCGTGAAGA ACAAGGTCAT CGGCGTAATC GACATCGAGG CGCCGCAGAA GAACTATTTC
ACCGAAGAGC ACAGCCGGTT GCTGACGGTG ATCGCCTCAC GCGTGGCGAT TGGCATCGAA
AACGCGCGGC TTTACACGCG GGTTTCGAAC CAGGCGAAGT CGCTGCTGCT GCTCAACGAG
ATCAGCCGCG AACTGACGTC GATTCTCAAC CTCGATCAGT TGTTGAAGCG GGTTGGCGAG
CTTCTTACGC GCGTGATTGA TTACCAGATG TTCTCCATCC TGCTGCTCGA CCCGATGGGC
GAGAAACTGC AACATCGCTT CTCACTGCGC TTCCAGGAGA ACATCCACCT CAAGCACGAT
ATCCCGCTGG GCCGCGGACT GGTGGGTTAT GCCGTCGGGA AGAACGAAGC TGTGCTCGTC
CCCGACGTCC GCAAAGATTC GCGTTACATC ATGCTGAATC CGGAGACGCG CTCTGAGCTG
GCGGTACCGC TGGTTTACAA GGGCAAAGTC ATCGGAGTCC TCGACCTTGA GCACACCAAG
CGCGGCTTCT TCACAGAAGA TCACAAGCGC ACGCTAACCA CGCTTGCCGC GCAGATCGCC
ATCGCGATTG AAAACGCCCG GCTCTACGAG CAGATCGTGA AACAGGAGCG GCGGCTCGAG
CAGGACATGG CGCTCGCCCG CGAGTTGCAG CATCACTTGC TTCCGGCGTC GCTACCGAAG
ATGACTCACG CTGAGGTCGC CGCCAAGTTC TCTCCCGCCC GCGCCATCGG CGGCGATCTC
TACGATTTTC TCCGGTACTC CGGCGGTTAT CTGCACGGCA TCGCGGTGGG CGACGTCAGC
GGAAAAGGAG CACCCGCGGC CATTTATGCC GCGCTGGCCA GCGGCATCCT GCGCTCACAT
TCGCAGGAAG AGCCGGGCGC CGCGGAGATG CTGAGCATCG TCAATTTATC GCTTTCCGAT
CGTCCCATTG ATGCGCAGTA CATCTCGCTG ATCTATGCCA TTTGGGATGA CTCGGCTCGG
ACTTTGCGAC TCGCGAACTC GGGACTACCG CGCCCCATGC ACTGCCGCAA GGGCAAGGTC
ACGCGCATTG ATGCGACCGG TCTGCCACTC GGGCTCTTCG CCAGCGCCGA GTATGAAGAG
ACTGCGATTC GCGCCGAAGC CGGCGACGTC TTCGTCTTCT TCTCCGACGG TATTCTCGAC
GCGCGCAACC GTGCAGGCGA ACTCTTCGGC AGTGGCCGCG TCGAAGGCCT GGTATGCGAG
TTTGCCGGCG AACCGGCCCA GAAGATTGTC GATTCGATCT ACAACGCCGT GTGCGATCAT
GCCGTCGGCG TGGAAACCTT CGACGACCAA ACCATTGTCG CGCTGAAAGT AAAAGCAGGA
AGGGCGCGCA AGTGA
 
Protein sequence
MKASQQHPSI DAFLLEVADV VNTTLDLDTL LTRVAGLVRK IINYDIFAIL LVNEKAQELR 
IRFQVGHPAE AIERTRIKIG QGITGAAAQT REPVLVNDVS KHPEYITSVS DVRSELAIPM
IVKNKVIGVI DIEAPQKNYF TEEHSRLLTV IASRVAIGIE NARLYTRVSN QAKSLLLLNE
ISRELTSILN LDQLLKRVGE LLTRVIDYQM FSILLLDPMG EKLQHRFSLR FQENIHLKHD
IPLGRGLVGY AVGKNEAVLV PDVRKDSRYI MLNPETRSEL AVPLVYKGKV IGVLDLEHTK
RGFFTEDHKR TLTTLAAQIA IAIENARLYE QIVKQERRLE QDMALARELQ HHLLPASLPK
MTHAEVAAKF SPARAIGGDL YDFLRYSGGY LHGIAVGDVS GKGAPAAIYA ALASGILRSH
SQEEPGAAEM LSIVNLSLSD RPIDAQYISL IYAIWDDSAR TLRLANSGLP RPMHCRKGKV
TRIDATGLPL GLFASAEYEE TAIRAEAGDV FVFFSDGILD ARNRAGELFG SGRVEGLVCE
FAGEPAQKIV DSIYNAVCDH AVGVETFDDQ TIVALKVKAG RARK