Gene Acid345_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0420 
Symbol 
ID4069646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp487964 
End bp488917 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content56% 
IMG OID637982424 
ProductLytR/AlgR family transcriptional regulator 
Protein accessionYP_589499 
Protein GI94967451 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3279] Response regulator of the LytR/AlgR family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0328897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0302219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGA AAATCGCGCT CCGCGCGGCA TCCGCTTCAC AACCAGCAAC TTCTGTCCCT 
GATAGTTCGC CGCTCGTCCC TCATTCCAGT GCACCGGATT TACTTCGTGT TTTCGCTGGG
ATAAGGTTCG AGCAACTCGG GAGGGAGCGA GTCGAAGTGG CAACCAATCC TGTCGTCGAC
TTGGAACTGC GCGCGGTCAT CGTAGAAGAT GAACCCCTCG CGACTATGAA GCTCAAGAAA
CTTCTCGATG GCGAGCCTGG CATTCGTGTG ATGGGCCACG CGGCAAACAT CGATGAGGCC
GTCGCTCAAA TTAGGAAGTT GCAGCCTAAC GTCATCTTTC TTGATGTTCA GCTTCCGGGC
GGTAGCGGTT TCGAAGTTCT CGATCGCCTG CCCGAAGCAA ACCGTCCCTG CATCATCTTT
ACGACCGCAT ACGACCAGTA CGCAATCAAG GCATTCGACC TCCATGCCGC CGACTACCTA
CTGAAGCCCT TCGACCAGGA ACGCCTCCAT CGCGCAATCG AAAAGGCTCG CGTTCAGGCT
GCGGGCCCGG TTGACAAGGA ATGGACCCAA AGGCTGGCGA AGCTGATCGA GACTCTGCCA
AAGAATGGGC GCAACCCCGA AGAACGTTTG GTTTTCAAGA GCGGCGGACG CGTACTATTC
CTGGAGCAGG ACGAAATCGA GTGGATTGAG GCGGCTTCCA ATTACGTCCA CATCCACTGC
GGCGGGAACG TGGTGCACTC TGTGCGCAAT ACCATTCGCG ACGTCGAGAC GCGGCTCAAT
CCTCTCAAAT TCATCCGCGT CCATCGTTCG ATCATCGTGA ACGTCAATCG GATTAAGGAA
CTCCAGCCCT GTAACAGTGG CGAGCATGTG GTGGTGATGA AGAGCGGAAA AGAACTTTCC
TGCAGCCGCA CGTATCGCAT GGCGATCGCC TCCATCACTC GCGAGCGGCT TTGA
 
Protein sequence
MQQKIALRAA SASQPATSVP DSSPLVPHSS APDLLRVFAG IRFEQLGRER VEVATNPVVD 
LELRAVIVED EPLATMKLKK LLDGEPGIRV MGHAANIDEA VAQIRKLQPN VIFLDVQLPG
GSGFEVLDRL PEANRPCIIF TTAYDQYAIK AFDLHAADYL LKPFDQERLH RAIEKARVQA
AGPVDKEWTQ RLAKLIETLP KNGRNPEERL VFKSGGRVLF LEQDEIEWIE AASNYVHIHC
GGNVVHSVRN TIRDVETRLN PLKFIRVHRS IIVNVNRIKE LQPCNSGEHV VVMKSGKELS
CSRTYRMAIA SITRERL