Gene Acid345_3087 details


Gene Information

Locus tag: Acid345_3087
Symbol:
ID: 4072651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3667183
End bp: 3668121
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 58%
IMG OID: 637985106
Product: PAS sensor signal transduction histidine kinase
Protein accession: YP_592162
Protein GI: 94970114
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
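
The coordinates and lengths in this record can be cross-checked arithmetically. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(3667183, 3668121)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.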


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.230742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAACGGA GTCATTTCAA TCCCGCCTGG TTAAAACTCA CTGGGCGCAC CCTCGCGGAG 
TCCATCGGTG AAGGTTGGAT CGCCTGTGTG CATCCTGATG ATCTCTTACC CCTGCGAGCG
GCGCGTACAA ATGAAAGTGA GGGGCATGCC GCCGAGTATC GCCTGAGAGA TGCCTCCGGA
AACTATCGTT GGATGCGCGA TATGGCTACC CCGGTGACGA TTGGCGATCA TCATGGGCTG
CTTGGCTATT GCATCGATAT TGAGGGGCAG AAGCAGGCGG AACATGCCCG TCGCGAACTC
GCGGAGCGCC TCGTCACCGC TCAGGAAGCC GAGCGTGCCC GCATCGCCCG CGAACTCCAC
GACGACATCA GTCAGTCCCT CATGATCCTG ACCTTCCAGA TGGAGCGCGC TGGAAAGCCG
GTCTCCAACG AGCCCGGCAA ACGCCATCCC GATGCGCCAG CGTTGGCCGA TGACCTTCGA
GAAGTCGTGC GACGAATCTC GAAGATGGCG CATGAACTGC ACTCTTCAAG TCTCGAGTAC
CTCGGATTGT CCCGTGCCAT CCAGGGCGCT TGCCGCGAGT TTTCGAAGCA ATGGAAGATA
CCTGTGGATT GTGAACTCGT CGATCTGCCC CAGAAGCTCG ACAACATGAC CTCCCTTTGC
CTGCTCCGCA TTACCCAGGA GGCGCTGCAC AACATCGCCA AACACAGCCG CGCCACCAAC
GTTACGGTTC AGCTCTCTAG CACGGAAGAT CAGCTTAAGT TGGTCATTCG CGACAACGGC
ATCGGTTTCG ATGTCGAGCA GGCAAAGCTG GCAGGCGGCA TCGGATTGCT GAGTATGCGC
GAACGAACCA ACCTCGTTCA CGGTACTCTC GCGCTCGTCT CAGTCCCCGG CCAGGGAGCC
ACCGTTGAAT GTTCCGTGCC GTTCAATCAT GACCATTGA
 
Protein sequence
MQRSHFNPAW LKLTGRTLAE SIGEGWIACV HPDDLLPLRA ARTNESEGHA AEYRLRDASG 
NYRWMRDMAT PVTIGDHHGL LGYCIDIEGQ KQAEHARREL AERLVTAQEA ERARIARELH
DDISQSLMIL TFQMERAGKP VSNEPGKRHP DAPALADDLR EVVRRISKMA HELHSSSLEY
LGLSRAIQGA CREFSKQWKI PVDCELVDLP QKLDNMTSLC LLRITQEALH NIAKHSRATN
VTVQLSSTED QLKLVIRDNG IGFDVEQAKL AGGIGLLSMR ERTNLVHGTL ALVSVPGQGA
TVECSVPFNH DH
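
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method. It assumes the standard codon assignments (which translation table 11 shares), treats the first codon as a valid start codon and writes it as M, and stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS; the first codon is assumed to be a start codon (M)."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCAACGGA GTCATTTCAA TCCCGCCTGG"))  # MQRSHFNPAW
```

The GTG at the start of the gene is an alternative start codon, which is why the protein begins with M rather than V. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.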