Gene Acid345_3711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3711 
Symbol 
ID4070461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4384996 
End bp4386909 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content54% 
IMG OID637985734 
ProductLuxR family transcriptional regulator 
Protein accessionYP_592786 
Protein GI94970738 
COG category[T] Signal transduction mechanisms 
COG ID[COG4566] Response regulator 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.731459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCAC CTGATCTTTC CAGCAAAATT GATGCGTTTC CCGAGCCGAC GCTTTTAATC 
GACCGCACCG GAACGGTTGT CGCTTCTAAT CGGAGTGCGT CGCAGTTGCT GCGGTTATCA
TCGGCGAGAA TCATCGGGAA ACCCCTTCGT AAGCTCGTTC GTGAGGATCC CGAGAAGCTG
GCGGGATACA TTCGCGGCTG GCTGCGCAAT TCCAAACAGG CCTCGGACAA GCGGACCACG
CTGACGGTCC TTCCCAGGGA TGGGCGAGAG ATTGTTTGCA GAGCTGTAGG AGCATTATTG
CCGGCTCGCG TGGATCAACC CCGCCTGCTT TGCCTTAGGT TTCTAGCTCT TACAATGACC
GCGACAGAGC AACGCGAACC GCATCTCCTT TCCGCGTGTA AAAAGCAAGC GCAACCGCAA
GCCGATCAAA GATGGCGAAC TGCCTTCGAG AACGCGGCGA TCGGAATCGT CATGGCGGAC
TTTAATGGTC GCTATTTCTC TAGCAATGCT GCGTTCCGCA GAATGTTGGG CTATACCGAG
GCCGATCTCT ACCGATTAAC CTTCGATGAA GTCACGTTCG AGGGTGACCG TGAAGCTAAT
CTGTTGTTGG TCCGCGAGCT CGTAGGCGGC AAACGACGAC ACTTCGAACT CGAGAAGCGC
TACCGCCGCA AAGACGGCAC TTTGCTTTGG GTACGGCAAC ACGTTGCACT AGTTCCAGGA
ATGCAAGGTG TTGCGCCATT TTGGCTAGGT GTTGTCGAGG ACATTACCGA CGGCAAGCGC
GTTGAACATG AGCTCAGCGT GCAGCGACAG AAATTTCTTG AGAGCGAGGC GCGGCTGCAG
GCGTTTTTTG AAAATAGCCC CAACCCGATT TTCATTAAAG ATCGAGAGGG CCGATATCTC
CACGTTAATA GAGAATTCAA ACGAGTTCTC TGCTCCGCGC AAAAACGAGT CCTAGGAAAG
AGAGACGACG AACTGTTCTC AGCCGAACAG GCCGCTGCTT TTCAGGCCAA TGACCGGCAG
GTGCTCGAAG CTGGCGTTCC GATGGAATTT GAGGAGACCG CTTTCGAAGA AGATGGGCAG
CACACCAGCA TCGTCCAGAA GTTTCCGCTA TTGAATGCGG AGGGAGAGAT TTACGGCATT
GGCGGCATTG TTACCGATAT CACTGAGCGT AAGAAATCGG AATCCGCGCT TCGATTCAGT
GAGGAGTCGT ATCGCGTTGT GGTCGAAACG GCCAACGATG CCGTTATCAG CACAGACGAA
ACCGGCACGA TCCGTTTTGC CAATTCCACT ACTTCGAGAG TCTTTGGATA TGACTCAACT
GAGCTGATCG GGAAGCCCTG GACTGTGCTG ATGACAGAAC ACCTGCGCCA GGTACATGAG
GCCGGATTCA GACGCTACTT GGAAACTGGC GTGCGGCATA TGAACTGGCA AGGTACCGAG
CTGGTTGGAC TGCACAAGAA TGGGAGAGAG TTCCCGGTAG AGATTTCAAT CGGAGAGCTT
GCCCGGAGTG GCCGGCGGAT GTTCACCGGC TTTATCAGAG ATATCAGTGA AAGAAAGCAG
GCAGAAGAAA TCCGAACCAC TGCATTTGAA TTCCTCACGA AGCCATTCAG CGATCAGGAT
CTGCTGGAGG CGATTCGCAT CGCACTGGAG CGACATCGCC ATAAGCAGGG ACACGAGCAA
GAGCTGGCGA AGCTTCGACG GCGCTTCGGG TTCTTGAGCT TCCGGGAGCG GGAAGTAACC
TCTATGGTTG TTTCGGGCAT GGCCAACAAA CAGATCTCCG TTAAACTCGG GATATCGGAA
AACACCGTTA AGGTTCACAG GAGTCGAGCA ATGAAGAAGA TGCAGGCCCG GTCTCTTCCT
GAGCTCGTCA GAATGATGGA ACAGCTGAAA GACTTTTTTG AAAAGCCAGC GTAA
 
Protein sequence
MGSPDLSSKI DAFPEPTLLI DRTGTVVASN RSASQLLRLS SARIIGKPLR KLVREDPEKL 
AGYIRGWLRN SKQASDKRTT LTVLPRDGRE IVCRAVGALL PARVDQPRLL CLRFLALTMT
ATEQREPHLL SACKKQAQPQ ADQRWRTAFE NAAIGIVMAD FNGRYFSSNA AFRRMLGYTE
ADLYRLTFDE VTFEGDREAN LLLVRELVGG KRRHFELEKR YRRKDGTLLW VRQHVALVPG
MQGVAPFWLG VVEDITDGKR VEHELSVQRQ KFLESEARLQ AFFENSPNPI FIKDREGRYL
HVNREFKRVL CSAQKRVLGK RDDELFSAEQ AAAFQANDRQ VLEAGVPMEF EETAFEEDGQ
HTSIVQKFPL LNAEGEIYGI GGIVTDITER KKSESALRFS EESYRVVVET ANDAVISTDE
TGTIRFANST TSRVFGYDST ELIGKPWTVL MTEHLRQVHE AGFRRYLETG VRHMNWQGTE
LVGLHKNGRE FPVEISIGEL ARSGRRMFTG FIRDISERKQ AEEIRTTAFE FLTKPFSDQD
LLEAIRIALE RHRHKQGHEQ ELAKLRRRFG FLSFREREVT SMVVSGMANK QISVKLGISE
NTVKVHRSRA MKKMQARSLP ELVRMMEQLK DFFEKPA