Gene Acid345_1393 details

Gene Information

Locus tag: Acid345_1393
Symbol:
ID: 4068928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1692161
End bp: 1694431
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 60%
IMG OID: 637983402
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_590469
Protein GI: 94968421
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
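
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions agree with the values reported in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1692161, 1694431)
print(length)                  # 2271
print(protein_length(length))  # 756
```

Both results match the Gene Length and Protein Length fields of this record.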


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0285475
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGCGC CGGAACCAAT TCGTCCCCCC GACACTCGTC GCAAGCGGAC GATTATTGCC 
ATCGTCCTTG CTGTCCTCAT CTTCGTCCTT TTCACCATCA TCTTCTCGCA GGCGGCCTTC
AACCTTACCT TCCTGCATCC CGATACTTCC CAGCAAACTC TCATCTTTGC GGCACTTTCA
GCGCTGATTT TCCTGCTCTT CGTGGCGCTG GTGTTTGTGC TGCTGCGGAA CCTGGTGAAG
CTGTATTTCG ACAGCCAGAG CCGGGTGTTT GGCTCGCGCT TCCGTACCAA GATGGTGCTT
GGCGCCCTCG GGATTTCGCT CGGGCCGGTC ATCATCATGT TCATCTTTGC CTATGGGCTG
ATGAACCGCT CGATTGACCG CTGGTTCTCC AAGCCGATTG AAGAGGTAAG AGCGCGCGCC
GAGGGCGTGG CGTCGCTGCT GGCGAACTAC GCGATTGCGA ATGCGAATGC CGAAGCCCTG
AGCATTGCGG AGGGCGTCGA GGTAGACAAG GCCTACCAAA CCGGCAACTT TTCCAAGGTC
GTAGAAGAGA TGCGCCGTCA CGAGGCGACG CTGCAGGGCG GCTTTTCCGT GGCTCTGCAA
GACGACAACG CCGAGGCCGA GTTCCACGCT CCGCAGCCGT GGCGTCAGTT GCGGGCGCAG
ACGCCGGGAA TTCTTGCGCC GCAACCGCCC GATCATCCGC ACTCGATACA GATTGGCCAA
CGCGACTTCA TGGTGGGTGT AGCGCCCATC GGCAAGAACG GCCGCATCCT GGTGGCGATT
CCGCTGCCTG CAAATTACTC GCAGCAGTTG AAAGACATTG AGTTCAGCCA ACAGCAGTAT
TGGGAACTGC ACCAGCAGCG CAAACTCATT CGCCGCACCT ACATGGGGTT CCTGCTGCTG
CTGACAGCGC TGGTATTGTT CGCCTCCACG TGGGTGGCGT TCTATTTGTC GAAGCTGGTG
ACGCGTCCGC TGGTGGCCTT GGCCGAAGCG ACGCGCGAAA TGGCGATGGG ACGACTGGAT
TACCGCGTGG ATGTGGCGGC GCAGGACGAA CTCGGGGAGT TGGTGAAGTC GTTCAACAGC
ATGGCGGCGC AGATGGAGAG CAGCCGCGAA AAGATCGATG CTTCCACCCG AGAGCTTGCG
ATGGCGAACG TCGAAATCGA GGAACGGCGT CGGTATCTTG AGACCATCCT CGAAACCATC
CCGACCGGCG TGCTTTCACT GGATGCCCAG CGCCACGTTA CCCGCGTGAA TACGGCATTC
CGGCGGCTGG CGAGACTGTC AGAAGGCTAT ACGGCGAATC CAGGCACTAC GCTGACAGAC
ATTTTCCCCG AAGATGTGGT GCACGATCTG GAACACATGC TGCGTCGCGC CGATCGCATG
GGCACGACCA CCAGCCAGAT GGAAGTCTCG ACGCCACGTG TGCAGTTGAA TGTGGCCATG
ACGGTGTCGT CGCTGGACCC GCGGCCTCGA AATTCGGCAT TCCGCCTCGG CTATGTGATC
GTCGTGGAAG ATCTTTCAGA CTTGCTGAAG GCGCAAAAGC AGGCCGCATG GCGCGAAGTT
GCGCGGCGCG TAGCGCATGA GATCAAGAAC CCGCTGACGC CGATTGCGCT GGCCGCAGAA
CGCATTCGTC GGCATCTCGA CCGCGGGTTG CAGCCGGACG CAAATTCGCT GGCGGTGATC
CACAGTTGCT CAGACACGAT TGCAAGCTCT GTCGAAACGC TGCGCAACCT GGTGGACCAG
TTCTCGGCGC TGGCGCGGTT CCCGGCATCG CAGCCGCAGG CGTCGGACAT CAACGAGGTA
GTGCAGAGTG CGCTGCTGAT GTTTGAAGGA AGGCTGGAAG GGATTCGGGT CACAACCTTC
CTCGCGCCCG ATTTGCCGAA GGTGATGGCC GATCCCGCGG CGGTGAAGCG CGCCGTCGCA
AACCTGGTGG ACAATGCAGC CGAAGCACTG AACTCCTCCA TGCTCCGCGA GATCCAGATC
GCGACAAACC TCACGGGCAG CCGCGATATG GTGGAGATCG TCGTCGCCGA CACTGGCCAC
GGCGTGACCA GCGAGGTCAA AGAGAAGCTT TTCCTGCCGT ATTTTTCGAC GAAGCAGCGT
GGAACCGGGC TGGGGCTTGC TATCGTGAGC CGCATCATCG AAGATCATCA CGGAACGATT
CGCGTGGAAG AGAACTCTCC TGTCGGCACC CGATTTATCG TGGAGTTGCC GGTAGCGCCG
GAGAGCGCGA TCGCGACAGC AGCCGAGCAT GCACAACATT CTGATCGTTG A
 
Protein sequence
MAAPEPIRPP DTRRKRTIIA IVLAVLIFVL FTIIFSQAAF NLTFLHPDTS QQTLIFAALS 
ALIFLLFVAL VFVLLRNLVK LYFDSQSRVF GSRFRTKMVL GALGISLGPV IIMFIFAYGL
MNRSIDRWFS KPIEEVRARA EGVASLLANY AIANANAEAL SIAEGVEVDK AYQTGNFSKV
VEEMRRHEAT LQGGFSVALQ DDNAEAEFHA PQPWRQLRAQ TPGILAPQPP DHPHSIQIGQ
RDFMVGVAPI GKNGRILVAI PLPANYSQQL KDIEFSQQQY WELHQQRKLI RRTYMGFLLL
LTALVLFAST WVAFYLSKLV TRPLVALAEA TREMAMGRLD YRVDVAAQDE LGELVKSFNS
MAAQMESSRE KIDASTRELA MANVEIEERR RYLETILETI PTGVLSLDAQ RHVTRVNTAF
RRLARLSEGY TANPGTTLTD IFPEDVVHDL EHMLRRADRM GTTTSQMEVS TPRVQLNVAM
TVSSLDPRPR NSAFRLGYVI VVEDLSDLLK AQKQAAWREV ARRVAHEIKN PLTPIALAAE
RIRRHLDRGL QPDANSLAVI HSCSDTIASS VETLRNLVDQ FSALARFPAS QPQASDINEV
VQSALLMFEG RLEGIRVTTF LAPDLPKVMA DPAAVKRAVA NLVDNAAEAL NSSMLREIQI
ATNLTGSRDM VEIVVADTGH GVTSEVKEKL FLPYFSTKQR GTGLGLAIVS RIIEDHHGTI
RVEENSPVGT RFIVELPVAP ESAIATAAEH AQHSDR
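
The gene and protein sequences above are printed in blocks of ten characters, and the gene begins with GTG while the protein begins with M. The following is a minimal sketch of how the two listings relate. It assumes the sequence is a complete CDS, so the initiator codon is read as methionine, and it uses the codon assignments of the standard genetic code, which translation table 11 shares for non-start codons. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator (e.g. ATG or GTG) and is
    therefore reported as M regardless of its usual meaning.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCTGCGC CGGAACCAAT TCGTCCCCCC"))  # MAAPEPIRPP
```

The printed residues match the first block of the protein sequence. Passing the whole gene sequence to `gc_content` would give a value to compare against the GC content field.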