Gene Acid345_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1389 
Symbol 
ID4068924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1685170 
End bp1686822 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content59% 
IMG OID637983398 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_590465 
Protein GI94968417 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.531963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0257718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCTG AGATCCACGA ACGTACCTGG CTTCAGTGGC TGGTCAAGGT ACGTATCATC 
ATCATCACCT TCCTGCTTGG AATTGAACTC GCGATCACCA ACATCACGCC GAGTTCAGTT
TCCACCAGGC TCTTCGTCAG CGTCATCGTG CTCTGGTACA CGGTGGCCGC CTTCCTTATT
CTTCTCGCTG CCATTTGGCG CGAAACCCGC GTCCAGTCTC ATCTCCAGGT TTTCACAGAC
CTTTTCTTCG TCACCGCGGT CATCTTCGCG ACCGGTGGCG TCGACACCTC CTTTAACTTC
CTCTATCCGC TGGTCATCAT CATGGCCAGC GTGCTGCTCA CGCAGACCTG GACTTACATC
ACCGCGGTCC TCTCGGCGAT CGCTTTTACG CTCGTGCTGC AATTGGGATA CTGGGGCACC
ATCCCTTCCT ACGGCTTGCA GCACACCGAT TCGCGCAGCC TCAACATCGT CATCCTCGTC
AACTGGTTCG CGTTCATCGC GGTCGCGTAT CTTGCCGGCC GCCTCGCTGG ACGCCTTCGC
CAAATCGGCG TCGAACTCGC CGACAAGAGC GGTGAACTCC TTAACCTCCA GGCGCTGCAT
ACCAACATCA TCCAGTCCAT CAGCGCGGGG CTTATCACCA CCGGCAACGA CGGACTGATT
CACGTCGTCA ACAAAGCCGC TTCACGTTTT ACCGAGCGCG ACGAAAACGA ACTCATCGGC
ACTTCCATCA GCGACCACTT TCTCGACCCG TTGCCGATCG TCGCTTCTGC GCCGGTGCAT
GCCGAAATTC GCATGAAGAC ACCTACGGGG CGCCAGAAGA CTTTCAGCAT GATTGGTTCG
GCACTGGTGG TGCCGGAGCG CGGCGCCGTC GGCTATATCT ACACCTTCGA CGATCTGACA
GAACTGCGCC GCCTCGAACG CGAAGTGCGC CTCCGCGACC GCCTCTCTGC TGTTGGACGC
ATGGCCGCCG GCATCGCCCA CGAAATCCGC AATCCGCTGA CCTCGATCGC CGGATCGACC
AAGATGCTCG CCAGCATGTC GGACCTCAAC GAAGAGCAAC AGACACTCGC GAATATCGTG
ACGCGAGAAT CCGATCGCTT GAACTCGATC ATCACCGACT TCCTCTTCTA CGCACGTGAC
AAGAAATTCG AGCTCCGTGA GATTGACGTC ATCCCGGTGC TGAATGACAC CCTGGTACTG
TTGCAACACC GGCCCGGCAT GAACGTCGCC ATCGAGCGCC GCTTCGAAGC CGACAAAGCT
CTCTGCATGG CCGACGGCGA CAAGCTCAAA CAAGTCTTTT GGAACCTCAG CGACAACGCA
TGCCGCGCCA TGCCCGACGG AGGCACGCTC ACCGTCACGG TTCGCCCCGA CCATGAAGTC
TGGCGGGTGC ACTTCGGCGA CACTGGCCCC GGCATGACCG GACCGCAACT CGAAAAGATC
TTCGAACCCT TTCAAACTGA GTTCTACGGC GGCACCGGCC TCGGCCTCGC CATCGTCTAT
CAAATTGTGC AAGGCCACGA GGGCAAGATC TCGGTCCGCT CCGCGCCCGG ACGCGGCACC
GAATTCATGC TCCAACTGAA ACGCCCTACA AAACAATCGT TGCTGGCCGA AACCGAACCC
GTCTCCGCCG CGGCTTCAAA GGACGTCCGA TGA
 
Protein sequence
MRAEIHERTW LQWLVKVRII IITFLLGIEL AITNITPSSV STRLFVSVIV LWYTVAAFLI 
LLAAIWRETR VQSHLQVFTD LFFVTAVIFA TGGVDTSFNF LYPLVIIMAS VLLTQTWTYI
TAVLSAIAFT LVLQLGYWGT IPSYGLQHTD SRSLNIVILV NWFAFIAVAY LAGRLAGRLR
QIGVELADKS GELLNLQALH TNIIQSISAG LITTGNDGLI HVVNKAASRF TERDENELIG
TSISDHFLDP LPIVASAPVH AEIRMKTPTG RQKTFSMIGS ALVVPERGAV GYIYTFDDLT
ELRRLEREVR LRDRLSAVGR MAAGIAHEIR NPLTSIAGST KMLASMSDLN EEQQTLANIV
TRESDRLNSI ITDFLFYARD KKFELREIDV IPVLNDTLVL LQHRPGMNVA IERRFEADKA
LCMADGDKLK QVFWNLSDNA CRAMPDGGTL TVTVRPDHEV WRVHFGDTGP GMTGPQLEKI
FEPFQTEFYG GTGLGLAIVY QIVQGHEGKI SVRSAPGRGT EFMLQLKRPT KQSLLAETEP
VSAAASKDVR