Gene Acid345_1656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1656 
Symbol 
ID4069804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2002212 
End bp2003372 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content58% 
IMG OID637983665 
Productsigma-54 activating ATPase 
Protein accessionYP_590732 
Protein GI94968684 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0864275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCTC CAATCATCAA CGAACTCACC GCGAACCCGG GGGTTCTCAT TGCAAGTCCG 
AACCCGGCGT TTCGTCGCCA GGTGATCGAC ACGCTGCCGA CTACCTGGCG GCCTGTACTG
GAGGCGCAGG GAGGGGCGGA CGCGCTTGGC AAGCTCGAAG CCAGCGATTG CCGAATGCTG
CTTCTCGATC GGCAGTTGCT GGACCTCGAT GTCGAGGAAT TGGCGGGACT GGTGAGACTC
CGGTATCCGG GGGTGGACGT TCAGACGCTG GAAACGCCTC GCATAAAAGG CAGCGCGGTT
GATATTCCCG AGAGGCATAG CAACATCAAC CAACTCGCGA TTGACACGCA ACTCTCACCG
CTGGATGGCA TGATCGGCAA TTCGGAGCGA ATGGGATCGG CGTATCGAGC TATCCGCAAA
GTCGCACCAC GGGACACGCC GGTATTGGTC ATGGGAGAGA CGGGCACGGG AAAAGAGCTG
GTAGCACAAG CAATTCATCG GCTCAGCCGC CGCTGTGAAA AAGCCATGGT AGTGATCAAC
TGTGCGGCCA TTCCCGAGAG TCTTTTGGAA AGCGAACTCT TTGGGTATGT CCGCGGCGCG
TTCACGGGCG CAGCACAGAC GCGGCAAGGA CGGATACAGG CGGCGAACGG GGGGACCTTG
TTCCTGGATG AAATCGGTGA GATGCCGTTC GAATTGCAGG CAAAGCTCCT CCGCTTTCTC
GAAACCGGCG AACTGCAACG CCTCGGAAGC TCTGAGACTT GGCGAGTCGA TGTGCGCTTG
GTTGCGGCGA CGAACCGGAA CCTGCGAGAG AGCGTCCAGA TGCAACGGTT TCGCGCTGAC
CTGTTCTATC GGCTTTGCGT TTTTCCGATC GTCCTACCAC CTCTGCGAGA CCGAAACGGG
GATATTTCCC AGCTTGCGAC CCACTTTCTC TCTACCTTCG ATCGCGATTG TTACTTCACC
CCGGCGGCAA TCAAGAAGCT CGACGCTCAT GACTGGCCGG GAAACGTACG CGAACTGAAG
CATGTGATCG AGAGGGCGAC TATTCTCGCG AACGACAAAG CCATCACGGT CGAGGACGTG
GTCCTCGATG CGGAAGCAGT CATGAACTAT GCAACCGATC TCAGCGTGCG GGGAGTGAAC
CATGCCGGAA CTTTCAACTG A
 
Protein sequence
MISPIINELT ANPGVLIASP NPAFRRQVID TLPTTWRPVL EAQGGADALG KLEASDCRML 
LLDRQLLDLD VEELAGLVRL RYPGVDVQTL ETPRIKGSAV DIPERHSNIN QLAIDTQLSP
LDGMIGNSER MGSAYRAIRK VAPRDTPVLV MGETGTGKEL VAQAIHRLSR RCEKAMVVIN
CAAIPESLLE SELFGYVRGA FTGAAQTRQG RIQAANGGTL FLDEIGEMPF ELQAKLLRFL
ETGELQRLGS SETWRVDVRL VAATNRNLRE SVQMQRFRAD LFYRLCVFPI VLPPLRDRNG
DISQLATHFL STFDRDCYFT PAAIKKLDAH DWPGNVRELK HVIERATILA NDKAITVEDV
VLDAEAVMNY ATDLSVRGVN HAGTFN