Gene Acid345_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3892 
Symbol 
ID4072227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4604301 
End bp4605779 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content62% 
IMG OID637985916 
ProductGntR family transcriptional regulator 
Protein accessionYP_592966 
Protein GI94970918 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.179019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.485504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGTC GTCCTTCATC GGCAGTGCCA ATGATCGGAG TCGATCGTCA TGCCGCACTG 
CCGCTGCATC GGCAGTTGTA TGAGAGCTAT CGCTCGGCCG TCCTCGGCGG CCGATTGCGG
CCGGGGCAGA TGGTGCCCTC CACCCGCGCG TTGGCAATGG AGTTGGAAAT CTCCCGAATG
CCGGTGTTGA CGGCATACGC ACAACTGCTC GCGGAGGGAT ATTTCGAAAC GCGCTCCGGG
ATCGGAACCG TCATCAGCGA GGCACTGCCC CAGCAACGTT CGGTGGCGGG CAAAGCGCAG
ATTCCGCCTC CAATCGAGCC CGGTCCGCGG AATGTCTCGT CGCGATGTTT GACGCCCTAC
CGGCCCTTCC TGCCTCCCTG GATCGCGGGG AGAGGCGCGT TCAACGTGGG AGGCGTGGCG
CTGGAACACT TCCCGCATCG TACCTGGACG CGACTGGTCG CGCGCAGCGC GCGCAACGGC
GCGGGCATTG GTCCGGCGCG CGACGTGATG GGGTCGATCG AGTTACGCGA AGCGGTTGCC
GATTATCTGC GGACGTCTCG GGCTGTGAAT TGCACGGCCG ATCAGATCAT GATCACGAAC
GGCTCACAGA ACGGCGTGGA ACTCACAGTG CGCGCGCTGC TCGATCCCGG TAGCGCGGTC
TGGATGGAAG AGCCTGGTTA CCAACTCGCA CGCGACGTGC TGAGCATGGC GGGATGCCGC
ATCGTGCCGG TGCCAGTGGA CCAGCAAGGC ATCGATGTGG TGCGCGGAAT CAAGATGTGT
CGCAATGCTC GCGCTGTGAT CGTTACCCCG TCACACCAAT TCCCTCTTGG GATGACGATG
AGCGTTTCGC GCCGGTTGGA GTTGCTGGCT TGGGCGTCGA AGCAAGGTTC ATGGATCATC
GAGGACGATT ACGACGGCGA ATTTCGGTAT GAGAGCAAGC AGATCGCGTC GTTGCAAGGC
CTCGATCGCG ACGGCCGCGT GATTTATATC GGTACGTTCA GCAAAGTGTT GTCATCTGCA
CTGCGCATTG GCTATGTGGT GATTCCGCGC GACCTGGTGC CGCACTTCAT GCGAGTTCGG
TGGTCTACCG ATCTTGGCTC AGAAGAGCTG ATCCAGACGG TCGTCAACGA CTTCCTGCGT
GAAGGCCATT TTGCGCGACA CCTGCGGAAC ATGCGCCTAA CCTATCGCGA ACGGCGGAGC
GTGTTGGTGG AGTCTCTCGA GAAGACGCTC GGAGGCGAGG CTGAAATCGT GGGAGCAGAG
GCGGGGTTAC ACCTGGTCGT GCTTCCAAAA GGCCTGAAGG ACGACGTGGA GATTTGCAAG
GCAGCGGCAC GGGAGCCACT GTGGCTGTGG CCGCTTTCAC ACTGCTATCA CGGTCCAGCC
GCAAAGCACG GGTTTGTCCT CGGCTTCGGC GGGACACCGC CAGAGAGAAT TCCGAGGGCG
GTGAAGCAGA TCCGCGAGGT GTTGCGCGGA CGGAGATAA
 
Protein sequence
MSSRPSSAVP MIGVDRHAAL PLHRQLYESY RSAVLGGRLR PGQMVPSTRA LAMELEISRM 
PVLTAYAQLL AEGYFETRSG IGTVISEALP QQRSVAGKAQ IPPPIEPGPR NVSSRCLTPY
RPFLPPWIAG RGAFNVGGVA LEHFPHRTWT RLVARSARNG AGIGPARDVM GSIELREAVA
DYLRTSRAVN CTADQIMITN GSQNGVELTV RALLDPGSAV WMEEPGYQLA RDVLSMAGCR
IVPVPVDQQG IDVVRGIKMC RNARAVIVTP SHQFPLGMTM SVSRRLELLA WASKQGSWII
EDDYDGEFRY ESKQIASLQG LDRDGRVIYI GTFSKVLSSA LRIGYVVIPR DLVPHFMRVR
WSTDLGSEEL IQTVVNDFLR EGHFARHLRN MRLTYRERRS VLVESLEKTL GGEAEIVGAE
AGLHLVVLPK GLKDDVEICK AAAREPLWLW PLSHCYHGPA AKHGFVLGFG GTPPERIPRA
VKQIREVLRG RR