Gene Acid345_1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1666 
Symbol 
ID4069814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2015940 
End bp2017232 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content51% 
IMG OID637983674 
Productputative transcriptional regulator 
Protein accessionYP_590741 
Protein GI94968693 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGA CATTGGCGCA ATCACTCCAG AGTGTTCTAG ATGAATTGTC AGTAGGAATC 
ACTGCCAATG GTTATCGCTC TCTAAACCTA CGTGTTCTCC TAGAGAATCC GAACAAGTTC
GTAACGGGAT CGCTAACCTT CAGCGAAAAC AGCTTTCGCG TATCCGATCC GGAGATACAT
AAGTACAACG ATGCGACGCT CATCTCAGTG GTTTCTGACG TCGAGATGTG GCGCGATTTC
TTGGTGCGGA TGCTCACGGA TAACTTCTCG TTATTTGGTA CAGAGATCCC ACTATCGTTG
GCGTTCAGCC AGTCACAACC GGATACCGAT CTCAATTCGG ACATTTCCTC TCCGCGCTTC
GCGTATCACT TCGATCTTAA TAATCGCTAC TACCATCAGA CCCGAGGAAC ACTCATGGCG
ATAGGGCGAC CACCGTATGC CAGTATCGAG GATGCGTCAG CGCGATTCGT GCATCAATTG
AAATCACGAA CAGGTGGGAT TCTCCACGAA GGCAGGCTGG TGGTGAGCCT TCCGACGAAA
CCGCTCATCG CTTCCGCCGA ATGGATTCCT GGCGAGCTCC GAATAAGATT TCATGAAGGG
CTGCAGCCGC GTTATCAATT GGATATTTTG TTTTGGACGG GCGCCTCCGT CATCGAATCT
CAATCACATT CCTCGCTCGG GCGAGACTTT ACGTCCGCCG TTCCGAAACG AACCACGAGA
ATTACGGCAT ACCTAATCCG CGATGATGGG AATGTCGCGC ATAGCTTCGA ATTGCGGTCT
CCGTATACTT TCGTCGGGGA AAAAACAGCA ACACTCAGCT TGGAACAGCT GGTAAGAGCC
GACATCGCCG CCGGCGAAAG CGAGATACGG GAGATGAAGG CGTTTTTTAA CCCTGACGGA
AACACCGAAA TGAAGACGAG AGTGCTTCAT ACGACCATTG CGTTCGCTAA TACGCAGGGA
GGAACGATCT ACATCGGCGT TGAAGATAAC GGCGAATTTT CAGGAACTCC GAAACTGCGT
GGCGTGAGTT CCAAACGCGT TCCCTCAGAG TGTGCGAAGG ACATCTCCAA GAAACTGAGA
AAACTCATCA TCGAGAACAC ACGACCGGTT GTGGAAGTGC AAGCCAACGA AATCTGCCTT
TCAGGGGATT GGGTGATTGC GCTTTCGGTC GGGAAGTCCA CCGAGGTGGT GAGCACTCAT
CGAAATGTAG TGGTGATTCG AGCGGGTGCC AGTAACCGAC TCCCTGACCC AAAATGGTTC
GAACAGCGCA AAGCCGACAA CTCTTGGTTC TAA
 
Protein sequence
MASTLAQSLQ SVLDELSVGI TANGYRSLNL RVLLENPNKF VTGSLTFSEN SFRVSDPEIH 
KYNDATLISV VSDVEMWRDF LVRMLTDNFS LFGTEIPLSL AFSQSQPDTD LNSDISSPRF
AYHFDLNNRY YHQTRGTLMA IGRPPYASIE DASARFVHQL KSRTGGILHE GRLVVSLPTK
PLIASAEWIP GELRIRFHEG LQPRYQLDIL FWTGASVIES QSHSSLGRDF TSAVPKRTTR
ITAYLIRDDG NVAHSFELRS PYTFVGEKTA TLSLEQLVRA DIAAGESEIR EMKAFFNPDG
NTEMKTRVLH TTIAFANTQG GTIYIGVEDN GEFSGTPKLR GVSSKRVPSE CAKDISKKLR
KLIIENTRPV VEVQANEICL SGDWVIALSV GKSTEVVSTH RNVVVIRAGA SNRLPDPKWF
EQRKADNSWF