Gene Acid345_1122 details


Gene Information

Locus tag: Acid345_1122
Symbol:
ID: 4069237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1398160
End bp: 1399176
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 51%
IMG OID: 637983131
Product: XRE family transcriptional regulator
Protein accession: YP_590199
Protein GI: 94968151
COG category:
COG ID:
TIGRFAM ID:
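
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated on the page, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1398160
end_bp = 1399176

# Assumption: 1-based inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1017 bp
print(protein_length, "aa")  # 338 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.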


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000113456
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGATCAAG AAACGAAGTC ATTAGCTCTC CGCAAACGGA TCCAGCAACA TTTCCAAACA 
GAATCCGCGG AACAAATCGT CGAGAATTCA AGACGACTGG TTGGGGCAAC AGACCCTCAC
GGCTTCGGGG CAATATCAAT CGAACCAGAG AACCCGATCT TGGTGCATCC AAGCCCTAAA
TCGGTTCCGC TGAACGCATA TTTGGCTTCA GCATTGACTG GATTGGATGA GGTGGAAAGG
TCGCTCATCA TTCACCTCTC TGATGTCGTA GCGCTCGTGT GCCGATCAGT CGACATAGAT
TTGTATGAGC CTCGTAAGAG CACGGATCCA GTTCACCATG CGGACGTGTC TGCCACCGAA
GTTTTCATCA CAGACCGAAA ACGGGTGGTG AGCTCGGACC TTCTTATACA TCTATGTCAC
TTTCCCAGCA CTGGATCCGG CGAAGAACTT AGCTTTGCAT ATGAGTCTCT GGTTCCGATT
ATCTTGATCG CTCCAGGTGA ACGAAGCGTC AGCCGGATGG TCACTGGTAT TCCTAGCCTC
AAGATAGACA TACGATACAG AGAACCTGAG CACCTGCGCG CCATGCTAGA AGAACGACTG
ATCGAGATCC GCCCCTTCTT GGAACAGCGG AAACTTACGA TCGACGGATT CAGTCAAAAT
ATCGTTGGAT CTAGGATCCG GGAATTACGC CTCGAAGCCG GTCTTTCGCT AGGCGATTTG
GCAAAACGGG TCGGACTAAC CGAGCAAGGG CTGCAAAACA TCGAAGAGAA TGTGGACACG
ATCTCGAATC CTGGGCTAAC AGTGTTGCGA TGGATCGCGA CCGCGCTGAA GACGACAGTC
GCAGAGCTGG TTGACCCGGA TTATGCAGAA AACGTGATCG CTGGGATTCG GTCTACTTTC
AATGAGCGCG CGAGCGCAAT TGCCGCCCGC TTTAGTGGAA TATCCCAAAA GGATAAGAGA
GCCCTCCTGC GGCGTTATCT GCACCGAACG CTAGTATTGC TCGACGAAGA GGAATAA
 
Protein sequence
MDQETKSLAL RKRIQQHFQT ESAEQIVENS RRLVGATDPH GFGAISIEPE NPILVHPSPK 
SVPLNAYLAS ALTGLDEVER SLIIHLSDVV ALVCRSVDID LYEPRKSTDP VHHADVSATE
VFITDRKRVV SSDLLIHLCH FPSTGSGEEL SFAYESLVPI ILIAPGERSV SRMVTGIPSL
KIDIRYREPE HLRAMLEERL IEIRPFLEQR KLTIDGFSQN IVGSRIRELR LEAGLSLGDL
AKRVGLTEQG LQNIEENVDT ISNPGLTVLR WIATALKTTV AELVDPDYAE NVIAGIRSTF
NERASAIAAR FSGISQKDKR ALLRRYLHRT LVLLDEEE
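
The protein sequence can be reproduced from the gene sequence by codon-wise translation. What follows is a minimal sketch, not the method used to generate this record: it assumes the sequence is read in frame from its first base, uses the amino acid assignments of NCBI translation table 11 (which match the standard code for sense codons), and ignores the space-separated 10-base grouping. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGGATCAAG AAACGAAGTC ATTAGCTCTC CGCAAACGGA TCCAGCAACA TTTCCAAACA"
print(translate(first_line))  # MDQETKSLALRKRIQQHFQT
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field of this record.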