Gene Acid345_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3601 
Symbol 
ID4072823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4260739 
End bp4261890 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content60% 
IMG OID637985624 
Producthypothetical protein 
Protein accessionYP_592676 
Protein GI94970628 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.897301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0489209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGTGGG GTCAGCGATA CGTGTACCTG GCGGTTGCCA CCCTCCTGTT CTCCCCCGTT 
TTTCTGCGCG CCCAACTCAC GACCGACGAC CACGTGCTGG GTTTCGAGTT CTGGCCCACC
AAAGAGATTG CTTCGCAAAA AGACATTGTC GGCAGCCAGG TTTGTGCCAG TTGTCATGCC
GACAAAGCCA ACACGCAGAA GATCACGCCG ATGGGAGAAA CTTCCGTCCA CGCCGTGGAC
GCCTCCCCCC TCCGCGATCA TCCCGCACTG ACGTTTAAGG GTGGCGCCGC CACGTACGAG
ATCCACACCG ACGGGACCCA CGCAACCTTT AGCGCCACCG TCAATGGGCA GTCCAAGTCC
GCAGACCTGC TATGGGCCTT CGGCAACGGT CACCTGGGAC AGTCTTATCT TTTCAAGAAA
GAGGACGGCT ATTACTACGA AGCGCGAGCG TCGTACTTCG AGGTACTGAA ATCGCTGAGT
TGGACGCCGT CCCGCGAACT GACGCGGCCC GAGTCGGCAG ACGAGGCAAT GGGACGCCGC
ATTCCCGATA CCGAGTTGAA GAAGTGTTTC GGTTGCCATA CCACGGGATC GAACGTCGCC
GGACGTCTCA CCGAGACCAA CGTGAAGTCT GGCGTGAGTT GCGAAGCCTG CCACGGACCT
GGGGCCTCGC ATGCGGCGGA AGCTGCGGTG GCCATGTCGG CAGGAACGCC GGATGCCGCG
CGCGGCGGCA TTCTCAATCC CGGCAAGCTT TCACCCAGTG ATTCCGTGGA TTTCTGCGGT
GCCTGCCACA TTTCCTATTG GGACCTGACG CTGCAGCGCG GGCGCGGAAT CGCCACATTA
AAGGCGCAGC CCTTCCGCCT GGAGCAGAGT AAGTGCTGGC AAAAGGGCGA TGCTCGCCTG
ACCTGTACGG CCTGCCACGA TCCCCACAAG CCGCTGGTCA CGGAAGCCAA ATCTTACGAC
CACAACTGCC TGCAGTGTCA CGTGCTGATG GGCCAGAAAC CCACGGCCGA GCAAATCGGT
AAAGCCTGTC CCAAGTCCAC CAGCGACTGC GCTACCTGCC ACATGCAGAA GATCGAATTG
CCCGACTTCC ATCATACTTT CACCGATCAC CGGATTCGGA TCGCCAAAGC CGGAGAGCCG
TTCCCGGATT AA
 
Protein sequence
MKWGQRYVYL AVATLLFSPV FLRAQLTTDD HVLGFEFWPT KEIASQKDIV GSQVCASCHA 
DKANTQKITP MGETSVHAVD ASPLRDHPAL TFKGGAATYE IHTDGTHATF SATVNGQSKS
ADLLWAFGNG HLGQSYLFKK EDGYYYEARA SYFEVLKSLS WTPSRELTRP ESADEAMGRR
IPDTELKKCF GCHTTGSNVA GRLTETNVKS GVSCEACHGP GASHAAEAAV AMSAGTPDAA
RGGILNPGKL SPSDSVDFCG ACHISYWDLT LQRGRGIATL KAQPFRLEQS KCWQKGDARL
TCTACHDPHK PLVTEAKSYD HNCLQCHVLM GQKPTAEQIG KACPKSTSDC ATCHMQKIEL
PDFHHTFTDH RIRIAKAGEP FPD