Gene Acid345_3674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3674 
Symbol 
ID4072277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4344327 
End bp4345739 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content60% 
IMG OID637985697 
Producthypothetical protein 
Protein accessionYP_592749 
Protein GI94970701 
COG category[S] Function unknown 
COG ID[COG3538] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0875618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTA CCCGTCGTGA CGTCGCCAAA CTCGGCGCAG CCGCACTCAT CAACGCCTGC 
GCCGAGCGAT CCTCCATCGC ACAGCAAGCT CCTTCATTCG ACCCTGCAAA AGGCCGCCCC
GCACCGAACC AGCGCAAGTT CCAAAGTGCC GCCGTCGAAG ACGTGATCGC CAAGACGAAG
GCGAAACTCG GCGACACGCG CCTTGCTCGG CTCTTCGAAA ATTGCTTCCC CAACACGCTC
GACACCACCG TTCGCACCGG CTCCCTCTAT GGCAAGCCCG ACACCTTCGT CATCACCGGC
GACATCCCCG CCATGTGGCT GCGCGACTCC AGTGCTCAGA TCTTTCCATA CCTGCCGCTC
GCCAAGCGCG ATCCCGATCT CCAGCGCCTG CTCGCCGGCG TAATCCACCG CCAGGTGCGC
TGCATCAACA TCGATCCCTA TGCGAATGCC TTCAACTTCG ATCGCGAAGG CAGCGAGTTC
GATCACGATC TGACGCAAAT GCGTCCTGAA CTCCACGAAC GCAAGTGGGA GATTGATTCG
TTGTGCTATC CCGTTCGCCT CGCATACCAC TACTGGAAGA CCACCGCCGA TAGCAGCGTC
TTCGATGAGC CTTGGCACGA AGCCGCTCGC AAAATCGCGA AGACGTTCCG CGAGCAGCAG
CGCAAGACCA GCCTTGGCCC CTACTCATTC CGCCGCGAAA CCACCACGCC CAACGACACG
CTCAACCTCG ACGGCTACGG CAACCCCGTG CGCCCGGTTG GCCTGATTGC CAGCGGCTTC
CGCCCCTCCG ACGACTCCTG CATCTTCCCG TTTCTCATCC CGTCGAACCT CTTCGCGGTC
CTCTCGCTTA CGCAGTTGAG CGAAATTCTC ACCCAGATCT ACAAAGATGA ACCGCTCTCC
CGCGATTGCA ACGCTCTCGC GCAGGAGGTT CGCGAAGCGA TCAACAAATA CGGCAAGACG
CAGCATCTGA AGTACGGCGA GATCTACGCC TACGAAGTGG ATGGCTACGG CGGCCAACTT
CTGATGGACG ATGCCAACGT CCCCAGCCTG CTCTCGCTGC CGTATCTCGG CATCTGCGAA
CCGTCCGATG CCATCTACCA GAACACCCGC CGCTTCGTTC TCAGCGAAGA CAATCCATAC
TTTTTCAAAG GCAAAGCCGC CGAAGGGATC GGCGGCCCGC ACGTCGGCCT GAATATGATC
TGGCCGCTCT CGCTGATCAT TCGCGGCCTT ACCAGCACCG ATCATGTCGA AATCGACAAT
TGTCAGAAGA CTCTCGTCGC CACCGACGCC GGCACCGGCT TCATGCACGA ATCGTTCGAC
AAGGACGATC CGTCAAAGTT CACTCGCGCA TGGTTCGCCT GGGCCAATAC GCTCTTTGGG
GAGTTCGTCC TCAAGACGCT GGCGCTGAAA TAA
 
Protein sequence
MNITRRDVAK LGAAALINAC AERSSIAQQA PSFDPAKGRP APNQRKFQSA AVEDVIAKTK 
AKLGDTRLAR LFENCFPNTL DTTVRTGSLY GKPDTFVITG DIPAMWLRDS SAQIFPYLPL
AKRDPDLQRL LAGVIHRQVR CINIDPYANA FNFDREGSEF DHDLTQMRPE LHERKWEIDS
LCYPVRLAYH YWKTTADSSV FDEPWHEAAR KIAKTFREQQ RKTSLGPYSF RRETTTPNDT
LNLDGYGNPV RPVGLIASGF RPSDDSCIFP FLIPSNLFAV LSLTQLSEIL TQIYKDEPLS
RDCNALAQEV REAINKYGKT QHLKYGEIYA YEVDGYGGQL LMDDANVPSL LSLPYLGICE
PSDAIYQNTR RFVLSEDNPY FFKGKAAEGI GGPHVGLNMI WPLSLIIRGL TSTDHVEIDN
CQKTLVATDA GTGFMHESFD KDDPSKFTRA WFAWANTLFG EFVLKTLALK