Gene Acid345_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0158 
Symbol 
ID4070070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp166381 
End bp167322 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content54% 
IMG OID637982158 
ProductDSBA oxidoreductase 
Protein accessionYP_589237 
Protein GI94967189 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1651] Protein-disulfide isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00718984 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTCTGT TCAAGAGTCT GGCAGTCGTA GTTCTGGCGG TCTGCGTGGG CTGCTCCGCC 
CAGTCATCGT CCAATACGCA ATCGCAGAAC AAGCTCTCGG ACGCCCAGCT GAACCGGCGA
ATCCAGGACC AAATTCGAGC TCATTTCAGC GTTCCTCCGC AGGTCAATAT AACGGTTGGT
GCGCCCAAGC CGAGCGACTT CAGCGGATAC GATGCGGTGA CCGTTGCGAT CTCGGACGGT
GATAAATCGA AGAACTACGA ATTCCTGCTT TCCAAGGACG GCAAGACCCT CGCCCGGCTT
TCGAAAATCG ATCTCAGCAA AGATCCCTAC GAAGAAAACA TGGCGAAGAT CGACATAGCG
AACCGGCCAG TGCGTGGCAA CAAAGACGCC AAGGTCACGA TCGTCAATTA CGACGATTTC
GAGTGCCCCT TCTGCGCCCG TATGCACAGC GAACTCGTCA ACGTCCTGAA GCAGTATGGC
GACAAGGTGC GGATCATCTA CAAGGACTAT CCCTTGACCG AAATCCACCC GTGGGCCGAT
CGAGCCGCCG TGGACTCAAA CTGCATTGCC AGCCAGAATA CGGACGCCTA CTGGGACTTT
GCCGACTACG TGCATAGCAA CCAGCCGGCA ATCACCGGTA AAAAAGAGGA ACATCGTTCC
GTTGCGGCAA TGCAGGAAGC CGTCGACAAG GTTACGCTGG ACATCGGACG TAAACACAGT
TTGAATGTGG ATCAGCTACA AGCGTGCATC AAGAATCAGT CCGAGAGCGC CGCGCTGAAG
AAGTCGGTTT CGGAAGCCAA TGGTTTAGAC GTGAGCGCAA CGCCCACGAT GTTCGTGAAC
GGCGAAAAGC TCGAAGGTGC AATTGAAGAA GACGCATTGA TTGATGTAAT CAAAAAGCAC
TTGCAAGAGC AGGGCAGCAC CGGCACGGGA GCGTCGAAAT AA
 
Protein sequence
MSLFKSLAVV VLAVCVGCSA QSSSNTQSQN KLSDAQLNRR IQDQIRAHFS VPPQVNITVG 
APKPSDFSGY DAVTVAISDG DKSKNYEFLL SKDGKTLARL SKIDLSKDPY EENMAKIDIA
NRPVRGNKDA KVTIVNYDDF ECPFCARMHS ELVNVLKQYG DKVRIIYKDY PLTEIHPWAD
RAAVDSNCIA SQNTDAYWDF ADYVHSNQPA ITGKKEEHRS VAAMQEAVDK VTLDIGRKHS
LNVDQLQACI KNQSESAALK KSVSEANGLD VSATPTMFVN GEKLEGAIEE DALIDVIKKH
LQEQGSTGTG ASK