Gene Acid345_0291 details

Gene Information

Locus tag: Acid345_0291
Symbol:
ID: 4070512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 306628
End bp: 308268
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 60%
IMG OID: 637982292
Product: cytochrome c, class I
Protein accession: YP_589370
Protein GI: 94967322
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
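
The coordinate and length fields in this record agree with each other, which a short calculation can confirm. The sketch below is in Python and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(306628, 308268)
print(length, protein_length(length))  # 1641 546
```

Both results match the Gene Length and Protein Length fields reported for Acid345_0291.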


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.85938
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGTTT CGCGAACCCG CTCTATTTCT GTTATCGGAA TTTCATTGTT GCTCGCTGTA 
TTCGCCATCG CCCAGAACAA AGCATTTCAT AACGCCCCTG CCTCCGCGGC GGCAACGAAG
AACCCCGTCG CTGGAGACGC TGCGGCGATC AAAGCCGGTA AGAACATCTA TTCGCAGAAC
TGCGCTGCCT GCCACGGGCC CGATGGCGCG GGCACCGGAA ATGTTCCGTC GCTGAAGACT
GGCAAGGCGC AGGAAGCGAA GGACGGAGAA CTGTTCTGGT TCATCACCAA CGGCGATGAG
AACAACGGCA TGCCGTCGTG GAAAGGCCTG CCGCAAAGAC AGCGCTGGCA GGTGGTGAGG
TACATCCGGG CGATGAAGAC TGCCGGGGCT GCGGCGCCAG CAAGTGCGGC GGCTTCCACT
ACGACGGCGA GTCTGCCGAA GGCTTCCGGC AACGGGCCCT TTATTGACTA CCGCGATGAG
AAGCCGGGGA CGGTCCGCAA GATTACAGCG AAAGATTTGC CGCCGCCCTA TGCGACGAAG
TCGGCGGGCA ATGGTCCGCA CGTAGTGCCG CGTCCGCAGA ATGCGTGGCC GCAGGTACTG
CCGGATTTCA AGATCGATGT GTTCGCAAGC AACCTGAACA ATCCGCGCGA GATTGTTACC
GCGCCGAATG GCGATATCTT CGTCGCCGAG ACCGAACCCG GGAACATCAA GATCTTCCGC
GGCATGACTG CCGACGGAAA GCCGCAGCAG ACTTCAGTCT TCTTAAGCGG ATTGAAGGAG
CCGTTCGGAA TCGCGTTCTA TCCGCCGGGG CCGAATCCAG AGTGGATCTA TATCGGCAAC
ACCAACGCCG TGGTGCGCTA TCACTACACG AACGGCGACC TGAAGGCGCG GGGCGAAGCA
CAGAAGCTGG TGGACCTGCC GACCGGAGGT CACTCGACGC GCAACGTTCG CTTCAGTAAC
GACGGCAAGA CGATGTTCAT CGCCGTGGGC TCGGAATCGA ATGTGGACGA TCCCGAAGAG
AACACCGGCG AGAAGAACCG CGCGAACATC CTCGCGGCGA ATCCCGATGG CAGCAACGTG
CACGTGTTCG CGGCAGGCAT TCGCAATCCT GTCGGACTTG CGGTGAATGC CCAGACCGGC
GAGCTGTGGA CTTCGATCAA CGAGCGCGAT GCTCTCGGCG ACAACCTCGT GCCCGACTAC
ATCACGCACG TGCAGGAAGG CGGTTTCTAC GGCTGGCCGT ACTACTACAT CGGCGGAAAC
CAGGACCCGC GGCACAAGGG CAAGCATCCG GAGCTGAAGA ATAAGGTCAT TGTGCCGGAT
GTGCTGATCC AGCCGCACAG CGCGTCGCTG GGAATGACGT TTTACAACGG CAAACAATTT
CCGGCGGAGT ACCAGGGCGA CATCTTCGCC TGCGAGCACG GCTCGTGGAA CAAGGCGGTG
CGCGTGGGCT ACGAAGTAGT TCGCGTGCCG CTACACCAGA CGAATCATGC GACCGGCGAG
TATGAGGATT TTGTGACCGG ATTCGTAACA CCGGATGGAA ACGTATGGGG GCGTCCGGTG
GGCGTGACCG TCGCGCCGGA TGGATCGTTG TTGATCACGG ACGACGGATC GAACGCAATC
TGGCGCGTGA GCCACAAATG A
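
A GC content figure like the one in the record can be recomputed from the sequence text. The Python sketch below assumes the sequence is pasted as shown, with whitespace separating display groups of ten bases; `gc_fraction` is an illustrative name, and how the database rounds its reported percentage is not known from this page.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display line of the gene sequence; pass the full sequence for the gene-wide value.
first_line = "GTGCAGGTTT CGCGAACCCG CTCTATTTCT GTTATCGGAA TTTCATTGTT GCTCGCTGTA"
print(f"{gc_fraction(first_line):.0%}")
```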
 
Protein sequence
MQVSRTRSIS VIGISLLLAV FAIAQNKAFH NAPASAAATK NPVAGDAAAI KAGKNIYSQN 
CAACHGPDGA GTGNVPSLKT GKAQEAKDGE LFWFITNGDE NNGMPSWKGL PQRQRWQVVR
YIRAMKTAGA AAPASAAAST TTASLPKASG NGPFIDYRDE KPGTVRKITA KDLPPPYATK
SAGNGPHVVP RPQNAWPQVL PDFKIDVFAS NLNNPREIVT APNGDIFVAE TEPGNIKIFR
GMTADGKPQQ TSVFLSGLKE PFGIAFYPPG PNPEWIYIGN TNAVVRYHYT NGDLKARGEA
QKLVDLPTGG HSTRNVRFSN DGKTMFIAVG SESNVDDPEE NTGEKNRANI LAANPDGSNV
HVFAAGIRNP VGLAVNAQTG ELWTSINERD ALGDNLVPDY ITHVQEGGFY GWPYYYIGGN
QDPRHKGKHP ELKNKVIVPD VLIQPHSASL GMTFYNGKQF PAEYQGDIFA CEHGSWNKAV
RVGYEVVRVP LHQTNHATGE YEDFVTGFVT PDGNVWGRPV GVTVAPDGSL LITDDGSNAI
WRVSHK
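
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 implies for a GTG initiation codon. The Python sketch below shows that relationship. It is a simplified illustration, not the database's own pipeline: only the first codon is treated as an initiator, translation stops at the first stop codon, and `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at a stop."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("GTGCAGGTTT CG"))  # MQVS, the first four residues above
print(translate_cds("CACAAATGA"))      # HK, the last two residues above
```

Applying `translate_cds` to the full gene sequence should give the protein sequence listed in the record if the assumptions hold.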