Gene Acid345_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3112 
Symbol 
ID4070226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3697517 
End bp3699634 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content61% 
IMG OID637985131 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_592187 
Protein GI94970139 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCTG TATCGCGACG CGACTTTTTG ACGACGAGCG CTACGGCGGC GGCAGGGTTG 
GTGGTGGCGC TCCATCTGCC GTTGTCGAGT GAGGCAGAAA GTGCGGAGTT TGCGCCCAAC
GCTTACGTCC ATATTTCGCC TGAAGGCAAA GTGACGATCG TGGTGGCGCG CTCGGAGATG
GGGCAAGGGG TGAGGACTTC CCTGCCGATG ATTCTTGCGG AGGAACTCGA TTGCGACTTC
TCGCAGATTG CGATTGAGCA GGCAGGCGCG AGCACGTTGT TTGGCGACCA GACAACCGGG
GGAAGCGCAA GCGTGCGGAC CTGCTGGGAT CCAATGCGGA AGGCCGGGGC GCAGGTGCGC
GCAATGCTGG TGAGCGCGGC TTCGGCGCAT TGGAAAGTCG ATTCGTCAGG TTGCACGACG
GAGAATGGCT TCGTGATTCA TGCGGCTTCA AATCGAAAAG CGAGTTATGG ATCGCTGGTC
GGGGCGGCGG CGAAGTTGCC AGTTCCCGCA GAGCCGAAGC TGAAAGACGC GAAGGATTTC
AAACTGATCG GCAAGCCGAA GCAGCGTCTG GATACGAAGT CGAAGACGAA CGGGTCGGCG
ATCTTCGGAA TCGATTTCAA AGTGCCGGGA ATGAAATATG CGGTGCTGGT GCGGGCGCCG
AAGTTTGGCG CGACAGTAAA GAGCGTGGAT GACGCTCGCG CCAAAGGCGT GGCGGGCGTG
ACCCACGTGG AGAAGATTGG CGACTCGGCA GTCGCGGTAG TGGCGGATTC GGTGTGGGCG
GCGATGTCGG GGCGACGCGC ACTGAAGGTC ACGTGGAACA ACGGAGAGAA CGCCACGCTG
GATTCGGAAG CGGTATCGCA GTCATTGCGT GAGGCGGCGA AGAAAAAGGG CGTGGCGCTG
TTCAATGCCG GAGATGTGGC GAAGGGCGCG GGCAAGAAAG TTGAAGCGGA GTTCGAGACG
CCGTTTCTTG CGCATGCGCC TTTGGAGCCG GGAAACTGCA CCGCACAGTT CCGCGGCGAT
TCCTGCGAAT TGTGGGCGCC GACGCAGGTT CCGCAGGATG TGCGCGATTC CGTGGCAGCG
GCTCTGAATC TCAAGCCGGA GCAGGTGAAG GTCAACGTCA CGCTGATGGG CGGCGGCTTC
GGGCGCAGGC TTGAACACGA CTACGGCGTG GAAGCGGCGC TGGTATCGAA GGCCGTCAAT
CTGCCGGTGA AGGTGATCTG GACGCGCGAG GACGATATGA AGTACTCGAC GTATCGTCCG
GTGAGCCTGC ACCAGATCAG CGCGCACGTT GGCGCGGATG GGTATCCGAC GGAGCTCACG
CATCGGATCA TCTCGCCTTC GATTAGCCGG CAAAAGGGGA CGAAGCTGGA TGACGGGATC
GATCCCGATC TGAAGGATGA GGGCGCGTTT ATCTATCCGG TGGCGAATGT GCTGCTGGAA
TATGTGGACC TCGATACCGC GGTTCCGTTG GGCTGGATGC GCTCGGTGTA TGCGTCGCAA
GTGGCGTTTG CGAATGAGTG CTTCCTTGAT GAACTGGCGG AGGTAGCGGG AAAGGACCCG
CTGGAGTATC GGCTGCACTT GCTGCGCGAA GACAAAGAGA TCAAGTTCTG GGACACGACG
TGGAGCACGG CACGGATGCG CGGCGTCTTA AAGCTTGCGG CGGAAAAGGC CGGGTGGTCG
AAACCGGTGG CCAGCGGGCG TTTTCGTGGG ATCGCGGCGC ATGCTTGCTT TGGTAGCTAC
GTGGCGGAGG TCGTGGAGAT CTCGCGCAAC GAGGACCAGC CGAAGATCGA ACGCGTTGTT
GTCGCGGCGG ATTGCGGGAC GGTGGTGAAT CCGAACATCC TGGAGCAGCA GTTGCACAGC
GCGGTGGTGT TTGGGCTGAC GCAGACGCTC TATGGGAAGA TCACGGTGCA GGGGGGCGCG
ATTGCGCAGG CGAATTTTGG AGACTATCAG TTATTGCGGA ATGCGGACAT GCCGGTGATT
GAGACGCATT TTGTGGAGAG CAGCGAGGCG CCGTCGGGGA TTGGGGAGCC TCCGGTGCCA
CCGATGGCGC CGGCGTTGTG CGGGGCGATT TATGCGGCGA CGAAGAAGAG AGTGAGAGCG
TTGCCGATTT TGACGTAG
 
Protein sequence
MSPVSRRDFL TTSATAAAGL VVALHLPLSS EAESAEFAPN AYVHISPEGK VTIVVARSEM 
GQGVRTSLPM ILAEELDCDF SQIAIEQAGA STLFGDQTTG GSASVRTCWD PMRKAGAQVR
AMLVSAASAH WKVDSSGCTT ENGFVIHAAS NRKASYGSLV GAAAKLPVPA EPKLKDAKDF
KLIGKPKQRL DTKSKTNGSA IFGIDFKVPG MKYAVLVRAP KFGATVKSVD DARAKGVAGV
THVEKIGDSA VAVVADSVWA AMSGRRALKV TWNNGENATL DSEAVSQSLR EAAKKKGVAL
FNAGDVAKGA GKKVEAEFET PFLAHAPLEP GNCTAQFRGD SCELWAPTQV PQDVRDSVAA
ALNLKPEQVK VNVTLMGGGF GRRLEHDYGV EAALVSKAVN LPVKVIWTRE DDMKYSTYRP
VSLHQISAHV GADGYPTELT HRIISPSISR QKGTKLDDGI DPDLKDEGAF IYPVANVLLE
YVDLDTAVPL GWMRSVYASQ VAFANECFLD ELAEVAGKDP LEYRLHLLRE DKEIKFWDTT
WSTARMRGVL KLAAEKAGWS KPVASGRFRG IAAHACFGSY VAEVVEISRN EDQPKIERVV
VAADCGTVVN PNILEQQLHS AVVFGLTQTL YGKITVQGGA IAQANFGDYQ LLRNADMPVI
ETHFVESSEA PSGIGEPPVP PMAPALCGAI YAATKKRVRA LPILT