Gene Acid345_0463 details


Gene Information

Locus tag: Acid345_0463
Symbol:
ID: 4069458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 561575
End bp: 562573
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 59%
IMG OID: 637982467
Product: electron transfer flavoprotein, alpha subunit
Protein accession: YP_589542
Protein GI: 94967494
COG category: [C] Energy production and conversion
COG ID: [COG2025] Electron transfer flavoprotein, alpha subunit
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCTA TATTTTTGAT TGCACTGCAG CGCAATGGTG CATTAGACGA TACCGTCGCG 
GAATTGATTG CGACTGCCAA AACCTTGGAT CCGACGGTGC AAGCGACAGC GATTGCCTGT
GGATACGGCC CGGAACTCGA TGCGGTCTGC GCAGCCCTTA CCAAGAGCGT GCCGACCACA
TGGAAGATCT CGCACGAACA ATTCGCGACC TTCAATGCAG AAGCCATCCG CGAAGGACTG
GTTCGGATTC TGCCGAAGGG CGCAATCGTT CTCGTCGCGC ACGATCACTT CGGTATGGAC
TTGGCGCCCG GTCTCTCCGT GAAGCTTGGG GCAGCGTACG TGCCGGATGT CCTCGTGGTC
GAAGCAGGAC AACGACTGGT CCGTCAGGAG TTCGCGGGTC AGTTCAACGC ACGGATGGAA
TGTGATTTCT CTGGAGGAGC GGTGATCACG GTTCGCCCTG GTGTGTTCAA GCCCGGTTCC
CAGGTTGAGG TAAACGGCGT AGTGGTTGAC AAATCGAGTG AAGTAAGCGC GATCTCGGTC
CGTCGCCGCT ACGTCACAAC GATCCCCGCT CAGGTCGGAG ACGTGGATAT CACCAAGCAG
ACCGTGTTGG TATCGGTGGG CCGCGGCATC CAGGAAGCTG AGAATATCGA GCTCGCCCAA
GAGTTGGCTG ACGCCCTTGG CGGCGCGGTG AGTTGTTCGC GACCAGTGGT CGACGCCAAG
TGGCTAGACA AGTCACGCCA AGTCGGATCG TCCGGCGCGA CCGTGAGCCC GAAGGTTTAC
CTGGCTTGCG GAATCAGCGG ATCGTTCCAG CATATGGCCG GCATCAAAGG CTCGCCCTTC
CTGGTCGCGA TCAATAAGAA CCCGGCAGCT CCGATCTTCC AATTCGCTGA CGTAGGAATT
GTGGACGACC TGCTCGAATT CCTGCCCGCG CTGACCGAAC GCGTTCGCGA ACTGACGGAA
CGGCAAGGCA CAAAGCAGAC GGCCTCTGCC GCGCACTAA
 
Protein sequence
MNPIFLIALQ RNGALDDTVA ELIATAKTLD PTVQATAIAC GYGPELDAVC AALTKSVPTT 
WKISHEQFAT FNAEAIREGL VRILPKGAIV LVAHDHFGMD LAPGLSVKLG AAYVPDVLVV
EAGQRLVRQE FAGQFNARME CDFSGGAVIT VRPGVFKPGS QVEVNGVVVD KSSEVSAISV
RRRYVTTIPA QVGDVDITKQ TVLVSVGRGI QEAENIELAQ ELADALGGAV SCSRPVVDAK
WLDKSRQVGS SGATVSPKVY LACGISGSFQ HMAGIKGSPF LVAINKNPAA PIFQFADVGI
VDDLLEFLPA LTERVRELTE RQGTKQTASA AH
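
The lengths listed under Gene Information can be cross-checked against the coordinates and the sequences shown here. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The helper names `clean`, `gc_fraction`, and `expected_lengths` are hypothetical and chosen only for illustration.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Derive gene and protein lengths from replicon coordinates.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that contributes no amino acid.
    """
    gene_len = end_bp - start_bp + 1
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the Gene Information fields of this record.
gene_len, protein_len = expected_lengths(561575, 562573)
print(gene_len, protein_len)  # 999 332
```

Passing the gene sequence above to `clean` should give a string whose length equals the listed gene length, and `gc_fraction` on the same string can be compared with the listed GC content.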