Gene Rcas_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3597 
Symbol 
ID5541098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4695542 
End bp4696570 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content62% 
IMG OID640895716 
Productelectron transfer flavoprotein alpha subunit 
Protein accessionYP_001433664 
Protein GI156743535 
COG category[C] Energy production and conversion 
COG ID[COG2025] Electron transfer flavoprotein, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTG AACAGCCAAC CATTCCGCGC ATCTGGGTGT TCGTCGAGCA ACAGGAACAT 
CAGGTGCATC CGGTATCGTG GGAATTGCTC GGCGCGGCGA AGCGGCTGTC GGCGGATCTG
CCGGGCAGCG TGGTCGAAGC GGTGCTGCCA GGACATCAGG TTGCCGATCT GGCGCCACAG
GCGTTTCAGT ATGGCGCAGC ACGCATCTAT CTGATCGATA ATCCAGTGCT GGAGATCTAC
CGCAACCTGC CGTATGCCGT TGCTGTCAGC CAGCTGGTGA AGGAGCATCG TCCGGAGATT
TTCCTGATCG GCGCAACGAC GCTTGGGCGC GACCTGGCAG GCTCAATTGC CACGCGCGTC
GGCACCGGTC TGACTGCCGA CTGCACCGAA CTCTCAATCG ATCCCGCCAA CCATATTCTG
GCAGCCACCC GTCCGACATT CGGCGGCAAC CTGATGGCGA CAATCCTCTG CCGCCGCCAC
CGACCGCAGA TGGCGACTGT TCGCCCACGG GTGCTGCCAA TGCCGGACCC CGCACCGGAC
GCGACCGGCG AAGTAGTGAC CGTTCCGTTC GATATGCGCG AGGAGGACGT TCCGGTCAAA
CGATTGCGGT TGATCCGCGC CGAAGAGCAA CCCAACATTG AGTATGCCGA AGTGATCGTC
GCCGGTGGAC GTGGTATGGG GGGACCGGAG GGATTTGCGC TCCTCCAGGA ATTGGCAGAC
GCACTCGGCG GCATGGTTGC AGCCAGCCGT CCGGTGGTGG ACGCCGGATG GATGGACGCC
AGCCGGCAGG TGGGACAAAC GGGCAAAACG GTGCGTCCCA AGTTGTACAT TGCGGCGGGA
ATCTCCGGCG CAGTGCAGCA TCGGGTCGGC ATGAGCGGCG CTGATGTCAT TCTGGCAATC
AACACCGATC CAAACGCGCC GATCTTCCAG ATCGCTACGA TGGGGATCGT CGGCGATCTG
TACGAAGTGA TCCCCGCGTT GATACGTCAG GTGAAAGGAC AGTCGTATGA CGGGCAGGCT
CACCTTTGA
 
Protein sequence
MNAEQPTIPR IWVFVEQQEH QVHPVSWELL GAAKRLSADL PGSVVEAVLP GHQVADLAPQ 
AFQYGAARIY LIDNPVLEIY RNLPYAVAVS QLVKEHRPEI FLIGATTLGR DLAGSIATRV
GTGLTADCTE LSIDPANHIL AATRPTFGGN LMATILCRRH RPQMATVRPR VLPMPDPAPD
ATGEVVTVPF DMREEDVPVK RLRLIRAEEQ PNIEYAEVIV AGGRGMGGPE GFALLQELAD
ALGGMVAASR PVVDAGWMDA SRQVGQTGKT VRPKLYIAAG ISGAVQHRVG MSGADVILAI
NTDPNAPIFQ IATMGIVGDL YEVIPALIRQ VKGQSYDGQA HL