Gene Rcas_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2203 
Symbol 
ID5539684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2844642 
End bp2845769 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content61% 
IMG OID640894336 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001432304 
Protein GI156742175 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.345355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.953856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACC ACTCCAATCC TCCCGGTCTG CGCTTCGGTA TCATTGGCAG CGCTGCTGGA 
ATCGCCGAAA GCCATCTGAA AGCGCTTACC GAACTTCCAG GGGCGACGAT TGTTGGCATG
GCAGACATCG CCATCGAACG CGGCGAGGCG CGCGCCAGAG CAGTCGGTTG CCCATTCTTC
GCCGATCACC GCGCAATGCT CGACGCTGTG CGCCCTGATG TCGCCGTTAT CTGTGCGCCA
CACCCACTGC ATGCCGCGTT GGCGATTGAT TGCCTGGACG CCGGCGCCCA CGTGCTGGTC
GAAAAACCGC TGGCGGTCAG CGTCTCCGAA GCGGATGCAA TGATCGCCGC CGCAGACCGC
GCCGGACGCT TGCTGGCAGT CTGTTTCCAG CAACGGTTTC GCCCGGTCAT CGAACATGCC
CGCACCCTGA TCGAATCCGG CGCAATTGGC GATATTGTGC GCGTACTCTG CGTCGAACCC
TGGTTTCGCA CCCAGTTTTA CTACGACTCG GCAGCCTGGC GCGGCACATG GCGCGGGGAA
GGCGGCGGGG TCTTGATGAA TCAGGGACCT CACCCGCTCG ATCTGCTCTG CCACCTGACC
GGTTCTCCGG CAAAGGTCTG GGGATGGGTG CGCACGATGG GGCACACGAT CGAGTGCGAG
GATGTTGCGC AAGCATTGCT GGAATATCCC AACGGCGCGC CCGGCTATAT CTATTTCAGC
ACGGTCGAAG CAGGTTCCGA ACGTCGCATG GAGATCGTCG GCGACTGTGG CGCGCTCGTG
ATTGTCTTTG ATAACCTGAC GATCCATCGC TTCGCCGTAC CGCTGAGTGA GTATCGCACA
ACGGTGCGCG AGATGTGGAG TCAACCACAG GTTCAAACCG AGACGCTCCG ACTGCCCAGT
GATATTGGCG AACATGGCGG ACACCTTGGG GTCTATCTTG ATCTGGTGCG GGCGATTGCT
GAAGGGCGTC GTCCGCGTTG CGACGCGCGC GAGGCGCGCA TATCACTTGA ACTGTCGAAC
GCGATCATCT ACTCCGGTAT GACCGGTCAA CCGGTGACGC TTCCGCTTGA CCGTCAGGCG
TATGATGCGT TGCTCGACGA TCTGAGAGCG GGAAGGAGAA AGTTGTGA
 
Protein sequence
MTDHSNPPGL RFGIIGSAAG IAESHLKALT ELPGATIVGM ADIAIERGEA RARAVGCPFF 
ADHRAMLDAV RPDVAVICAP HPLHAALAID CLDAGAHVLV EKPLAVSVSE ADAMIAAADR
AGRLLAVCFQ QRFRPVIEHA RTLIESGAIG DIVRVLCVEP WFRTQFYYDS AAWRGTWRGE
GGGVLMNQGP HPLDLLCHLT GSPAKVWGWV RTMGHTIECE DVAQALLEYP NGAPGYIYFS
TVEAGSERRM EIVGDCGALV IVFDNLTIHR FAVPLSEYRT TVREMWSQPQ VQTETLRLPS
DIGEHGGHLG VYLDLVRAIA EGRRPRCDAR EARISLELSN AIIYSGMTGQ PVTLPLDRQA
YDALLDDLRA GRRKL