Gene Rcas_3572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3572 
Symbol 
ID5541073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4665152 
End bp4666231 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID640895691 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001433639 
Protein GI156743510 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.573102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTT CAACCATCAT CAATGTCGGT ATCGTCGGCG CCGGCAACAT TGCCCGCGGG 
CGCCATCTTC CCTGTTTCAA GAAGCATCCG AATGTCCGCC TCGCCGCCAT TTCCGACGTC
GTCGTCGATC TGGCTCAATC CGCCGCGACG GAGTACGAGA TTCCCGGCGT GTACAGCGAT
TACCGCGAGA TGTTCGAGAA AGAGAACCTC GACGCGGTGG TGATCTGTAC GCCGAACAAG
TTCCATGCCC CGGCGTCCAT CGCTGCGCTC GACGCCGGAC TGCACGTGCT CTGCGAGAAG
CCAATGGCGC TCGACCCGGT CGAAGCGCGC GCCATGGTCG CGGCAGCCGA GCGAAACAAA
AAGATCCTCA GCATCGCATT TCACTACCGC CACATGGCGC CGGTGCGCGC CGCCCGCCGC
GTCGTCGACT CAGGCGAACT GGGTCTTGTG TATATGGCGC GGGTGTACGC CCTTCGTCGT
CGCGGTGTTC CCTCATGGGG CACATTCGTC CAGAAGCACA TCCAGGGTGG TGGCGCCATG
ATCGATTTCG GCGTCCATCT GCTCGACACG GCGCTCTGGC TGATGGGCAA TCCACAGCCG
GTCGAGGTCT GCGCCAGCAT CTCGCAACAT CTCGGCAAAG CGCCGAACGT CAACCCCTGG
GGGCAGTGGA ACTACCGCGA GTTCACTGTC GAGGATCAGG CAGCGGCGTT CATCCGTTTT
GCCAATGGCG CCAGCATGCT CCTCGAGTGC TCATGGGCGC TGAATATCCC CGAAAACTAC
GAGAATGTCT CACTTTCGGG AACGACCGCC GGGCTGGAAG TCTTCCCGCT GAAGGTCAAC
AAGGCGCACC TCGATATGCT CGTCAGTTGG AAGCCCGACT GGATGCCGGG CGAACGCGAC
AATCCGGGCG ATGTGCAGAC CGCCGATTTC GTCGGTGCAA TCCTTGAAGG ACGGCAACCG
GTGTCGCAGG CACACCAGGC ATTGCAGGTC ACGGAAATTG TGGATGCAAT CTACCGCAGC
GCCGAAGCAG GCGCCGCTGT GCGCCTCGAC GGTGCAACGT CACATCCATC CCTTCACTGA
 
Protein sequence
MTASTIINVG IVGAGNIARG RHLPCFKKHP NVRLAAISDV VVDLAQSAAT EYEIPGVYSD 
YREMFEKENL DAVVICTPNK FHAPASIAAL DAGLHVLCEK PMALDPVEAR AMVAAAERNK
KILSIAFHYR HMAPVRAARR VVDSGELGLV YMARVYALRR RGVPSWGTFV QKHIQGGGAM
IDFGVHLLDT ALWLMGNPQP VEVCASISQH LGKAPNVNPW GQWNYREFTV EDQAAAFIRF
ANGASMLLEC SWALNIPENY ENVSLSGTTA GLEVFPLKVN KAHLDMLVSW KPDWMPGERD
NPGDVQTADF VGAILEGRQP VSQAHQALQV TEIVDAIYRS AEAGAAVRLD GATSHPSLH