Gene Rcas_3633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3633 
Symbol 
ID5541135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4748791 
End bp4749981 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content58% 
IMG OID640895753 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001433700 
Protein GI156743571 
COG category[R] General function prediction only 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.380399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.315982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCC ATTATCCTGT TGTGATCGCT GGCGGCGGCA CTGCCGGACT GACCGTCGCT 
GCTCAATTGG TCAACGTTCT CGATCCCTCC CGCATTGCTC TTATCGAACC AAACGATCGT
CACTTCTACC AACCGCTCTG GACGCTGGTC GGCGGCGGCG TCTTTCCCAA GGAAGAGTCG
ATGCGTGATG AGGCGGAGTT CATTCCGGCT GGCGTAACCT GGATCCAGGA GTATGTCAGC
GCCTTCGATC CGGATCAGAA TCGGGTGGCG CTCAAGAATG GCGATGCGCT GACCTACGAC
TATCTGGTTG TGGCACTTGG CATTCAGATC GATTGGGATA AAATCAAGGG ATTGCGAGAG
GCGCTGGCGG AGCGCAACGG CGTGTGCAGC AACTACTCCT ACGAAACGGT TGATCGCACA
TGGGAGAATA TCCGTTCGTT CCGGGGCGGC GTTGCAGTGT TCACGCAACC GTCAACGCCG
ATCAAGTGTG GCGGTGCGCC GCAGAAGATC GCCTACCTTG CCGATGATCA CTTCCGCCGC
GCCGGTGTGC GCTCCAGGAG CGCGATCAAG TTCTTCTCCG GGACCGGCGG CATCTTTTCG
GTCAAAAAAT ATGCCGACGC CCTCACCGCC GTATGCCGAC GGAAAGGGAT CGAGACGCAC
TTCCAGCACG AACTGGTCGA AATCAACCAT CGCCAGAAAG AGGCGGTCTT CCAGAGCCTC
GCAAGTGGCG AGCGTGTTTC GGTTCGGTAT GATATGATCC ATGTGACGCC GCCGATGAGT
AGCCCTGATG TGATTAAGCA GAGCAAACTG GCGGCTTCGA CCGGTTGGGT GGAGGTTGAC
CAGTTCTCGT TGCAGCACGT CCGCTACTCG AATGTGTTCG CCCTCGGCGA TTGCAGCAAC
CTGCCGACGT CGAAAACCGG CGCGGCTATC CGCAAGCAGG CGCCGGTTGT GGTCGAGAAT
CTCGTCGCTG CGATGGAAGG ACGACCGCTC GAATCGGTCT ACGATGGCTA CACCTCCTGC
CCGCTGGTCA CCGGCTACGG CAGCCTGATC CTGGCTGAGT TCGACTATCA ATTGATGCCA
AAGGAGTCGT TTCCGTTCGA TCAATCGCGC GAGCGGTATA GCATGTATGC GCTGAAAGCC
TACGGCTTGC CCGCCATGTA CTGGAATGGC ATGTTGCGCG GACGCATGTA A
 
Protein sequence
MQRHYPVVIA GGGTAGLTVA AQLVNVLDPS RIALIEPNDR HFYQPLWTLV GGGVFPKEES 
MRDEAEFIPA GVTWIQEYVS AFDPDQNRVA LKNGDALTYD YLVVALGIQI DWDKIKGLRE
ALAERNGVCS NYSYETVDRT WENIRSFRGG VAVFTQPSTP IKCGGAPQKI AYLADDHFRR
AGVRSRSAIK FFSGTGGIFS VKKYADALTA VCRRKGIETH FQHELVEINH RQKEAVFQSL
ASGERVSVRY DMIHVTPPMS SPDVIKQSKL AASTGWVEVD QFSLQHVRYS NVFALGDCSN
LPTSKTGAAI RKQAPVVVEN LVAAMEGRPL ESVYDGYTSC PLVTGYGSLI LAEFDYQLMP
KESFPFDQSR ERYSMYALKA YGLPAMYWNG MLRGRM