Gene Rxyl_2944 details


Gene Information

Locus tag: Rxyl_2944
Symbol:
ID: 4115939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 2952446
End bp: 2953756
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 74%
IMG OID: 638037714
Product: monooxygenase, FAD-binding protein
Protein accession: YP_645666
Protein GI: 108805729
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
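
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2952446
END_BP = 2953756


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))
```

Under these assumptions the script prints 1311 and 436, matching the Gene Length and Protein Length fields.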


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGCGG AAGACACCAC AGGAGGCGGG AGGATCATGA GCCGGGAGGC CTACGACGTG 
GCGGTGGTGG GGGCGAGCAT CGCCGGCTGC ACCGCGGCGA CGCTCTTCGC CCGGGAGGGG
GCGAGGGTGG CCCTCATAGA GCGCCACGCG GACCCCAACG CCTACAAGGC GCTGTGCACC
CACTACATCC AGCCCTGCGC CGTGCCCACC ATCGAGCGGC TCGGCCTGGC GCCCCTCATC
GAGGAGGCCG GGGGGATCCG CAACGGGCTC GTGATGTACA CGCGGTGGGG TTGGATCGGC
GGCGACGCCG AAGACCTGCC CTACGGCTAC AACATCCGGC GCCAGACCCT CGACCCCATG
CTCCGCGGCC TCGCGGCGAG CACCCCCGGG GTGGAGTTCA TGCCGGGACG GAGCGCGCGC
GACCTCATCC GGGAGGACGG GCGCGTCGCG GGCGTCGTGG TGCGCGACCG GGCGGGGGAG
AGCCATGAGA TACCGGCGCG GCTGGTCGTC GCCGCCGACG GGCGCAGCTC CCGGATCGCG
AAGATCGCGG GCGGCCCGGT CGAGGTCGCG CCGAACAACC GCTTCGCCTA CTTCGCCCAC
TACCGCGACC TCACCCTGCC CTCCGGCTCC AGGTCGCAGA TGTGGATGCT CGAGCCGGAC
GTGGCCTACA CCTTCCCCAA CGACGGCGGG GTGACGCTCG CGGCCTGCAT GCCGGCCAAG
GAGAGGCTGC CCGAGTTCAA GGAAGACCTC GAGGGGGCCT TCGTCCGCTT CTTCGAGGGG
CTGCCGCTCG GCCCCCGCCT CTCCGAGGCA CAGCGGGTCT CCAGGATCAT GGGCGTCGTC
GAGCAGGCCA ACGTCTCGCG GCCCGCGGCG CGTCCGGGCC TCGCCTTCGT GGGGGACGCG
GCCCTGTGCT CCGACCCGCT GTGGGGCGTG GGCTGCGGGT GGGCCTTCCA GTCGGCGGAG
TGGCTGGTGG ACGAGACCGC AGAGGCGCTG CTCGCCGGCG GCGACCTCGA CCGGGCGCTC
GAGCGCTACC GCAGGAGGCA CCGGAGGGAG CTCTCCGGGC ACCACCGGCT CATCTGCGAC
TTCTCGCGCG TGCGCCCGTA CAACCCCGTG GAGCGCCTGA TGTTCTCCGC GGCGGCCAGG
GACAATCGCT CGGCCCGGCA CTTCCACGCC TTCGGCGCCA GGATCATCGG GGTGCGCGAG
TTCCTCTCGC CCCGGGCCGT CGGGCGCGCG CTGTGGGTGA ACGCCCGGCA CGCCGCCCGG
GGCCGCGCCA AGCCCGGCCC GGGCCCCGCG GTCGCGAGGG CGGGGAGGTA G
 
Protein sequence
MSAEDTTGGG RIMSREAYDV AVVGASIAGC TAATLFAREG ARVALIERHA DPNAYKALCT 
HYIQPCAVPT IERLGLAPLI EEAGGIRNGL VMYTRWGWIG GDAEDLPYGY NIRRQTLDPM
LRGLAASTPG VEFMPGRSAR DLIREDGRVA GVVVRDRAGE SHEIPARLVV AADGRSSRIA
KIAGGPVEVA PNNRFAYFAH YRDLTLPSGS RSQMWMLEPD VAYTFPNDGG VTLAACMPAK
ERLPEFKEDL EGAFVRFFEG LPLGPRLSEA QRVSRIMGVV EQANVSRPAA RPGLAFVGDA
ALCSDPLWGV GCGWAFQSAE WLVDETAEAL LAGGDLDRAL ERYRRRHRRE LSGHHRLICD
FSRVRPYNPV ERLMFSAAAR DNRSARHFHA FGARIIGVRE FLSPRAVGRA LWVNARHAAR
GRAKPGPGPA VARAGR
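
The relationship between the two sequences above can be illustrated with a short, dependency-free sketch. It assumes the sequences are pasted in as shown, in whitespace-separated blocks of ten, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in their permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen for illustration only.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGTCGGCGG AAGACACCAC AGGAGGCGGG"
    print(translate(first_blocks))
```

Run on the first three blocks of the gene sequence, the sketch prints MSAEDTTGGG, which is the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.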