Gene RoseRS_3858 details


Gene Information

Locus tag: RoseRS_3858
Symbol:
ID: 5210840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4821459
End bp: 4824395
Gene Length: 2937 bp
Protein Length: 978 aa
Translation table: 11
GC content: 63%
IMG OID: 640597453
Product: FAD linked oxidase domain-containing protein
Protein accession: YP_001278161
Protein GI: 148657956
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
        [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
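
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length (the gene sequence in this record ends in TAG). The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 4821459
end_bp = 4824395

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the 2937 bp and 978 aa given in the table.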


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.738511
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000383652
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATTACG CCGATCTGGC TGCTGAATTG CGTGAATGCG TCAAGGGTGA TGTGCACACC 
GATGCGATCT CGCGCGCTAT CTATGCGACC GATGCCAGCA TCTATCAGAT CGAGCCGCTG
GGGGTGGTGC TACCCAGGGA CGAAGAGGAC GTTGCAGCGG TCGTGCGCCT GGCGCGCCTC
CGTCGCCTCC CGATCCTGCC GCGCGGCGGC GGTACGAGCC TGGCGGGGCA GGCGGTGGGG
CGCGCCATCC ACCTCGATTT TACTCGCTAT ATGAACCGCC TGCTGGAGGT CAACGTCGCC
GAACAGTGGG CTTGGGTCGA GCCGGGGATC ATCCTCGATC AGTTGAATGC AGAAGTGGCG
CCGCACGGTT TGATGTTTGC ACCCGATGTT TCGCCGTCGA ACCGCGCAAC CATCGGCGGG
ATGATCGCCA ACAACTCGTC CGGTATGTAT TCACTCGTCT ACGGCAAAAC CATCGACCAC
GTGCTGGAAC TGAAGGTGAT GCTGTCCGAT GGCAGTATCA CCACCTTCCG ACCGCTCGAT
GAAACGGAAT TGCGTGCCAG GCTGAGCAAC CCCGAACTCG AAGGACGGAT CTACCGCGCA
GTTGCGCGTC TGGCGCACGA CCATGCCGAC GAGATTGCGC GTCGCTATCC CAAAGTGCTG
CGGCGCGTCG GCGGCTACAA TCTCGATGCG TTCGTGCCAG TCGTCGAAGA GGGACAGACC
CGCTACGGTA TTATGTTCGG TTCGCGTTCC CCCGACCGAC GGTTCAACAT GGCGAACATG
ATCGTCGGCT CGGAGGGCAC GCTGGCAATC GTGCTGGCGG CGCGGCTGCG CCTCGTTCCC
CGTCCCAAAC ATACCGCGAT TGCGATTCTG GAGTTTGCCA CGCTCGACGC TGCGCTCGAT
GCAGTGGTTC CCTGCCTGGA GTGCGAACCT GCCGCCGTCG AATTGATGGA CGATATTCTG
CTCGACCTGA CGCGCAAGTC GCACGAATAC GCGCAGTATC TGGCGATGTT CGTGCAGGGA
ACGCCGGGTG CGCTGTTGCA GGTGGAGTTC TTCGGCGAGA CCGGCGATGA CGTGCGCGCG
CATCTGGATC GCCTGGAACA GCGTCTGCGG GGGCATTACC AGGCAATGAC CCCTGTGCTC
ACTGCTGAGC GCAAACGCGC AGTACTGGCA GTGCGCAAGG CGGGGCTGCC GCTGCTCCAG
TCGCTCTCAC CCGACCTGAA ACCCGAAACG TTTGTCGAAG ACTCCGCGGT TCCGCCAGAG
AGATTGAACA TCTACCTGCG CCGGTTCCGC GACATCTGCC ACGCGCACGG GGTGCGGGTC
GCCTTTTACG GGCACGCCAG CGTTGGCGTC ATTCACGCCC GCCCGCTCCT CAACCTGAAG
GACGCGGGGG ATGTGCGCAA GATGCGCGCA ATTGCCGAAC AGATCAAAGA CCTGGTGATC
GAATTCGGCG GGGCGCTCTC CGGCGAGCAT GGCGACGGGA TGCTGCGGGC TGAGTTCAAC
CGCGAGTTAT TCGGCGATAC CCTCTACGAA GCCTTTCGCG AGATCAAGCA TACGTTCGAC
CCTTACGACA TTCTCAACCC CGGCAAGATC GTCGAAGCGC CGCCGATGGA TGCCAGCCTG
CGGTATGGCG CATCGTATCA TCCGATCCAG TTGCATACCC ACTTTCGCTT CAGCGACACC
GGCGGCATTG TCGGTGCGGT CGAGTTGTGC AACGGCAACG GTCTGTGCCG CAAAGTGGCG
GGTGGCACGA TGTGCCCCAG TTACATGGTG ACGCGCGACG AAGAACACTC GACCCGCGGG
CGTGCCAACG CGCTGCGGAT GGTCTTCTCC GGAGCGCTTC CGCTCGATGC GCTCACCAGC
GCACGCATGA AAGAAGTGAT GGACCTGTGT CTGGAGTGCA AAGGGTGCAC CGGCGAATGC
CCGTCACGGG TCAATATGAC CCGTCTGAAG TCCGAGTGGC TCTCGATCTA CTATGATCGC
CACGGCATTC CGCTGCGATC ACGACTGTTC GGCAGCATTC GCACCATCAA TGAACTCGGC
AGTCGGATTG CGCCGCTGGC GAACCGCGCA CTGGCGCTGC CATTCGTCCC CTGGCTCACC
GAACGGCTGA TCGGCGTCAG CCGCCACCGT CGTCTGCCGC CATTCGCCGA TCAGCCGTTC
CATCGCTGGT TCGAGAAACG ACCGACGCCG GTCGGCGCCG ATCGTCCTTC GGTCGTCCTC
TTCCCTGACA CATTCGCCGA CTATAACGAC CCGCACGTTG CGCAGGCTGC GGTGCACGTG
CTGGAGGCGG CAGGGTACCG CGTACTCCTG CCGACGCGCC GCGTCTGCTG CGGGCGTCCG
CAGATCTCGA AGGGGTTGCT GAAGGAAGCA CGGGCGCTGG CGCAGCGTCA GATCGACGCC
CTGGGACCGT ATGCCGCCGC CGGGATTCCG ATCATCGGTC TTGAACCGAG TTGCATCCTG
ACCTTCCGCG ATGAGTATCC CGACCTGCTG GACGATCCCC GCACTGCAAC ACTGGCGCAG
ATGTCGTTCC TGTTCGACGA ATTCCTGGCG CGTGAGGTTC GCACCGGTCG CGCGACGCTG
CGTTTCAAGC AACCGGATCA GGCGCCGCGC CGGTATCTGT TCCACGGTCA TTGCCACCAA
AAGGCGTTGA TCGGCAGCCA GCACGCGCTG GCGCTCCTCC GCATGATCCC CGGCGCGGAG
GTGCATGAAG TCGATAGCGG ATGCTGCGGC ATGGCGGGTT CATTCGGCTA CGAAGTGGAA
CATTACGTCA TTTCACAGAA GATCGGCGAG CGCGCGCTCT TCCCGGCTAT CCGTTCGCTG
CCTGCCGACG CGACGATTGT GGCGATGGGA ACCAGTTGCC GCCAGCAGAT CGCCGATGGA
ACCGGAAGAC GCGCCGATCA CCTTGCCGGG GTGTTGGCTG ACGTTCTCGA AGAGTAG
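
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. The following minimal sketch shows one way to do that and to estimate GC content for comparison with the 63% reported in the Gene Information table. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input; the full listing would need to be pasted in to check the reported figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first row of the gene sequence from this record.
first_row = "ATGAATTACG CCGATCTGGC TGCTGAATTG CGTGAATGCG TCAAGGGTGA TGTGCACACC"

print(len(clean_sequence(first_row)), "bases")
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

A single row is too short to be representative, so its GC fraction should not be expected to match the gene-wide value.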
 
Protein sequence
MNYADLAAEL RECVKGDVHT DAISRAIYAT DASIYQIEPL GVVLPRDEED VAAVVRLARL 
RRLPILPRGG GTSLAGQAVG RAIHLDFTRY MNRLLEVNVA EQWAWVEPGI ILDQLNAEVA
PHGLMFAPDV SPSNRATIGG MIANNSSGMY SLVYGKTIDH VLELKVMLSD GSITTFRPLD
ETELRARLSN PELEGRIYRA VARLAHDHAD EIARRYPKVL RRVGGYNLDA FVPVVEEGQT
RYGIMFGSRS PDRRFNMANM IVGSEGTLAI VLAARLRLVP RPKHTAIAIL EFATLDAALD
AVVPCLECEP AAVELMDDIL LDLTRKSHEY AQYLAMFVQG TPGALLQVEF FGETGDDVRA
HLDRLEQRLR GHYQAMTPVL TAERKRAVLA VRKAGLPLLQ SLSPDLKPET FVEDSAVPPE
RLNIYLRRFR DICHAHGVRV AFYGHASVGV IHARPLLNLK DAGDVRKMRA IAEQIKDLVI
EFGGALSGEH GDGMLRAEFN RELFGDTLYE AFREIKHTFD PYDILNPGKI VEAPPMDASL
RYGASYHPIQ LHTHFRFSDT GGIVGAVELC NGNGLCRKVA GGTMCPSYMV TRDEEHSTRG
RANALRMVFS GALPLDALTS ARMKEVMDLC LECKGCTGEC PSRVNMTRLK SEWLSIYYDR
HGIPLRSRLF GSIRTINELG SRIAPLANRA LALPFVPWLT ERLIGVSRHR RLPPFADQPF
HRWFEKRPTP VGADRPSVVL FPDTFADYND PHVAQAAVHV LEAAGYRVLL PTRRVCCGRP
QISKGLLKEA RALAQRQIDA LGPYAAAGIP IIGLEPSCIL TFRDEYPDLL DDPRTATLAQ
MSFLFDEFLA REVRTGRATL RFKQPDQAPR RYLFHGHCHQ KALIGSQHAL ALLRMIPGAE
VHEVDSGCCG MAGSFGYEVE HYVISQKIGE RALFPAIRSL PADATIVAMG TSCRQQIADG
TGRRADHLAG VLADVLEE
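
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first row of the gene sequence only. It is an assumption-laden example, not the method used to generate this record: it builds the standard codon assignments, which translation table 11 is assumed to share for elongation, and it does not handle ambiguous bases or alternative start codons. The function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order.
# Assumption: translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def translate(blocked_dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(blocked_dna)
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGAATTACG CCGATCTGGC TGCTGAATTG CGTGAATGCG TCAAGGGTGA TGTGCACACC"
print(translate(first_row))
```

The 60 bases of the first row yield 20 residues, which can be compared against the first two blocks of the protein sequence above.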