Gene Rcas_2173 details

Gene Information

Locus tag: Rcas_2173
Symbol:
ID: 5539654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2793460
End bp: 2796510
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 59%
IMG OID: 640894307
Product: nitrate reductase
Protein accession: YP_001432275
Protein GI: 156742146
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
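
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based and inclusive, and if the CDS ends in a single stop codon. Both points are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2793460, 2796510)
print(length, protein_length(length))  # 3051 1016
```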


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000388509
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGGCT CATTGGTGAA GAACATCCTG CGCGCACTGG CGCAGCCGAC GCACGCATCG 
ATCTGGCAGG ACACGGTTGC ACAACCGCAG GCGCCGGTGC AGGCAAGCAT CGTCGATCAG
CCCTCGTCGC AAGCGGTGCA GGTTCACGGC GCTGAACGGA CAGCAACCGC CGAACTGAAC
GGATACCCAC CGGTCGAACG CTGGCAGCAT TGGACGGAGT ACGACCCCAA GGCATGGCCT
CAGAAAGTCG AGCGTTCCTA TACCCTGGTG CCGACGATCT GCTTCAATTG CGAAAGCGCC
TGTGGTCTCC TGGCGTATGT TGATACTTCC ACGCTCAAGA TACAAAAGTT CGAGGGAAAT
CCCCTACACC CCGGTAGTCG GGGGCGCAAC TGCGCCAAAG GACCGGCAAC GCTCAATCAG
GTCTACGACC CGGATCGCAT TCTTTACCCG CTCAAGCGGG TCGGTCGGCG CGGCGAGGGG
AAGTGGAAGC GCGTCAGTTG GGACGAGGCG CTCGACGACA TTGCCGGGCG CATTCGTCGC
GCTCTCGTCG AAAAGCGTTT GACTGAAATC ATGTATCACG TCGGTCGTCC CGGTCATGAT
GGCATTATGG AATGGGTGCT GCCTGCCTGG GGTGTAGATG CCCACAACTC GCACACGAAT
GTCTGCTCAT CGAGCGCACG GTGTGGTCAG GCGTTGTGGA TGGGGTATGA TCGCCCGTCG
CCGGATCATG CGCATGCGCG CGTCATTCTG TTGATCAGTT CCCATCTCGA AACCGGCCAT
TACTTCAACC CGCACGCGCA GCGGATTATG GAAGGGAAGA TGGCAGGCGC GAAATTGATC
GTCCTCGATA CCCGTCTGTC GAACACCGCT TCGCTGGCGG ACGAGTGGCT TGCTCCCTGG
CCCGGCAGCG AGACGGCCAT TCTGCTGGCA ATCGCCAGGC ATCTGATTGT CGGCAAGAAG
TACGACCGCG ACTTTGTGCG TCGCTGGGTC AACTGGGAGC AGTATCTGCG TTGCGAGCAC
CCTGATCTGC CGGTGCGCTT CGAGACCTTC GAGGCGAAGT TGGAGGAACT TTACGCTTCC
TTCACATTTG AGTTTGCCGC GCAGGAGAGT GGCGTCAGCG CCGAACAGAT CGCGCGCGTG
GCGGACTATA TCGCGCAGTG CGACGGTCGC CTCGCCACCC ATACCTGGCG CAGCGCCACG
AGCGCCAATC TGGGCGGGTG GATGGTCGCG CGTTGTCTCT GGTTCCTGAA TGTGTTGACC
GGCTCGATTG GACGGGAAGG CGGCACATCG GCGAATGTTT GGGACAAATG GGTTCCACGC
CATCCCAATA TGGCGCCGCA CGTCCAGGTT TGGAACGAAC TGACCTGGCC CCAGGAATAC
CCGCTCAGTT TCTACGAACT GAGCTATCTG CTCCCTCACT TTCTCAAAGA AGGTCGCGGC
AGGGTTGATG TTTATTTCAC TCGTGTCTAC AATCCCCTCT GGACCAACCC GGACGGTATG
AGCTGGATGG AAGTGCTGAC CGATGAATCG AAGATCGGGT TGCATGTTCA TCTTTCGCCC
AGCTGGAGCG AAACCGGTTT GTTTGCCGAC TATATTCTGC CAATGGGTCA TGGCGCCGAA
CGTCACGATA TCATGAGCCA GGAAACGCAT GGCGGTTGCT GGATCGCCTT TCGCCAGCCG
GTGATCCGCG AAGCATTGCG CCGCCTTGGC AGACCGGTCA ACGACACGCG CCAGGCGAAC
CCCGGTGAGG TGTGGGAAGA GACGGAGTTC TGGATCGAAC TCTCGTGGCG TATCGATCCT
GATGGCAGCC TGGGAGTGCG CCGAACGTTT GAAAGTCCCT ACCGGCCCGG CGAGAAGATC
ACGGTCGATG AACTCTATGC CTGGATGTTC GAGAATCATG TGCCCGGATT GCCCGAAGCG
GCGGCGAAGG AAGGGTTGAC GCCGCTGGAA TATATGCGTC GCTATGGCGC ATTCGAGTTG
CGCAAAGGCG TTCAACCGAC CTACGATCAA CCACTGACGG CGGCAGAACT CGAAGACGCG
ACGATAGACC CGGAGACGCA GGTGGTCTAC ACCAGAAAGC CTGCCGCGCC TTCCTCGAAT
ATCACACCGC TCCCCTTCTT CCAACCAGAC CCGGAGCGTG GTCGTCCAGT CGGCGTGCAA
CTCGAGGATG GTTCACGCCT GATCGGTTTT CCTACACCCT CGCGCAAACT GGAGTTCTAC
TCGACGACGA TGCGCGATTG GGGTTGGCCC GAGTATGCCA TCCCGACCTA CATCCATAGC
CATGTGCATC CCAGCAGGGT TGACCGTGAA CGAAACGAAG CCGTGTTACT CTCGACCTTC
CGCCTGCCCA CCCTCATCCA TACTCGTAGC GGTAATGCCA AATGGCTTTA CGAGATCAGC
CATAAGAACC CGGTCTGGAT CCATCCATCC GATGCGCAGC GGCTTGGCGT TCGGACTGGC
GATCTGATCA AGGTCGTGAC CGCTATTGGC TACTTTATTG ACCGTGTCTG GGTGACAGAA
GGCATCCGCC CTGGCGTGAT CGCCTGTTCT CATCATCTTG GGCGCTGGCG GCTTCAGGAG
GATGTTGGCG GCAAACTCTC CACCGCGCTC GTCGAACTGA CGCCGATGGG CGAGGCGCAA
TGGCGGATGC GCCAGATCCA TGGGATTCAG CCGTATGCCA GTTCCGACCC CGACACCGAG
CGTATCTGGT GGAACGACGC GGGAGTGCAT CAAAACCTGA CGTTTGCTGT TCAGCCGGAT
CCGGTCAGCG GCATGCACTG CTGGCATCAG AAGGTGCGAC TCGAGCGTGT AGGACCAGAC
GACCGCTATG GTGACATTTT CGTCGATACG CGCCGCGCGC ATGAGGTGTA TCGTGAGTGG
CTGGCGATGA CCCGTCCTGC GTGTCAGGTG TCGCCGAATG GTCTTCGGCG ACCGCACTGG
CTTCTGCGTC CATTCCGCCC CGATCTCGAA GCGTACTATC TTCCCGACCG ACATTTCGGC
AATGGGCATA TCGTCACGGT CGAACCATGC GTATCAGATC ACAGGAAATG A
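
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The Python sketch below does that and computes a GC fraction, which can be compared with the GC content field in the gene information. The function names are hypothetical, and how the database rounds its percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGACCGGCT CATTGGTGAA GAACATCCTG CGCGCACTGG CGCAGCCGAC GCACGCATCG"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of the first row gives the value for the whole CDS.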
 
Protein sequence
MTGSLVKNIL RALAQPTHAS IWQDTVAQPQ APVQASIVDQ PSSQAVQVHG AERTATAELN 
GYPPVERWQH WTEYDPKAWP QKVERSYTLV PTICFNCESA CGLLAYVDTS TLKIQKFEGN
PLHPGSRGRN CAKGPATLNQ VYDPDRILYP LKRVGRRGEG KWKRVSWDEA LDDIAGRIRR
ALVEKRLTEI MYHVGRPGHD GIMEWVLPAW GVDAHNSHTN VCSSSARCGQ ALWMGYDRPS
PDHAHARVIL LISSHLETGH YFNPHAQRIM EGKMAGAKLI VLDTRLSNTA SLADEWLAPW
PGSETAILLA IARHLIVGKK YDRDFVRRWV NWEQYLRCEH PDLPVRFETF EAKLEELYAS
FTFEFAAQES GVSAEQIARV ADYIAQCDGR LATHTWRSAT SANLGGWMVA RCLWFLNVLT
GSIGREGGTS ANVWDKWVPR HPNMAPHVQV WNELTWPQEY PLSFYELSYL LPHFLKEGRG
RVDVYFTRVY NPLWTNPDGM SWMEVLTDES KIGLHVHLSP SWSETGLFAD YILPMGHGAE
RHDIMSQETH GGCWIAFRQP VIREALRRLG RPVNDTRQAN PGEVWEETEF WIELSWRIDP
DGSLGVRRTF ESPYRPGEKI TVDELYAWMF ENHVPGLPEA AAKEGLTPLE YMRRYGAFEL
RKGVQPTYDQ PLTAAELEDA TIDPETQVVY TRKPAAPSSN ITPLPFFQPD PERGRPVGVQ
LEDGSRLIGF PTPSRKLEFY STTMRDWGWP EYAIPTYIHS HVHPSRVDRE RNEAVLLSTF
RLPTLIHTRS GNAKWLYEIS HKNPVWIHPS DAQRLGVRTG DLIKVVTAIG YFIDRVWVTE
GIRPGVIACS HHLGRWRLQE DVGGKLSTAL VELTPMGEAQ WRMRQIHGIQ PYASSDPDTE
RIWWNDAGVH QNLTFAVQPD PVSGMHCWHQ KVRLERVGPD DRYGDIFVDT RRAHEVYREW
LAMTRPACQV SPNGLRRPHW LLRPFRPDLE AYYLPDRHFG NGHIVTVEPC VSDHRK
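
The protein sequence can be checked against the gene sequence by translating the CDS. The Python sketch below builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, which does not matter here because this gene starts with ATG. The names are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGACCGGCT CATTGGTGAA GAACATCCTG CGCGCACTGG CGCAGCCGAC GCACGCATCG"
print(translate(first_row))  # MTGSLVKNILRALAQPTHAS
```

The output matches the first 20 residues of the protein sequence. Translating the whole gene sequence the same way should yield a 1016-residue product if the two sequences agree.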