Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2173 |
Symbol | |
ID | 5539654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2793460 |
End bp | 2796510 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894307 |
Product | nitrate reductase |
Protein accession | YP_001432275 |
Protein GI | 156742146 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000388509 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGGCT CATTGGTGAA GAACATCCTG CGCGCACTGG CGCAGCCGAC GCACGCATCG ATCTGGCAGG ACACGGTTGC ACAACCGCAG GCGCCGGTGC AGGCAAGCAT CGTCGATCAG CCCTCGTCGC AAGCGGTGCA GGTTCACGGC GCTGAACGGA CAGCAACCGC CGAACTGAAC GGATACCCAC CGGTCGAACG CTGGCAGCAT TGGACGGAGT ACGACCCCAA GGCATGGCCT CAGAAAGTCG AGCGTTCCTA TACCCTGGTG CCGACGATCT GCTTCAATTG CGAAAGCGCC TGTGGTCTCC TGGCGTATGT TGATACTTCC ACGCTCAAGA TACAAAAGTT CGAGGGAAAT CCCCTACACC CCGGTAGTCG GGGGCGCAAC TGCGCCAAAG GACCGGCAAC GCTCAATCAG GTCTACGACC CGGATCGCAT TCTTTACCCG CTCAAGCGGG TCGGTCGGCG CGGCGAGGGG AAGTGGAAGC GCGTCAGTTG GGACGAGGCG CTCGACGACA TTGCCGGGCG CATTCGTCGC GCTCTCGTCG AAAAGCGTTT GACTGAAATC ATGTATCACG TCGGTCGTCC CGGTCATGAT GGCATTATGG AATGGGTGCT GCCTGCCTGG GGTGTAGATG CCCACAACTC GCACACGAAT GTCTGCTCAT CGAGCGCACG GTGTGGTCAG GCGTTGTGGA TGGGGTATGA TCGCCCGTCG CCGGATCATG CGCATGCGCG CGTCATTCTG TTGATCAGTT CCCATCTCGA AACCGGCCAT TACTTCAACC CGCACGCGCA GCGGATTATG GAAGGGAAGA TGGCAGGCGC GAAATTGATC GTCCTCGATA CCCGTCTGTC GAACACCGCT TCGCTGGCGG ACGAGTGGCT TGCTCCCTGG CCCGGCAGCG AGACGGCCAT TCTGCTGGCA ATCGCCAGGC ATCTGATTGT CGGCAAGAAG TACGACCGCG ACTTTGTGCG TCGCTGGGTC AACTGGGAGC AGTATCTGCG TTGCGAGCAC CCTGATCTGC CGGTGCGCTT CGAGACCTTC GAGGCGAAGT TGGAGGAACT TTACGCTTCC TTCACATTTG AGTTTGCCGC GCAGGAGAGT GGCGTCAGCG CCGAACAGAT CGCGCGCGTG GCGGACTATA TCGCGCAGTG CGACGGTCGC CTCGCCACCC ATACCTGGCG CAGCGCCACG AGCGCCAATC TGGGCGGGTG GATGGTCGCG CGTTGTCTCT GGTTCCTGAA TGTGTTGACC GGCTCGATTG GACGGGAAGG CGGCACATCG GCGAATGTTT GGGACAAATG GGTTCCACGC CATCCCAATA TGGCGCCGCA CGTCCAGGTT TGGAACGAAC TGACCTGGCC CCAGGAATAC CCGCTCAGTT TCTACGAACT GAGCTATCTG CTCCCTCACT TTCTCAAAGA AGGTCGCGGC AGGGTTGATG TTTATTTCAC TCGTGTCTAC AATCCCCTCT GGACCAACCC GGACGGTATG AGCTGGATGG AAGTGCTGAC CGATGAATCG AAGATCGGGT TGCATGTTCA TCTTTCGCCC AGCTGGAGCG AAACCGGTTT GTTTGCCGAC TATATTCTGC CAATGGGTCA TGGCGCCGAA CGTCACGATA TCATGAGCCA GGAAACGCAT GGCGGTTGCT GGATCGCCTT TCGCCAGCCG GTGATCCGCG AAGCATTGCG CCGCCTTGGC AGACCGGTCA ACGACACGCG CCAGGCGAAC CCCGGTGAGG TGTGGGAAGA GACGGAGTTC TGGATCGAAC TCTCGTGGCG TATCGATCCT GATGGCAGCC TGGGAGTGCG CCGAACGTTT GAAAGTCCCT ACCGGCCCGG CGAGAAGATC ACGGTCGATG AACTCTATGC CTGGATGTTC GAGAATCATG TGCCCGGATT GCCCGAAGCG GCGGCGAAGG AAGGGTTGAC GCCGCTGGAA TATATGCGTC GCTATGGCGC ATTCGAGTTG CGCAAAGGCG TTCAACCGAC CTACGATCAA CCACTGACGG CGGCAGAACT CGAAGACGCG ACGATAGACC CGGAGACGCA GGTGGTCTAC ACCAGAAAGC CTGCCGCGCC TTCCTCGAAT ATCACACCGC TCCCCTTCTT CCAACCAGAC CCGGAGCGTG GTCGTCCAGT CGGCGTGCAA CTCGAGGATG GTTCACGCCT GATCGGTTTT CCTACACCCT CGCGCAAACT GGAGTTCTAC TCGACGACGA TGCGCGATTG GGGTTGGCCC GAGTATGCCA TCCCGACCTA CATCCATAGC CATGTGCATC CCAGCAGGGT TGACCGTGAA CGAAACGAAG CCGTGTTACT CTCGACCTTC CGCCTGCCCA CCCTCATCCA TACTCGTAGC GGTAATGCCA AATGGCTTTA CGAGATCAGC CATAAGAACC CGGTCTGGAT CCATCCATCC GATGCGCAGC GGCTTGGCGT TCGGACTGGC GATCTGATCA AGGTCGTGAC CGCTATTGGC TACTTTATTG ACCGTGTCTG GGTGACAGAA GGCATCCGCC CTGGCGTGAT CGCCTGTTCT CATCATCTTG GGCGCTGGCG GCTTCAGGAG GATGTTGGCG GCAAACTCTC CACCGCGCTC GTCGAACTGA CGCCGATGGG CGAGGCGCAA TGGCGGATGC GCCAGATCCA TGGGATTCAG CCGTATGCCA GTTCCGACCC CGACACCGAG CGTATCTGGT GGAACGACGC GGGAGTGCAT CAAAACCTGA CGTTTGCTGT TCAGCCGGAT CCGGTCAGCG GCATGCACTG CTGGCATCAG AAGGTGCGAC TCGAGCGTGT AGGACCAGAC GACCGCTATG GTGACATTTT CGTCGATACG CGCCGCGCGC ATGAGGTGTA TCGTGAGTGG CTGGCGATGA CCCGTCCTGC GTGTCAGGTG TCGCCGAATG GTCTTCGGCG ACCGCACTGG CTTCTGCGTC CATTCCGCCC CGATCTCGAA GCGTACTATC TTCCCGACCG ACATTTCGGC AATGGGCATA TCGTCACGGT CGAACCATGC GTATCAGATC ACAGGAAATG A
|
Protein sequence | MTGSLVKNIL RALAQPTHAS IWQDTVAQPQ APVQASIVDQ PSSQAVQVHG AERTATAELN GYPPVERWQH WTEYDPKAWP QKVERSYTLV PTICFNCESA CGLLAYVDTS TLKIQKFEGN PLHPGSRGRN CAKGPATLNQ VYDPDRILYP LKRVGRRGEG KWKRVSWDEA LDDIAGRIRR ALVEKRLTEI MYHVGRPGHD GIMEWVLPAW GVDAHNSHTN VCSSSARCGQ ALWMGYDRPS PDHAHARVIL LISSHLETGH YFNPHAQRIM EGKMAGAKLI VLDTRLSNTA SLADEWLAPW PGSETAILLA IARHLIVGKK YDRDFVRRWV NWEQYLRCEH PDLPVRFETF EAKLEELYAS FTFEFAAQES GVSAEQIARV ADYIAQCDGR LATHTWRSAT SANLGGWMVA RCLWFLNVLT GSIGREGGTS ANVWDKWVPR HPNMAPHVQV WNELTWPQEY PLSFYELSYL LPHFLKEGRG RVDVYFTRVY NPLWTNPDGM SWMEVLTDES KIGLHVHLSP SWSETGLFAD YILPMGHGAE RHDIMSQETH GGCWIAFRQP VIREALRRLG RPVNDTRQAN PGEVWEETEF WIELSWRIDP DGSLGVRRTF ESPYRPGEKI TVDELYAWMF ENHVPGLPEA AAKEGLTPLE YMRRYGAFEL RKGVQPTYDQ PLTAAELEDA TIDPETQVVY TRKPAAPSSN ITPLPFFQPD PERGRPVGVQ LEDGSRLIGF PTPSRKLEFY STTMRDWGWP EYAIPTYIHS HVHPSRVDRE RNEAVLLSTF RLPTLIHTRS GNAKWLYEIS HKNPVWIHPS DAQRLGVRTG DLIKVVTAIG YFIDRVWVTE GIRPGVIACS HHLGRWRLQE DVGGKLSTAL VELTPMGEAQ WRMRQIHGIQ PYASSDPDTE RIWWNDAGVH QNLTFAVQPD PVSGMHCWHQ KVRLERVGPD DRYGDIFVDT RRAHEVYREW LAMTRPACQV SPNGLRRPHW LLRPFRPDLE AYYLPDRHFG NGHIVTVEPC VSDHRK
|
| |