Gene Information |
Locus tag | Rcas_1737 |
Symbol | |
ID | 5539215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2232439 |
End bp | 2235513 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893876 |
Product | hypothetical protein |
Protein accession | YP_001431847 |
Protein GI | 156741718 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase; [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
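The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2232439
END_BP = 2235513
GENE_LENGTH_BP = 3075
PROTEIN_LENGTH_AA = 1024


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3075
    print(expected_protein_length(GENE_LENGTH_BP))  # 1024
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows, which suggests the record is internally consistent.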
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.522031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAG ACAATCTTTC CTTCCTCCAC CGGTTTGGCG AGCGGGTCTC GTTCGACCGT ATCGAGCGCA AACTCTACGG TCACGACATT GCCGCCATTC CCGGTCTCGT CACCCCCTTG CTGGGTGACA CGCTTCCCGA TGCCGTGGTG CAACCACAGA ACGAAGAGGA GTTGATCGAA CTGGCGCGCT GGGCGTCCGC CAACCGCGTC CCGCTGACGC CGCGCGGCAA GGCGACCTCC GGCTACGGCG GCGCCGTTCC CCTCCGCAAG GGGGTGGTCG TCGATTTTTA CCGGATGCGC ACTGTGCTGC ACATCGACGC AGCCGATCAA ACGGTCACCG TCGAGCCGGG CATCACCTGG GAACAGCTCG ACAGGACGCT GAAACGCGAG GGGTTGACCT TGCGCCTCTA TCCCACGAGT TACCCGTCAT CCACGGTCGG CGGGTGGCTC GCACAGGGTG GCGCGGGCAT CGGCAGTTAC GAGTTTGGCT ACTTCCGTGA GAATGTGGTA TCGGCGCGTC TGGTGCTGCC GTCGGGTGAG GTGCGCGACC TGCGCGACGC CGATCTCGAT CTGGTCGCTG ATGCGGAAGG GATCACCGGC ATGATCAGCC AGGTGACGCT CCGAGTGATG CGGCTGACCG GCATTCAAAC ACTGGCGCTC GCTGTGTACG ATGCGTATGC GTTCCAGTTG CTGTTGCAGG CGCTGATTGA TCGACGCTTG CCGATCTGGT CGATGAGTTT CATCAACCCG AAGATGGCGG AAATGAAGAA CGAAGCGCCG CTGCGCGAAC ACCACGGTCA TCCCGTCGAG CAGCGCATCA TCCTGCCGAA AGCGTATATT CTCACCCTTG CGTTCCGCGA CGCTGACGCG ACTGCGGTGC AATCCGCCAT TCCGGGAATC GCCGCCGCGA CCGGCGCCGA AATCCTGAGC GATGAGATTG CCCGCCATGA GTGGGACGGG CGCTTCAAGT TGATGACGAT CAAGCGCCTG AGTCCGTCGC TGGTTCCGGC GGAAGTGATT GTGCCGCTCG ACCGCCTGGC AGATACGCTC GATGCGTACA ACCGCCTTGT TGCACAACCG CTCGTCAAAG AGGGCGTCGT CATCCGACTC GGCGTCGATG GCAGCCCGGA AGTTGTCATT CTCGGCTTCA TCCCCGCCGA TCAGCGCTCT TTTTCGTACA ACCTGGTCTT CGGGCTGGCG CTGACCGTGA TGAATATCGC CATCAAAAAC GGCGGGCGCC CTTACGCCAC CGGCATCTAT TTTGCGCGCA AGGCGGATCA GGCGCTGGGC AAAGCGCGCG CGGCGCAACT GAAAGCCTTC AAGTCACGTA TCGATCCGGC TGGCGTGATG AACCCAGGGA AAGTCTTTGG CGGCGGCGCT GTCGGCGCCT TCGTCGATCT GGCGTCGCGC TTTGAGCCGC TGGTGCGGCG CTTCGGCAAC GCCGTCCCTG TGACCATCGG CGAGCGACCA ACCGGTCCGG TGCGTGACAT TCCCGCCGAT GTCGCCTGGT ACGCTTACGC CTGCTCGCAG TGCGGTTACT GCATCGATGA GTGCGACCAG TTCTACGGGC GCGGATGGGA GAGCCAGAGT CCGCGCGGCA AGTGGTTCTG GCTGCGCGAA TACATGGAAG GTCGTGAACA GTGGAACCAG CGCATGGTGG ACACCTTCAT CGCCTGCACC ACCTGTGAAA TGTGCAACCT GCGCTGCTCC GCCAACCTAC CGATTGAACC CGCGTGGATG AAACTGCGCG GTCAACTGAT CACCGAACAG AAAAAGATGA CCTTCCCGCC ATTCGAGATG 
ATGAGCGCCG CACTCACCGC ACAGGGCAAC ATTTGGGCAG GCTACCGCGA ACGGCGCGAC GCATGGTTCC CCAAAGACCT GAACGAGCGC CACGGACCTG ATCATCGCGC GAAGAATGTC TATTTCGCCG GATGCACTGC CAGTTATGTC GAGCAAGACA TTGGCATCGC CAGTGTGCGT CTGCTCGACG CAGCAGGCGT CGATTTCACC TATCTGGGAC CGAAGGAAAA TTGCTGCGCC ACGCCGATGC TGGTTGCCGG TCGCTGGGAC CTGTTTGTCG AAACGATGAA GAAAAACATT GCCGCCGTCA AAGCCGCCGG CGCCGATACT GTCATCAGTT CCTGCCCGGC GTGCGACATG ATGTGGCGAC ACGTCTACCC GGCATGGGCG AAGAAACTCG GCATCGAGTA CAATATCACA GCAAAGCACT ATTCTGAGAT CGTTGCGGAG AAGATCCACG CCAGCGAGTT CGCCTTCCCG CAGACCAACC GGGCGCCGGT GACCGTCACC TGGCACGACT CGTGTCACAT CGGGCGGGTG TCGGGGGTCT ACGAACCGCC GCGTGATCTG ATCAAATCTG TGCCGAACGT CCATTTCGTC GAAATGGCGA GCAACCGCGA CTGCGGGAAG TGCTGCGGCT CGGTGCTGAC GCTGATCAAA GAGCCTGATG TCGCTGCCGA CCTGGGCAAG ACGCGCATTG ATGAAGCGCT CGATATCGGC GCAGAGAAGA TCCTGGCGCT CTGCCCGTGC TGCGAGTTTC AGTTGCGGGT CAGCGCCGAA AAGAAGCGCC TGCCGGTCGA AGTTGTCGAT CTGGCGCACT TCGCTGCCGA AGCGCTTGGA TACGAACTGC CCGACCCGAA CCCGGAAGTG CGCGCGCAGT GGGCTGTATT CGAGGCGATG ATCAAACTGA TGACCCCAGA GGGATTTGCC AGTCTGATGA CGACTATGTG GCCCGAGATG ATCGATGCTA TGCCGTTGGG CATGGGGCGC ATGATGCGGG CGATGGGCAG GGTCCCCGGC GCGCTCGAAG CGATGAAGCC GCTCTTTCCG ATCCTCTTCC CGCGCCTGCT GCCGCTTATG ATGCCGAAAC TGATGCCAGC CATGCTGGCG CGGGTTGGCG CGATGATTCC GATGCCGGAC TACATGGCGG AACAGATGCC GGTGCTGATG CCAAAGGTCG TTGATCGGCT CATGCCGCAT ATGATCGGCG ACGTTGTGCC GCTCGTGACC CAACCACTGA TCGACTATCT GCACGAACAG GCGCGACTGA ACTGA
|
Protein sequence | MKADNLSFLH RFGERVSFDR IERKLYGHDI AAIPGLVTPL LGDTLPDAVV QPQNEEELIE LARWASANRV PLTPRGKATS GYGGAVPLRK GVVVDFYRMR TVLHIDAADQ TVTVEPGITW EQLDRTLKRE GLTLRLYPTS YPSSTVGGWL AQGGAGIGSY EFGYFRENVV SARLVLPSGE VRDLRDADLD LVADAEGITG MISQVTLRVM RLTGIQTLAL AVYDAYAFQL LLQALIDRRL PIWSMSFINP KMAEMKNEAP LREHHGHPVE QRIILPKAYI LTLAFRDADA TAVQSAIPGI AAATGAEILS DEIARHEWDG RFKLMTIKRL SPSLVPAEVI VPLDRLADTL DAYNRLVAQP LVKEGVVIRL GVDGSPEVVI LGFIPADQRS FSYNLVFGLA LTVMNIAIKN GGRPYATGIY FARKADQALG KARAAQLKAF KSRIDPAGVM NPGKVFGGGA VGAFVDLASR FEPLVRRFGN AVPVTIGERP TGPVRDIPAD VAWYAYACSQ CGYCIDECDQ FYGRGWESQS PRGKWFWLRE YMEGREQWNQ RMVDTFIACT TCEMCNLRCS ANLPIEPAWM KLRGQLITEQ KKMTFPPFEM MSAALTAQGN IWAGYRERRD AWFPKDLNER HGPDHRAKNV YFAGCTASYV EQDIGIASVR LLDAAGVDFT YLGPKENCCA TPMLVAGRWD LFVETMKKNI AAVKAAGADT VISSCPACDM MWRHVYPAWA KKLGIEYNIT AKHYSEIVAE KIHASEFAFP QTNRAPVTVT WHDSCHIGRV SGVYEPPRDL IKSVPNVHFV EMASNRDCGK CCGSVLTLIK EPDVAADLGK TRIDEALDIG AEKILALCPC CEFQLRVSAE KKRLPVEVVD LAHFAAEALG YELPDPNPEV RAQWAVFEAM IKLMTPEGFA SLMTTMWPEM IDAMPLGMGR MMRAMGRVPG ALEAMKPLFP ILFPRLLPLM MPKLMPAMLA RVGAMIPMPD YMAEQMPVLM PKVVDRLMPH MIGDVVPLVT QPLIDYLHEQ ARLN
|
| |
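The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below is a hedged illustration of how the gene sequence could be cleaned, its GC fraction computed, and its leading codons translated. It assumes the listed gene sequence is already in coding orientation (it starts with ATG although the Strand field is "-"). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon assignments are those shared by NCBI translation tables 1 and 11, and only the first three blocks of the gene sequence are used as sample input.

```python
# Amino acid assignments shared by NCBI translation tables 1 and 11,
# with codon bases enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons until the first stop codon.

    Assumption: the sequence contains only A, C, G, T; any other symbol
    raises KeyError. Alternative start codons are not treated specially,
    which does not matter here because the gene starts with ATG.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein sequence.
GENE_PREFIX = "ATGAAAGCAG ACAATCTTTC CTTCCTCCAC"
PROTEIN_PREFIX = "MKADNLSFLH"

if __name__ == "__main__":
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
    print(round(gc_fraction(GENE_PREFIX), 3))
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow a comparison with the GC content and Protein sequence rows. That comparison is not carried out here.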