Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSc2373 |
Symbol | |
ID | 1221219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia solanacearum GMI1000 |
Kingdom | Bacteria |
Replicon accession | NC_003295 |
Strand | - |
Start bp | 2575415 |
End bp | 2578414 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637238773 |
Product | formate dehydrogenase large subunit |
Protein accession | NP_520494 |
Protein GI | 17547092 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00446398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTCTGA CCCGCAAACC GGCGGCGCAG GATGCCAAGA CCGGGCAGGC AAGCCGCCCC GGCGCCGCGA CGCTGTCCGG CAGCCTGGCG CGCGGCCTGG CCGGCGCGCT GCCGACCATG GACCGGCGGA CCTTCCTCAA GCGCTCCGGC ATCGGCGTGG GCGCGGGGCT GGCGGCGTCG CAGCTGTCGC TGGTGAAGAA GGCCACGGTG GTGGGCGAGG CCAGGGCGGC CGAAGGCAAG GACGACATCG TGGTGCGGCG CACCGTGTGC TCGCATTGCT CGGTGGGCTG CGCGGTGGAT GCGGTGGTGC AGAACGGCGT GTGGGTGCGC CAGGAGCCGG TGTTCGACTC GCCCATCAAC ATGGGCGCGC ACTGCGCCAA GGGCGCCGCG CTGCGCGAGC ACGGCCACGG CGAATACCGC CTGAAGACGC CGATGAAGCT GGTGGACGGG CGCTACCAGC GCATCAGCTG GGAGCAGGCG CTCAACGAGA TCGTCGCCCG GATGAACGCC ATCCGCCAGG AGACCGGGCC CGATTCGTTC TTCTTCGTCG GCTCGTCCAA GCACAGCAAC GAACAGTCGT ATCTGCTGCG CAAGTGGGTG TCGTTCTTCG GCACCAACAA TTGCGACCAC CAGGCACGCA TCTGCCACTC CACCACCGTG GCGGGCGTGG CGAACACCTG GGGCTACGGC GCGATGACCA ACTCGTACAA CGATATGCAG AACGCCAAGT GCGCGCTGTA CATCGGCTCG AACGCGGCGG AGGCGCACCC GGTGTCGATG CTGCATCTGC TGCATGCCAA GGAGACCGGT TGCAAGGTGA TCGTGGTCGA CCCGCGCTAT ACGCGCACGG CGGCCAAGTC GGACGAGTAC GTGCGCATCC GCTCGGGCAC CGACATCGCC TTCCTGTTCG GCGTGCTGCA CCACGTCTTC AAGAACGGCT GGGAAGACCG GCAGTTCATC CATGACCGTG TCTACGGCAT GGACAAGGTA CGCGACGAGG TGCTCGCCAA GTGGACGCCG GACAAGGTCG AGGCCGTCTG CGGCGTGCCC GAGGCGCAGG TGCGCAAGGT CGCCGAGATG ATGGCGATGA ACCGCCCGAG CACGCTGGTG TGGTGCATGG GCCAGACCCA GCACACCATC GGCAACGCCA TCGTGCGCGC TTCGTGCATC GTGCAACTGG CGCTGGGCAA CATCGGCCGG CCGGGCGGCG GCGCCAACAT CTTCCGCGGC CACGACAACG TGCAGGGCGC CACCGACGTC GGCCCCAACC CCGATTCGCT GCCCGGCTAC TACGGCCTGG CCGCCGGCGC GTGGAAGCAC TACGCCGCCG TGTGGGGCGT GGACTACGAC TGGATCAAGG GCCGCTTCGC CTCACAGGCG ATGATGGAGA AATCCGGCAC CACCGTCTCG CGCTGGATCG ATGCGGTGCT GGAGAAGAAC GAACTGATCG ACCAGGACCA CAACGTCCGC GCGATGTTCT ACTGGGGCCA CGCGCCCAAC TCGCAGACGC GCGGCCTGGA GATGAAGCGC GCGCTCGACA AGCTCGACCT GCTGGTGGTG ATCGACCCCT ATCCGTCGGC CACCGCGGCG ATGGCCAACA TGCCCGCCGC CGACGGCACG CCGCCCAATC CGAACCGCGC GGTCTACCTG CTGCCGGCCG CCACGCAGTT CGAGACCTCG GGCTCGTGCA CCGCGTCCAA CCGCTCGCTG CAATGGCGCG AGAAGGTGAT CGAGCCGCTG TTCGAGTCGC AGCCGGACCA CGTCATCATG CAGGCCCTGG CCGATCGGCT CGGCTTCGGC AACGAGCTGT CGAAGCGCCT GAAGGTCACC GAGTACAAGC GCGCCGGCAT GACCTGGCGC GAGCCCGGGA CCGAGTCGAT CCTGCGCGAG ATCAACCGCA GCAACTGGAC CATCGGCTAC ACCGGCCAAT CGCCGGAGCG CCTGCAGGCG CATATGCGCA ACATGCACCT GTTCGACGTG CGCACGCTGC GCTGCCAGGG CGGCAAGGAC CCGGTCACCG GCTACGATCT GACCGGCGAC TACTTCGGCC TGCCGTGGCC GTGCTATGGC ACGCCGGAAT TGAAGCACCC CGGCTCGCCC AACCTGTACG ACACCAGCAA GCACGTGATG GACGGCGGCG GCAACTTTCG CGCCAACTTT GGCGTGGAGC GCGACGGCGT CAATCTGCTG GCCGAGGACG GCTCGTATTC GCTCGGTGCC GAGCTGACCA CCGGCTACCC CGAGTTCGAC CACGTGTTTC TGAAGAAGCT GGGCTGGTGG GACGACCTCA CCGATGTCGA GAAGAAGGCC GCCGAAGGCA AGAACTGGAA GACCGACCTG TCGGGCGGCA TCCAGCGCGT GGCGATGAAG CACGGCTGCC ACCCCTTCGG CAATGCCAAG GCGCGCGCCG TCGTGTGGAA CTTCCCCGAC CCGATCCCGC AGCACCGCGA GCCGCTGTAT TCCACACGGC CCGACATGGT CGCCCAGTAC CCGACGCACG ACGACAAGAA GGCCTTCTGG CGCCTGCCGA CGCTGTACAA GACCGTGCAG CAGCGCAATA TCGAAAACAA GGTCTATGAG AAATTCCCGA TCATCCTGAC CTCCGGCCGG CTGGTCGAGT ACGAGGGCGG CGGCGAAGAG ACGCGCTCCA ACCCTTGGCT GGCGGAGCTG CAGCAGGAGA ACTTCGTCGA GATCAACCCG AAGGCCGCCG CCGACCGGGG CATCCGCAAC AACGACTACG TGTGGGTCAA GACGCCGACC GGCGCGCAGA TCAAGGTGAG GGCGCTGGTG ACCGAGCGCG TGGGCGTGGA CCATGCCTTC ATCCCGTTCC ACTTCTCGGG CTGGTGGCAG GGCAAGGACC TGCTCGACTA CTACCCCGAG GGCGCTCACC CGATCGTGCG CGGCGAGGCG GTCAACACGG CCACGACGTA CGGCTACGAC TCGGTGACGA TGATGCAGGA AACCAAGACG ACGGTCTGCC AGATCGAGCG TTTTGCCTAA
|
Protein sequence | MLLTRKPAAQ DAKTGQASRP GAATLSGSLA RGLAGALPTM DRRTFLKRSG IGVGAGLAAS QLSLVKKATV VGEARAAEGK DDIVVRRTVC SHCSVGCAVD AVVQNGVWVR QEPVFDSPIN MGAHCAKGAA LREHGHGEYR LKTPMKLVDG RYQRISWEQA LNEIVARMNA IRQETGPDSF FFVGSSKHSN EQSYLLRKWV SFFGTNNCDH QARICHSTTV AGVANTWGYG AMTNSYNDMQ NAKCALYIGS NAAEAHPVSM LHLLHAKETG CKVIVVDPRY TRTAAKSDEY VRIRSGTDIA FLFGVLHHVF KNGWEDRQFI HDRVYGMDKV RDEVLAKWTP DKVEAVCGVP EAQVRKVAEM MAMNRPSTLV WCMGQTQHTI GNAIVRASCI VQLALGNIGR PGGGANIFRG HDNVQGATDV GPNPDSLPGY YGLAAGAWKH YAAVWGVDYD WIKGRFASQA MMEKSGTTVS RWIDAVLEKN ELIDQDHNVR AMFYWGHAPN SQTRGLEMKR ALDKLDLLVV IDPYPSATAA MANMPAADGT PPNPNRAVYL LPAATQFETS GSCTASNRSL QWREKVIEPL FESQPDHVIM QALADRLGFG NELSKRLKVT EYKRAGMTWR EPGTESILRE INRSNWTIGY TGQSPERLQA HMRNMHLFDV RTLRCQGGKD PVTGYDLTGD YFGLPWPCYG TPELKHPGSP NLYDTSKHVM DGGGNFRANF GVERDGVNLL AEDGSYSLGA ELTTGYPEFD HVFLKKLGWW DDLTDVEKKA AEGKNWKTDL SGGIQRVAMK HGCHPFGNAK ARAVVWNFPD PIPQHREPLY STRPDMVAQY PTHDDKKAFW RLPTLYKTVQ QRNIENKVYE KFPIILTSGR LVEYEGGGEE TRSNPWLAEL QQENFVEINP KAAADRGIRN NDYVWVKTPT GAQIKVRALV TERVGVDHAF IPFHFSGWWQ GKDLLDYYPE GAHPIVRGEA VNTATTYGYD SVTMMQETKT TVCQIERFA
|
| |