Gene Information |
Locus tag | RoseRS_3073 |
Symbol | |
ID | 5210041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3860857 |
End bp | 3862584 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596664 |
Product | hypothetical protein |
Protein accession | YP_001277386 |
Protein GI | 148657181 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
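The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag RoseRS_3073.
start_bp, end_bp = 3860857, 3862584

length = gene_length_bp(start_bp, end_bp)
print(length)                     # compare with the Gene Length field
print(protein_length_aa(length))  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields. The strand does not affect either length, only the direction in which the sequence is read.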
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0798426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00432489 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCCA TTGCACCAGA TCATCAGGCT GTGCCGAATG CGCCTTATGC ACCGCCGCTG TGGGCGGATG TGGATCGCGC GCTTGCCGAT CTGCACTCGC AAAAAGATTC CTGGGTTCAG GTAAGCCTCG ACGAGCGCAT CGACCTCCTC GCCGCCTTGC GTCGCTCCCT TGCCGAGGCT GAGGATCGCT GGATTACGAT CAGTCTGGAG TCCAGGGGAT TGGCGCCGGG GGGGTACGGC GAGGGTGAAG AACGCACCTG GTTCACAATC CTGACCCGCG CCCTGCGGTT GATCCATCAG GCGCTGATCG ACGTGCGCGA TCATGGGCGT CCGCATCTTC CTGGCAGTTT GACGACCCGA TCAGATGGGC AGGTGGTGGC GATGGTGCTG CCTGCCAGCC GTTATGACTC AGCTCTCTTC CCCCGGATGA CCGGCGAGGT CTGGATCGAA CCGGGCATGA CCGTCGAAGA AGTGGTGCAG CGCCAGGCGG AGGTCTACCG GAATCCGCCG GAGCGTGGGC GTGTTGCGCT GATATTGAAT GCCAGCCAGT CTTCCTTCCT GCCGGTGACC GATGTGCTGC ACAAACTGTT TGTCGACGGC GATGTGGTGG CGCTGAAGCT GCATTCGACC TGCGCCTCGC TGGCGTCGCT CTTTGAGGAG GTCTTTCGCC CGCTGCTCGA TCAGGGTGTT GTCCGACTGA TCTATGGCGA TAAGGACCTT GATGCGTATC TCTGCGCGCA TGAACTGGTC GATACTATCC ACTTTAGCGG CGCCGATGCC GCTTTCGATA CGCTGATCTT TGGTGCAGGG CAGGAAGGCG TCGAACGTCG CGCGCGGCGT GAGCCGGTGC TGGCAAAGCC GTTCAGCGGC GAGGCGGGTA ATGTCACGCC CGTGCTGGTC GTGCCGGGTC CGTGGAGTGA CGATGATCTG CACCGGCATG CGACTCTTCT GGTGCGTGCG TTCGTGTGCA ACGCAGCGTC TGTTTCTGCC GCTCCGCGCC TGATTGTGCA GCACCGTGGA TGGCGACGGC GTGATTCATT CCTGGCTTCG TTCGAACAGG TGCTGGCGCG TATTCCCACC CGGCAGGCGC CGTTCCCCGA AGCGCGGCGA CAACTCGATG CGCTCCTGAA TGCCCACGCT CGCGCGCATC GTATCGGGCA GGCGGATCTC GATCACCTGC CCTGGACGGT GATCCCCGAC CTCGATCCTT CGTCGCTCGA CGCGGACTGC TTTGTGCGCA CGATGTTCTG CCCGGCTGTT GGTGAACTCG CGCTCGATGC AAAGGATGCA GCGACGTTCA TCGACGCAGC GGTTGAGTTT CTCAATCGGC AGGTTCGGGG AACAGTGGCG GCGACGATCC TTATCCATCC GCGCACGTTG CGTGAACGCC GGGTGCGTGC AGCGTTTGAT CGAGCGTTGA GCGATTTGCA GTACGGCACG ATAACGATCA ATACGGTGGC GCAGCAGGCG GTGCTGGCGG GTGTGCTGCC GTGGGGCGCG TTCCCGATTG CGAACGGCGA AAGGCGGTAT GGCGGCAAAG TGGCGAACCC GCTGATGCTC CCCAACCCGC AAAAATCGAT ACTGCGCGGT CCTTTCAGCA TTCCCCAACT GTTTCTCTCG GTCGAACCGC AACGCAACAT TGAACTGTGC AAAGCGATCA CCCGACTCGA ACAGGCGCCA TCGCCGTGGC GTCTGGCTCA ACTGATGCGT CTGGCGTTGC GGCGGTAA
|
Protein sequence | MTAIAPDHQA VPNAPYAPPL WADVDRALAD LHSQKDSWVQ VSLDERIDLL AALRRSLAEA EDRWITISLE SRGLAPGGYG EGEERTWFTI LTRALRLIHQ ALIDVRDHGR PHLPGSLTTR SDGQVVAMVL PASRYDSALF PRMTGEVWIE PGMTVEEVVQ RQAEVYRNPP ERGRVALILN ASQSSFLPVT DVLHKLFVDG DVVALKLHST CASLASLFEE VFRPLLDQGV VRLIYGDKDL DAYLCAHELV DTIHFSGADA AFDTLIFGAG QEGVERRARR EPVLAKPFSG EAGNVTPVLV VPGPWSDDDL HRHATLLVRA FVCNAASVSA APRLIVQHRG WRRRDSFLAS FEQVLARIPT RQAPFPEARR QLDALLNAHA RAHRIGQADL DHLPWTVIPD LDPSSLDADC FVRTMFCPAV GELALDAKDA ATFIDAAVEF LNRQVRGTVA ATILIHPRTL RERRVRAAFD RALSDLQYGT ITINTVAQQA VLAGVLPWGA FPIANGERRY GGKVANPLML PNPQKSILRG PFSIPQLFLS VEPQRNIELC KAITRLEQAP SPWRLAQLMR LALRR