Gene Information |
Locus tag | RoseRS_2959 |
Symbol | |
ID | 5209927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3700954 |
End bp | 3703794 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640596552 |
Product | hypothetical protein |
Protein accession | YP_001277274 |
Protein GI | 148657069 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
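The coordinates and lengths in the Gene Information table can be cross-checked against one another. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 3700954
end_bp = 3703794
gene_length_bp = 2841
protein_length_aa = 946

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codons = span // 3
residues = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("residues:", residues)
print("consistent:", span == gene_length_bp and span % 3 == 0 and residues == protein_length_aa)
```

Under these assumptions, the span divides evenly into codons and agrees with both the listed gene length and the listed protein length.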
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.941991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGAGTA AAACGATTTC GGGCGATCCA CCTGCCTGGC AGCCGTCCGC TGGCACGTAT GCGCAGATGG TGCTGCATTA TCTGGATATG CGCCTGGCGC TGATCATCGC GCTGGCGGTG CTGGCTGGCA TCTTCGCGTA TCAGGCGCCG GTCAATACGA CGATTCTCGT CGGTTGGCCC GGCGACCGCT TGTTTTTGCA GGCGAGCGAG GGCGCTGGCG CTGCGGATCG CTACACGTTC TATGGGGACG AACTCACCGC AGACGCACAA AGTGGACGGA GTCGCTGGAC GCACCAGGGG GCGCGGGTCG ATCTGGCTGG TCTGGGAGAA GGCGCGCTCG TTGTGACGGT GCGCGCTCAG GGTTGGCCCG CCGATGCACT GAACAGCGTG ACGCGCCAAC CGGAGGTGAT CGTTGCAACC CATGATGCGC CGATCGGTCG ATTCACCCCT GATGAGCGAT GGGCGGAGTA TGAGTTTGCC ATCCCGGTCG AGGCGCGTCG CGGCGCCGAC CTGATATTGA CGTTCACCGC CTCGGACGTT TTCACCAGCA CAAGTGTCTA CACCGATCCG CGGCCCAAAG GGGTTCGGAT CGAATCGATC AGCGTGCGCA GCGCCAGCGA TGGTCCGTTC ATGCCGGTTG TTGCGCCGGT GTTCTGGCTG GCGGTGAATG GCGTCGTCTG GTTCCTGGCG CTTGCCGCAC TCACGCGCCG ACCGACAGGC GCCTTTGTTG TCGCTACGTT GCTGGTCAGC GGGGCAGCGG TTGCTCTGGC AGCGTTGCGT ATCTGGGCGG TTGCGTTGCT GCCGTGGTTC GCCGGTATTG GCGCTGTTCC GCTGATCTGG CAGTACCGTG CGCTGCTCAC AAGATACCCC TTGCGGTTGA TACGCCGCTT TGCGCGTGGT GCATCTCTTG GGTATGGGTT GCTCGGCGCG GTTGCCGTGT GGCTGATGGC GATGGTGACG CTTTACGCTC CTCCATTGCC GCGTCCGGAG TGGTTCTGGG AGTTTTTTCC GGACTCGCTG GTCTCTACCC TGATCATTGC TGGCGTTCTG GCGCTGATAC TCATCTATGG TCACAACGGT TTACCGCGTC TCGTCCAGAA TACGGTGGAT GCTATCGGTT CGCAACGAGG AGCGGCATTG CTGCTGGCGC TGATGCTGGC GGTATGGGTC GGGTATCTGG GATCGGTCAT CATGGCACTG CCGTATGTCG GTCATGCCGA TTATGCGGAT AATGCGGTCG TGGCGCGCAA TCTGCTCGCC GGTCGCGGAT GGACGGTCGA TTATGTGACG CAGTTCTACT ATCTCTATGA TGGCGTCACG CGCCCGCAGG AGACCTGGCC CCTGCTCCAA CCGGTGTGGG TGGCGTTCTC GTTTGCGGTG TTCGGTGTCA GCGCATGGGC GGCGAAAGTT CCCAATCTGC TGTTCATGAC GCTGCTGGCG CTGGCGGTCT ACCGTGCTGG TGCGCGCCTG TGGGATCGAC GCGTCGGACT GGTCGCCGTT GTGTTGCTGA TCACCAGTCA CCTCTATTTC AAACTGGTGA TCTATGTGAC GAATGACCTG GGATTTGCGC TGTTTACGTT TGGGGCGCTG ATCGCGCTGT ACCGCGCCTG GGGTGCGCCG CAGGCGGGTA TGTCTGCGCG AAAGGTTGCG CTTCTGACCG CCGTTTCTGG TGCGTTGACC GGACTCATGC TGCTTCAGAA GCCGGGAAGC GGCGGGTTGA TCGCGTTGGG AATGGGGTTG TGGTTCCTGT GGGTGCGCTT CGACGCGCTG CCGCGAACCA TGGCAGACCT GCGCGTGCGT 
CTTGCCCCGG TTGTCGGGTG GAGTGTTGTT GCTTTTGCGT TACTCTCCCC CTACCTGGCG CGGAATATGC TCACCTTCGG CGTGCCGTTC TATTCAACCG AAAGCAAAGA TGCATGGGTG TTGGAGTATA CGACGTGGGA TCAGATCTAT GCGGTGTATA CCTCTGAGGA AGGGTTGAGC ACGCTGGGTG TTCCGGATCG AAGCTGGATT CTGCGCTGGG GATTTGACCG CACCCTGGTG AAACTGGAGC GTCAGGTTCA GGCATTGCGC GATTATCTGA TCCCTTCCTG GGAGCGTGCT CCGTTTGGGA TGGGCGAGTG GTTCGGGCGC GCCGACAAGA GGCGTTTGTT GTTTGAGGCG GGCGCGTGGC TGGCGCTTTT TGGCGCCGTT GGCGCTGCGG TCGCTCACCG CCGCCTGATA ATGCTTCTTT GCGCTGCTTT TGTGCCATAT ACTCTGTTTC TGATCGTCTA CTGGCACACG AACGAAGAGC GCTACTGGGT TGTGATCATG CCGTGGCTGG CGCTCTTTGC CGCTGCTGCG ATCTGGCGCA TCTACGATAG AATTGCCAGG ATCTCCAACC GGCGATGGGC GCCGTTCGGG TTGATGGCAG TTGTCGCGTT GACTGCGGCT ATCATTGTCC CTTCGTGGCC CGAAATCGCC GAAAAGGTGC GCGATGAACC TCACCTCTAC CGCGCCGATC TCGACGCTTA TGCCTGGTTG CAGCGTTCTA CGCCGCCCGA TGCGGTCGTT ATGACGCGCA ATCCGTGGCA ACTCAACTGG CATAGCGAGC GTCCGGCGCT GATGATTCCA TACACAACCG ATCAGGAGAC GTTCCTGCGC CTGGCGCGCC ACTACCGTGT GCGGTACCTG GTGCTCGATA CGTTGCAGCG TCCGGCGCCA GAGGTGCGAC GGATGCTTGA TGCAATGATT GCCGACCCGC ACCTGGGTTT CCGTGAAGTC TATCGCACAC CGACGTACCG CGCCGATTTC CGGGGTGTGA CGAAGGAGTT GACCGCAGTG GTGTACGCAT TCCCGGAGTA A
|
Protein sequence | MRSKTISGDP PAWQPSAGTY AQMVLHYLDM RLALIIALAV LAGIFAYQAP VNTTILVGWP GDRLFLQASE GAGAADRYTF YGDELTADAQ SGRSRWTHQG ARVDLAGLGE GALVVTVRAQ GWPADALNSV TRQPEVIVAT HDAPIGRFTP DERWAEYEFA IPVEARRGAD LILTFTASDV FTSTSVYTDP RPKGVRIESI SVRSASDGPF MPVVAPVFWL AVNGVVWFLA LAALTRRPTG AFVVATLLVS GAAVALAALR IWAVALLPWF AGIGAVPLIW QYRALLTRYP LRLIRRFARG ASLGYGLLGA VAVWLMAMVT LYAPPLPRPE WFWEFFPDSL VSTLIIAGVL ALILIYGHNG LPRLVQNTVD AIGSQRGAAL LLALMLAVWV GYLGSVIMAL PYVGHADYAD NAVVARNLLA GRGWTVDYVT QFYYLYDGVT RPQETWPLLQ PVWVAFSFAV FGVSAWAAKV PNLLFMTLLA LAVYRAGARL WDRRVGLVAV VLLITSHLYF KLVIYVTNDL GFALFTFGAL IALYRAWGAP QAGMSARKVA LLTAVSGALT GLMLLQKPGS GGLIALGMGL WFLWVRFDAL PRTMADLRVR LAPVVGWSVV AFALLSPYLA RNMLTFGVPF YSTESKDAWV LEYTTWDQIY AVYTSEEGLS TLGVPDRSWI LRWGFDRTLV KLERQVQALR DYLIPSWERA PFGMGEWFGR ADKRRLLFEA GAWLALFGAV GAAVAHRRLI MLLCAAFVPY TLFLIVYWHT NEERYWVVIM PWLALFAAAA IWRIYDRIAR ISNRRWAPFG LMAVVALTAA IIVPSWPEIA EKVRDEPHLY RADLDAYAWL QRSTPPDAVV MTRNPWQLNW HSERPALMIP YTTDQETFLR LARHYRVRYL VLDTLQRPAP EVRRMLDAMI ADPHLGFREV YRTPTYRADF RGVTKELTAV VYAFPE
|
| |
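The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is illustrative only: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks and the last two blocks of the Gene sequence field are used here rather than the full sequence. It also checks the start and stop codons; under translation table 11, GTG is an accepted alternative start codon and is read as methionine at initiation, which is consistent with the protein sequence beginning with M.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks and last two blocks of the Gene sequence field (excerpts only).
head = clean_sequence("GTGAGGAGTA AAACGATTTC")
tail = clean_sequence("TCCCGGAGTA A")

start_codon = head[:3]
stop_codon = tail[-3:]

print("start codon:", start_codon)
print("stop codon:", stop_codon)
print("GC fraction of excerpt:", gc_fraction(head))
```

Applying `gc_fraction` to the complete cleaned gene sequence, rather than to the excerpt, would be the way to compare against the GC content listed in the record.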