Gene Information |
Locus tag | Rcas_1941 |
Symbol | |
ID | 5539419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2484902 |
End bp | 2487766 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640894077 |
Product | hypothetical protein |
Protein accession | YP_001432048 |
Protein GI | 156741919 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
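The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the numbers in this record) and that the unspliced CDS ends in a stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length in bp."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values copied from the Rcas_1941 record above.
length = gene_length_bp(2484902, 2487766)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length (2865 bp) and protein length (954 aa).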
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000218078 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGGAGTA AGACGATTTC AGGCGATCCG CCTGCACTAC AGCCAACCGT CAGCGCGTAT GCGCGCCTGA CGCTGCGTTA TCGTGATGCG CGTCTGGCGC TGGTTGTTGT GCTGGCGCTT CTGGTGGGCA TCTTTGCGTA TCAGGCGCCG GTTGATGCGA CGATCTTCGT CGGTTGGCCC GGCGACCGCC TGTTTTTGCA GGCAAGCGAG GGCGCTGGCG CTGCCGACCG GTTCACCTTC TACGGCGATG AAATCACCGC TGACGCGCCG GGGGGGCGCA GCCGCTGGAC GCACCAGGGA GCGCGGATTG ATCTGGCAGG TCTGGGCAAG GGTGCGCTTG TGGTGACGGT GCGCGCGCAG GGGTGGCCCC CCGATGCCCT CAACCGCGCG ACGCGCCAGC CGGAGGTGAC GGTCATAGCG GACGATACGT TCATTGGACG CTTCACGCCA GGCGAACAGT GGGCAGAGTA CGATGTCGCT GTTCCCGCCG AGGTGCGGCG CAGCAATGAC CTGACGCTGA TGCTGGTTGC GTCGGATGTC TTCACCAGCA CGAGCATCTA CAACGATCCT CGTCCAAAAG GAATCCGCAT CGACGCTATT CGGGTGCGCA GCGTCGGTGA TGGTCCATTC GTGCCTGCGC TGGCGCCGGT GCTCTGGTTG ACGGTGGGTG GAGGGGTCTG GTTTCTGGCG CTGGCAGGGG GCACTCGCAA GCCAACGTTT GCCTTCGTTG CTGCGACGCT GCTCGTCAGC GGTGCAGCGC TTGCGCTGGC GGCGGCGCGC ATCTGGACGG TTGTACTGTT GCCATGGCTC GCCGGGGCCG GCATTGCGCT GCTGATCTGG CAGCATCGCG TCGCTCTGGT GCGCCATCCG CTGCGCCTGG CGCGCCGCTT CACGCGCGGG TCGGCGCTCG GCTATGGGCT GCTGGCGGCA ACAGGCGCGT GGCTGGTGGC GATGATCATT CAGTATGCCC CGCCGATGCC GCGCCTGGAA CAGTTCTGGG AGTTTTTTCC CGACTCGCTC ATCTATACGC TGATCACGCT GGGCAGTCTG GCGCTGATCC TGGTCTACGG GCGCAATGGA GTGCCCCGCC TGGCGCAGGC GACGGTGCGC ATGGCAGGTT CGCGGCGCGG TGCGACGCTG ACGCTGACGC TCCTGCTGGC AGTGTGGGTT GGCTATCTGG GGGCGGTCAT TCTGGCGATG CCATATGTCG GGCATGCCGA CTATGCCGAT AATGCGGTCG TGGCGCGCAA TCTCCTTGCC GGACGAGGAT GGACGGTCGA TTATGTGACG CAGTTCTACC GCCTCTACGA CGGTGTCACG CGCCCGCAGG AAACATGGCC CCTGCTGCAA CCGGTGTGGG TGGCGTCTTC GTTTGCGCTT TTTGGCGTCA GTAATTGGGC TGCGAAGATT CCGAACCTGC TGTTTCTCAC TGTGCTGGCG CTGGCGGTCT ATCAGGCTGG CGCGCGCCTG TGGGACCGGC GCGTCGGATT GGTTGCAGTC GTGTTGCTCA TCACCAGTCA TCTCTACTTC AAACTGGTAA TCTATGTGAC GAATGACCTG GGGTTTGCGC TGTTTGCGTT CGGCGCACTG CTGCTGCTGT ATCGCGCCTG GGGTGCGCCG GATGCGATGG CGCTCGCGTC GCGTTTGCGC TTTGGGAGCA TATCGCAGCA ATTGGCGTTG ACGATCCTCT CTGGTGCGTT AACCGGTCTG ATGCTCTTGC AAAAGCCGGG AAGCGGCGGG TTGATCGCGC TGGGCATGGG ATGGTGGTTC CTGCGGGTGC GCTTTGACAC CTGGCCGCGA 
ACCCTGGCCG ATCTGCGCAC GCGACTGGCG CCCGTAGCGG CGTGGGGTGT GGTTGCGTTT GTGTTGCTGG CGCCGTATCT GGCGCGGAAT ATGCTCACCT TCGGCGTGCC GTTCTATTCG ACCGAGGGCA AGGATGCGTG GGTGCTGGAG TACACGACAT GGGATCAGAT CTATGCCGTG TATACCACCG AGGAAGGGTT CAGTACGCTC GGCGTTCCCG ACCGCAGCTG GATTCTGCGG TGGGGATTCG ACCGCACCCT GGTCAAAATG GAGCGACAGG TGCGCGCACT GCGCGACTAT CTGCTACCAT CCTGGCAGCA TGCCCCTTTC GGATTGAGCG AGTGGTTCGG GCGCGCCGAC AAGGATCGCC TGCTGTTTGA TCCAGGCGCG TGGCTGGCGC TCTTCGGCGC TATGGCTGTC ATAGCATCAC ATCGCCGCCT GATGACCCTG CTCGGCGCAG CGTTTGTGCC ATACACGCTG TTTCTGATCG TCTACTGGCA CACGAATGAA GAACGTTACT GGGTTGTGGT GATGCCGTGG CTTGCGCTTT TCGCTTCGGC AACGCTGTGG TGCATCTATG ACCGGATCGC TGCGCTGTCC GATGGTCGTT GGACGCCATT CGGGTTGCTG GCAGTCGTTG CACTGATAGC GGCGATCATC GCGCCGTCCT GGTCTCCAAT CGCCGAAAAG GTGCGCGACG AACCGAACCT GTATCGCGCC GATCTCGACG CCTATGACTG GCTGAAGCGC AACACGCCGT CTGATGCCGT AGTGATGACG CGCAACCCGT GGCAACTCAA CTGGCACAGC GAACGTCCGG CGCTCATGAT ACCGTACACG ACCGACCGGG AGACCTTCCT GCGCCTGGCG CGCCGCTACA ATGTGCGCTA TCTGATGATC GATACCTTGC AGCGCCCTGA GCCGGAAGTG CGCCGCCTGC TCGATGCCAT GATCGCCGAT CCGGCATTGG GGTTCCGCGA AGTCTACCGC ACACCAGTGT ATCGCGCCGA TTTTCGCGGC GTAACGAAGG AGATGATCGC CGTTGTGTAC GCATTCCCGC AGTGA
|
Protein sequence | MRSKTISGDP PALQPTVSAY ARLTLRYRDA RLALVVVLAL LVGIFAYQAP VDATIFVGWP GDRLFLQASE GAGAADRFTF YGDEITADAP GGRSRWTHQG ARIDLAGLGK GALVVTVRAQ GWPPDALNRA TRQPEVTVIA DDTFIGRFTP GEQWAEYDVA VPAEVRRSND LTLMLVASDV FTSTSIYNDP RPKGIRIDAI RVRSVGDGPF VPALAPVLWL TVGGGVWFLA LAGGTRKPTF AFVAATLLVS GAALALAAAR IWTVVLLPWL AGAGIALLIW QHRVALVRHP LRLARRFTRG SALGYGLLAA TGAWLVAMII QYAPPMPRLE QFWEFFPDSL IYTLITLGSL ALILVYGRNG VPRLAQATVR MAGSRRGATL TLTLLLAVWV GYLGAVILAM PYVGHADYAD NAVVARNLLA GRGWTVDYVT QFYRLYDGVT RPQETWPLLQ PVWVASSFAL FGVSNWAAKI PNLLFLTVLA LAVYQAGARL WDRRVGLVAV VLLITSHLYF KLVIYVTNDL GFALFAFGAL LLLYRAWGAP DAMALASRLR FGSISQQLAL TILSGALTGL MLLQKPGSGG LIALGMGWWF LRVRFDTWPR TLADLRTRLA PVAAWGVVAF VLLAPYLARN MLTFGVPFYS TEGKDAWVLE YTTWDQIYAV YTTEEGFSTL GVPDRSWILR WGFDRTLVKM ERQVRALRDY LLPSWQHAPF GLSEWFGRAD KDRLLFDPGA WLALFGAMAV IASHRRLMTL LGAAFVPYTL FLIVYWHTNE ERYWVVVMPW LALFASATLW CIYDRIAALS DGRWTPFGLL AVVALIAAII APSWSPIAEK VRDEPNLYRA DLDAYDWLKR NTPSDAVVMT RNPWQLNWHS ERPALMIPYT TDRETFLRLA RRYNVRYLMI DTLQRPEPEV RRLLDAMIAD PALGFREVYR TPVYRADFRG VTKEMIAVVY AFPQ
|
| |
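The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) predicts: GTG is an accepted start codon and is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table. It is an illustrative assumption, not the pipeline the database used, and the name `translate_cds` is hypothetical.

```python
from itertools import product

# NCBI ordering of codons: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # an initiating codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGGAGTA AGACGATTTC AGGCGATCCG"))  # MRSKTISGDP
```

The printed residues match the first ten residues of the protein sequence in this record. For real work, an established library such as Biopython is a safer choice than a hand-built table.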