Gene Information |
Locus tag | Rcas_1689 |
Symbol | |
ID | 5539165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2175078 |
End bp | 2178272 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640893826 |
Product | hypothetical protein |
Protein accession | YP_001431799 |
Protein GI | 156741670 |
COG category | [R] General function prediction only |
COG ID | [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation |
TIGRFAM ID | |
|
|
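The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. These assumptions are consistent with the values in this record but are not stated by it, and the function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed for Rcas_1689.
    length = gene_length_bp(2175078, 2178272)
    print(length, protein_length_aa(length))
```

With the values listed for Rcas_1689, this gives 3195 bp and 1064 aa, matching the Gene Length and Protein Length fields.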
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0104584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000757543 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGATTG TCATTCTCAT ATTCATTGTT CTGACGCTCA TCTTGATCGG CGCGATCATC GCTCACGCGC TGCCTGTCGC ATGCCGCGCC GATGATCCGC TCGAGCGATA TTTTGAGTAT GCGCTGATCG GTGCGTTGCT CAACGGATGG CTGGCGTTCA CCCTGGCGCA GATCGGCGCC TTCTCCGCGC TGCTGCACGC CGCTATCATT GCCGTTCTTT GCCTTATCGC TCTGCGGATC GGTCGCCGCC ACGCGCCCAA CACTCCGACC ACGCCAGCAG ATGTGTGGAG GCGGTGCATT GCCTGGGTCA GGCAGTGCAC TGCACTGCCT CTACGGGAAC GTTCAACATC CCTCCTCCTT GTGGTCCTCC TTCTCGTTTT CGCTCTCCTC GTTTCTCGCC CCTTCGAGAC GATCATTGGC GTGCGCGATG CTGGCGTGTA TGCCAACGCC GGGTTCATTA TGGCGCGCAC CGGTTCACTT ACCTTCACCG ATTCAGTGGT GGCGCAGATC GCCGCCGATC AGCAGTCGTC CGATCCTGAG ATTGCCGATG CAGCGCGTCA GGCGGAAACC AACTTCCTAG GCGTGCAGAA TCCGCAACGG TTTATCGCCA CACGCCTGCG CGCTGCCGGC TTCTTCATCG ACCAGGGAGA CCTGGCGCGC GGGCGCGTCG TGCCGCAGGG GTTCCACCTC TTCCCGGCAT GGATTGGACT GATCGCTGCG TTCTTCGGAT TGCGCGCCGG ATTACTCGCG CCCGCCGTCA CCGGTCTGCT CGGCATCTGG AGCGTCGCCA TGCTCACCCG TCGCCTCGCC GGTCCGTGGG TTGCGCTCGT GGCGGCGCTC TTCCTGACGC TCAATGCCGT GCAGGTCTGG TTCAGCCGCT ATACGACCGC CGAAACCGCC GCGCAGTTCC TTGTCTTTGC CGGACTCTAC GCCTTTGCCG CCGCATTTGG ACATTCGATG GCAAACGTTC AACGCACCAT GCTCCTCGCC CTCCTCGCGG GGCTGGCATT CGGGCAACTC GCGCTGACGC GCATCGAATT CTTCCTGGTT CTCGGTCCGG TGGCGCTCTA CCTGCTGCAC GCCTGGCTCG CGCGTCGCTG GACGCTGCCG CATACCGCGC TGCTGGCGGG CATCGGAGCG ATGGTGCTGC ACGCCGGGTT GCACATCGCA TTCCTGGCGC GCGCCTATTT TTTCGATACG CTCTTCGCCC GCCTCCAGGA TTTTGCCCTT ACTGCGGCAT TCGCCCTGCC ATTCCTCACT CCAACGCTGC GTCAGGTTTA CCTGTTGCGC CCCTGCTCGC GCCTGACGAT GCAGCCCTGC CCGCCGATTG CCGGCATGCC GCCGACTGCC GATGCGCCGC TGAACTGGAC CCGTATCGGC ATCGAGGCGC TGGTCGTTGT CGTGGTTGTG GCCGCCCTGG TCGCAATCCG CCGCCTGAAC CTGATCGCCC GCGTCTCGCC GCTGCTGGTG CGCCTGAACC GTCCGTTGCG TCTGGTTGCG GCAATTGCCA TCATCGGGAT CGGTGGATAT GCGTACCTGA TCCGTCCGCA GATTCTGTCG CTGCCGGTCA TCACCGCGCT CCCCGCATGC CTTGCCCCTG AACAACTGAC GAACCCGCAG GGGGCATGCC TGACGCTCCA GGGGTATGTC GGCGCGCCAA TTGCGACGCC AGCCTATGTG GACCCGCTCG CCGCGTGGTT CGACCGCGCC ATTGGCGCTG TGCGCGGGCG CGCCGCTCCG CCGCTGGACG CCTGTATCGC GCTGCGGCGC TCCACGTTGC CGCCAACTGC CGATGGACGC 
ACCATACCGG AGGTTCTTCG AGACGGCTTG CTCGACGAAA CCGACGTTCC GCCTGAGATG CTGGCAACCC TCCGCGCTTG CGACCGCTAC GTGCTGCGCG ATCTGTTCGG CGCAGCGCAG GCGAACCTGG TGCGCCTGGG ATGGTATCTT TCGCCGCCGG GCATCGCGCT CGCGCTCATC GGACTGGCGC TCCTCGCCTA TCGCGCCAAC TCGACCTCCT GGTTGTTTCT GGTCATCGCC GCTGTTGCGT CGGTCGTGTT CCTGCGACTG ACCTACGGAA CCAGCGACCA GCACTACATC TATATTATGC GCCGCTACGT GCCGCACGTG TATCCCGCAT TCGCCATCGG CGCCGCCTAC GCCATCGCTC GCCTCTCGTT CAACGTTCAA CCCTCAACGT TCCACATTCC ACGTTCCACG TTTTCCCGTC TCATCCTCAC TCTCGTCCTC GTTCTGTTTC TGGTCGTTAC AGGCAGACCG ATCTACCGTC ACACTGAATA CGCTGGCGCG CTCGATCAGA TCGGCGCTAT GGCAGGGCAG TTCGATCCCG GCGCAATTGT CCTCATGCGC GGCGGTGCGC CTTCCTTCGC ACAGGCGCGC GACATCCCCG ACCTGCTTGC CACACCGCTC ACCTTCGCCT TCGGTATTGA CGCATTTGCG CTAAAAAGCC GCGACCCCGG ACGATACGCG CCGCAACTGG CGCGCTACAT CCGCCGCTGG CACGACCAGG GACGACCGGT GTATCTGGCA ATCGGCGCGA GTGGCGCAAT TGCGCTACCA GAGTGGCGAC TCGAACCGGC TGGACGTCTG CACGTCGACC TGCCAGAGTA CGAACAGCCG ACCGATCACA AGCCCTCCGC CGTTCAGCGC TTCACCCTCG ATTTTGCGCT CTACCGCCTG TTGCCCGACG AACCGACGCC AGCGGAACCG CCAGCGCTCA CCATCGCGCC CGACGACTAC GCCTATCAGG TGCGCGGCGT TTACCGCGCC GAACGCATCG GCGACCGCCT GATCGCCTGG ACCGATGGCG ACGCGATCTT CCGCCTGCCG GCGCCGATGA CTGAGCCGCT CACCATCAGC GTAACACTTG CTGCCGGCGC GCGCCCCGCA ACGCTGCCCG GCGAAACCTG CCTGTCGCTC GCCGCCGAAC CTGGCTCCTC GACCGATGAA GCGACGTTTA CCGCGCCGGT CTGCACCGTC CCCGGCGCCG AACCTCTCAC TATCACACTC AGCGCCGACC CGCGCAATCT GCCGCGCTCA CCGACCGGAC ATCTCCTGTT GCGGGTGCAA ACCCCACCCT TCATCCCCGC CCGCGACGAT CCCGCCAGCC ATGATCCGCG CCGACTTGGG GTTCAGATTG TCGCGTTAGC GGTGAGGAGC GCGCCCATTC GATAA
|
Protein sequence | MTIVILIFIV LTLILIGAII AHALPVACRA DDPLERYFEY ALIGALLNGW LAFTLAQIGA FSALLHAAII AVLCLIALRI GRRHAPNTPT TPADVWRRCI AWVRQCTALP LRERSTSLLL VVLLLVFALL VSRPFETIIG VRDAGVYANA GFIMARTGSL TFTDSVVAQI AADQQSSDPE IADAARQAET NFLGVQNPQR FIATRLRAAG FFIDQGDLAR GRVVPQGFHL FPAWIGLIAA FFGLRAGLLA PAVTGLLGIW SVAMLTRRLA GPWVALVAAL FLTLNAVQVW FSRYTTAETA AQFLVFAGLY AFAAAFGHSM ANVQRTMLLA LLAGLAFGQL ALTRIEFFLV LGPVALYLLH AWLARRWTLP HTALLAGIGA MVLHAGLHIA FLARAYFFDT LFARLQDFAL TAAFALPFLT PTLRQVYLLR PCSRLTMQPC PPIAGMPPTA DAPLNWTRIG IEALVVVVVV AALVAIRRLN LIARVSPLLV RLNRPLRLVA AIAIIGIGGY AYLIRPQILS LPVITALPAC LAPEQLTNPQ GACLTLQGYV GAPIATPAYV DPLAAWFDRA IGAVRGRAAP PLDACIALRR STLPPTADGR TIPEVLRDGL LDETDVPPEM LATLRACDRY VLRDLFGAAQ ANLVRLGWYL SPPGIALALI GLALLAYRAN STSWLFLVIA AVASVVFLRL TYGTSDQHYI YIMRRYVPHV YPAFAIGAAY AIARLSFNVQ PSTFHIPRST FSRLILTLVL VLFLVVTGRP IYRHTEYAGA LDQIGAMAGQ FDPGAIVLMR GGAPSFAQAR DIPDLLATPL TFAFGIDAFA LKSRDPGRYA PQLARYIRRW HDQGRPVYLA IGASGAIALP EWRLEPAGRL HVDLPEYEQP TDHKPSAVQR FTLDFALYRL LPDEPTPAEP PALTIAPDDY AYQVRGVYRA ERIGDRLIAW TDGDAIFRLP APMTEPLTIS VTLAAGARPA TLPGETCLSL AAEPGSSTDE ATFTAPVCTV PGAEPLTITL SADPRNLPRS PTGHLLLRVQ TPPFIPARDD PASHDPRRLG VQIVALAVRS APIR
|
| |
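The gene sequence above is printed in space-separated blocks of ten bases, so whitespace has to be stripped before any property is recomputed from it. The following is a minimal sketch of that cleanup, together with a GC fraction and a check for a terminal stop codon. The helper names are hypothetical, the stop codon set is the one used by translation table 11, and the short fragments in the demo are illustrative inputs, not the full gene.

```python
# Stop codons under translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Illustrative fragment: the first two blocks of the gene sequence.
    fragment = clean_sequence("GTGACGATTG TCATTCTCAT")
    print(len(fragment), gc_fraction(fragment))
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content field, and `ends_with_stop` against the Gene Length and Protein Length fields.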