Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3443 |
Symbol | |
ID | 5540942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4495036 |
End bp | 4497003 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640895561 |
Product | hypothetical protein |
Protein accession | YP_001433511 |
Protein GI | 156743382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.047188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0829627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGGA TGAAAACACA GAACAATACA GTCCGGCATC CAGCATTCTG GCTGGCAGTC ATATGGCTGC TGACACTGCC AGCGCTGTGG CACAGCCTTA ATGCCGCGAC CGAACTGCGC GTGGATGTTG GCGAATGGGG CGACCACACC ATACTGCACG GCGCACATGG CAGGGAAGCC AATCCATTCG AGAATTATCG TTGGACAGAA AAGCGCGCCA CGATCACATT CCCGAACCTC AGTCCTCGCT ATCGCATTCT GCAAATACGC ACCCATGGCT GGCGACCAGA AGGAATACCG TCCCCATGGG TGCAAATCAC GATAGCCGAA CAAAGGCAGG TCGCATTTCA GACTGAACGC GCATTGCGCA CATATACGAC GCTGCTGCCA GACGCCGCAT TGAGTCCAAC GATTGAGGCA TCGCTGACCA GCGAGACATA CACTCCCGCC GAAGACACGC GCGCCCTTGG GGTCGCAATC GACTGGATCG CGTTGTACGC TCTCGACACG CCAGGCAGCC TGGCGAACGG TCAATTCATC GGTCAGGCGT TATTGCTCGG ACTGACCCTG ACACTGATTG CACTGCTTGC ATTGCCAGGA GCGGTCACTG TGGCGTGCGG TCTGCTGGCG TCTACGACGC TGGTCGGATT GAACATTGTT GAACCATTAT GGGTAGGGTT GGGGTTGATC CCATGGCTAT TCATTGTTGC ATTGCTGACT GGCGCAACGT GGTTCATTGC ACCATGGCTG ATGCGCATGC TGGAGCGCAT GCCCAATACC GGGCTTCAGC AGCGTGTATC TTCAAAGACC TGGATCACCC GCATGCAGGC GCGGATCGCC TGGGCGCTGC TCGTCGCTGC GCTGATCGTG CGGCTGGCTG GCGCAGCGCA TCCGCTCTTT GACGCGCGCG ATGTCCATGT CCATACCCGT TGGCAGAAAA CCGTCGCCGG CGGGCAGTTG TTCTTCTATT CGACCCCAGC CGAATTTCAG AACCGACAGA CCTTCAATCC GCCTGCCGCC TATATCGTTC TGCTGCCGCT CTACCTGGCG CTCGGCGACG CGCGGCTGAC CGTACAGGCT GGCGTTGCGC TGCTTGATGG GCTGATCGCC CTGGCGCTGC TCCTGATCGC GCACGAATTC GGACTTTCGG CGCGAGCCGG GCTGTTCGCA ATGGCACTGT ATGTCGCACT CCCGATCTCA ATGACCATGA TGTGGTGGGG ATTCGCCGCG AATGCGATAG CACAGGTGTG GTGGGTGCTG CTCCTCTGGC TGCTGCTGCG CCTGACACGC GCACCGGACC GGTCACTCTT TGCGCTCGTT ACAGTGGTTG CGATCCTGTG TCTGACAACC CACATCGGCG CACTGGTGAC GCTCGCGGCA TTCCTGGGAC TGATCACATT GATTGGATGG TGCGTCCTGC CGAGCAATGG ATGGCGCGCC ATGGTGGCAG GGCTGTTGCT TGCAGGGGTG TTCGCTGCGC CCATGTACTT CATCCCTGCT GCTGCGCCAC TGGTGAACGC CCCTCGCAGC CCGACAACGC TCGATCCGAT CGCATCGTTT ATAGACAGTC TGGCGCTGTG GCCGGAGCGT GTCGATCTGG TGCAGCGCGC GCTGACCCTT GGGTTTCTGT CGCCAATCCT GGCGCTGGCG GCGGTTGGAT TGCCGCTGTT GTTCACAGCG CGCCGGCGGC ATCCCCTTCA GCGCACCCTG CTCCTGTCAA CGCTGATCGT CTGTGTCGTC TTCTTTCTCT CGTATGTCTT TCTCCAACTG CTGACGCGCT ATATCTACTT TGCGACGCCA CTTGTCTGCC TGGCGGCTGG CGCCACCCTT GCGCGCCTGG CAGTGCGCCC CGGCGGACGC TGGATGACGT ATAGCCTGAC GCTGCTGGTG GTCTGGAGCG GCGTTGCCCT CTGGTTTGGC GGTGTTTTGC TGCGGATCAA ACCGTCGCTG GTTCCATTGA CACAGTAG
|
Protein sequence | MSRMKTQNNT VRHPAFWLAV IWLLTLPALW HSLNAATELR VDVGEWGDHT ILHGAHGREA NPFENYRWTE KRATITFPNL SPRYRILQIR THGWRPEGIP SPWVQITIAE QRQVAFQTER ALRTYTTLLP DAALSPTIEA SLTSETYTPA EDTRALGVAI DWIALYALDT PGSLANGQFI GQALLLGLTL TLIALLALPG AVTVACGLLA STTLVGLNIV EPLWVGLGLI PWLFIVALLT GATWFIAPWL MRMLERMPNT GLQQRVSSKT WITRMQARIA WALLVAALIV RLAGAAHPLF DARDVHVHTR WQKTVAGGQL FFYSTPAEFQ NRQTFNPPAA YIVLLPLYLA LGDARLTVQA GVALLDGLIA LALLLIAHEF GLSARAGLFA MALYVALPIS MTMMWWGFAA NAIAQVWWVL LLWLLLRLTR APDRSLFALV TVVAILCLTT HIGALVTLAA FLGLITLIGW CVLPSNGWRA MVAGLLLAGV FAAPMYFIPA AAPLVNAPRS PTTLDPIASF IDSLALWPER VDLVQRALTL GFLSPILALA AVGLPLLFTA RRRHPLQRTL LLSTLIVCVV FFLSYVFLQL LTRYIYFATP LVCLAAGATL ARLAVRPGGR WMTYSLTLLV VWSGVALWFG GVLLRIKPSL VPLTQ
|
| |