Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3820 |
Symbol | |
ID | 5541323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4991048 |
End bp | 4992982 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895930 |
Product | hypothetical protein |
Protein accession | YP_001433876 |
Protein GI | 156743747 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02226] N-terminal double-transmembrane domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.095196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTTC TGACGCCGCT GGCGCTTCTC AGTGCACTTG TTGTGGGTCC GCTGATCGTG GCGATGTATC TGTTGAAACT TCGCCGCGAG GAACTGCGCG TCTCTTCGAC TTTCCTCTGG CAGCGCATGG TGCGCGATGT GGAGGCAAAC GCGCCCTGGC AGCGCCTGCG GCGCAACTGG CTGCTCTTCC TGCAACTGCT GCTGCTGCTC CTGTTGGCAA TCGCGCTGGC GCGACCCTTC TTGCTTACCA CCGGCATCAG CGGGCGTAAC CTGATTATCA TCATCGATCG CTCGGCAAGT ATGGCGGCGA CGGACGTCCC CCCCTCGCGG CTCGAAGCGG CGCGCCGCCA GGCGCAGACG CTGGTCGATC AGTTGCCCGA AGGCGGGCGC GCGACGATTA TTGCCATCGG CGGGCAGATG GACGTGCTTG CCGCTTCAAC GACGGATCGC CGCCAGATGT ATGATGCCAT TCGCGCGACG ACGCTCAGCA TTGGTGGTCG TGGCGATTTG TCGCAAGCGC TGGCGCTTGC CACCGCTCTC GCGGCGCGTG AACCGGATAG CGAGGTTGCC ATCATTTCCG ACGGCAATGT CGAGACTCCA ACCGACATCC GTGTTCCGGC GACGGTGCGC TATTTTCCCA TCGGTCAACG CGCGGAGAAT GTCGCTATCA GCGCTATGGC GCTGCAACCG ACACCCGCCG GACAGACGCT GTTTGTTCAG GTCTCTGGCT ATGGCCCGGC GCCGGTTTCG CGGCGGCTTG ACCTCTACCT CGATGGCGCA CTGTTCAATG CATACGAACT CAACCTCGGA CCAGACGGCA CTCCAGACGC TGTCCAGACG GTGATCGTCG ATATTCCTGC TCAGGCGCGC GTTGCCGAGG CGCGACTCAG TCCGGCGCCC AATGACGATT TCTTGCCCTC CGATGATCGG GCATGGGCAG TAAGTTCGAC GGGCGCAGGC ATGGAGGTGC GTATTGTTGG TCCTGGCAAC CGCTTCCTCG AAACGGCGCT CTCGTTGTTG CCCGGCATCA CTGCCACCAA AACAACGACC ACGACGGTTT CTGGCGATAC TGCACCACAG GTGACAATCT TCGATCGGGT TGTGCCGGAA GCGCTGCCGA CCGGCAATCT GTTGTTCATT GCTCCGATGC GCTCGACCCC CCTCTTTTCT GTGACCGGCA TGGTTGAATT TCCGCTGCTG CGCCCGGCGC CGATCGTAAT CGAAGGGCAA GCGCCGCCAC TGCTGCGGAA TGTCAGTGTG AGCGAGGTGA ATGTGCTGCG CGCGATGCGC ATCGAGACAG GCGTGTGGGC GCGCGCGCTG GTCGAAGGAG ATGGCAGCCC AATGCTCCTG GCGGGGGAAC GCGAGGGGCG ACGCATTGTT ATCCTGGCAT TTGCGTTGCA AGACTCCGAT CTGCCGCTTC AGGTTGCCTT TCCGCTGTTG ATCTCGAATA TCATCGGGTA TCTCGCGCCG GGAAGCGGTC TGGAAGCATC GCAGATCGCT CCCGGGCAAC CGCTGGTCGT GGCAGTTGAT CCCGCTGCCA CAGCGGTGCG TGTCGTTCGT CCCGATGGGC GCGTCGATGC GGCACAGATT CAGGGTGGGC AGGCAATCTA TGCCGATACT GATGCGCTCG GACCGTACCT CATCGAGCAG GTGCGCGATA ATCAGGCAGT CGAGCAGCGG CGTTTCGCTA TCAATCTGTT TGCGCCGGAG GAGTCGCGCA TTGCACCGTC AGGTGAGTTA CGCGTGCCAC AAGTCAGTGG TTTGCAACAG GCGGTGACCC GCGAGCAGGT GGGACGACAG GAACTCTGGC GCTGGCTGGC GGCTGCGGCA ATCCTGATCG TTCTTATCGA ATGGCTGGTG TACCAGCGCA GCAGTCTGGC GTACCTGCGG CAGCGCGTCC GTCTTGCGCT CGCAGCGCGT CGCCATCCGG CGTAG
|
Protein sequence | MSFLTPLALL SALVVGPLIV AMYLLKLRRE ELRVSSTFLW QRMVRDVEAN APWQRLRRNW LLFLQLLLLL LLAIALARPF LLTTGISGRN LIIIIDRSAS MAATDVPPSR LEAARRQAQT LVDQLPEGGR ATIIAIGGQM DVLAASTTDR RQMYDAIRAT TLSIGGRGDL SQALALATAL AAREPDSEVA IISDGNVETP TDIRVPATVR YFPIGQRAEN VAISAMALQP TPAGQTLFVQ VSGYGPAPVS RRLDLYLDGA LFNAYELNLG PDGTPDAVQT VIVDIPAQAR VAEARLSPAP NDDFLPSDDR AWAVSSTGAG MEVRIVGPGN RFLETALSLL PGITATKTTT TTVSGDTAPQ VTIFDRVVPE ALPTGNLLFI APMRSTPLFS VTGMVEFPLL RPAPIVIEGQ APPLLRNVSV SEVNVLRAMR IETGVWARAL VEGDGSPMLL AGEREGRRIV ILAFALQDSD LPLQVAFPLL ISNIIGYLAP GSGLEASQIA PGQPLVVAVD PAATAVRVVR PDGRVDAAQI QGGQAIYADT DALGPYLIEQ VRDNQAVEQR RFAINLFAPE ESRIAPSGEL RVPQVSGLQQ AVTREQVGRQ ELWRWLAAAA ILIVLIEWLV YQRSSLAYLR QRVRLALAAR RHPA
|
| |