Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1161 |
Symbol | |
ID | 5538627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1505758 |
End bp | 1508658 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640893293 |
Product | hypothetical protein |
Protein accession | YP_001431276 |
Protein GI | 156741147 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGTA TCGCCGAGCG CTGCGCGTCG TTCATTCCGC GTCCATCGAC GGTCACAGCG CTCGAACAGT CTGTGCGCGA CTGTGGTGGA GGGGTCGTGG CGCTGGATTG TCACGATGGC AGCGGCGCAA CGGCCCTGAT CTGCTGGCTT GCCGCCACAC GTCGTTGGGC ATTCTGGCTG CCCGAAGATG ATGCTGGCGG CGGTCTGGCG GCGCTCTGCG CCCAAATCCT GGCGCTCGCC GATCTGCCGA TCCCGCTGGT TCCGCCTATC GCCTCACGCG ACGCGCTGAC CCTCGAACGT CTGCTGGCAG AGGCGGCAGA GAAGCGACCG ACCGACGATC CGCTGGTCGT CGTCGTTGGG CGGATGCCGG GCGACGCCGA CGTGCCAGTA TCGCCTCCGC TGCCGGTCGA TCTTCCTCCT GGCGTTGTGG TGGTGCTACC GGCAACACCG GACACGCAAC AACCGGCGAT TGGCGCCTAT TTCGTTGATC GCGTGGCGCT GGAAGCGGAT GAAGCGTGGC TGGCAGACGT TGCTTTCCAC CTGAGTGGGA ATCGTCGTCT GGCAACGCTC CTGGCGTCAC GCAGCCAGGG ATCGCCGCTC TACCTTCGGC TGGTTATCGG GTTGCTGGCA GGACGGACGC TCGATATGAA ACGCCTTCCG CGCGGACTGA ACGATCTGCA CCAACGCTGG TGGGAAGGGT TGGATGGGCA GGGGCAGGCG CTGGCATGCG TCCTCACCGC TTCACGCGAT CCGTTGCCGG TATCGTTGGC GGCAACACTG ACCGGATGCG ACGAAGGGAC CATCCAACGG TTGGTGCGGC GTTGGGGGTC GCTCATTGAG CGTACCGATG CGGGCATTCG GCTCTATCAC CGCGCAACAC GCGCATTTAT CACAGCGACG TCGGGTGACA GACTGGCATC TGCTCATGCG CGTTTTGTCG AACTGACTCT CACCCGGTCG AACGGCGCGC CAGAGCGTCC GGGCTGGCAG GACGAGGCGG CGTTGAGCCA TGAATTTGCT CGCCATAGCG CGCTTGGCGG CGCCACAATG CAGGCGGCGA TCAGTCGTGT TGTCCAGCGC GCCTGGGTGC GAGCGCAGGA GCGGCGCACC GGCACATTGA GCGATGCCGC CGTCGATGGA TTGTGGGCGC TGCGCGCGGC AGTAGAGAAC GACTCGGCGC TCCAGATGGT GCGAGCCGCT GCACTGACCG GCGGGCTGGC CTTTCTGGCG CGCAGCCTGC CGCCCCGCGC GCCGGCCGAA GTGTTCGCCG CTGCCGTTGA TCACGGGCAA CCGCGCGAAG CAACGATGAA GCGCATCCGG GCAATGATCG ACCAGTTGCC GGATGGACGT GACAAAGCAC TGGCATTACG CAACCTTGGC GAAGTATGTT ATGCGCTGCG TATGCGCGCG CCGGCGATGC GGATGCTTTC GGAGGCGCTT GATCTCGAAG CGCCGGGTCC CACCCGCCAA TGGCGCGATC TGCACGAAGA AACCCTCGTC GCGCTGGCGC GCGCGGCAAT CGGCATCGAT GCACCGGACA CGGCATTGGG CATTACCACG CGCATCGTGC ATGCCGAGCG GCGTGGCATG ATCGAAACCG AAGTCGTGCG CTGGCTGATT GCGCATAGGC GGCTGAGTCG CGCCGAAGAG GTGGCGTATG CCATCGCGCA TCCAGGAAAT CACGATTGGG CAATGGCAGA AGTAGCGATA GCGCACGCAC GCGCCGGTGA TTACGAGCGC GCCAGTCTGG TGCTCGACAC CCTGCGCACC GCCACAGCGG TGGCATGGGT CACTGCCGAA TTGGCGTGCG ACGCTGCACG GCGCGGCAAC CCGCGCGCCG CGGATCGGGT GATGCTCCTG CCGAACCCGG CGTTGCGCGA CCGCGCGCTG GCGCAGGTAG CGCGCGCTTT GATCGTTCAG GTGGCGCCGG AGACGGCACT CGAAGTTGCA CGCCTGATCG ATGACCACGA AACCCGCGCC CGTTGCCTGA TCGACATGGC GCTCGCCCAT CCGCCGGTGG CGGCGCAGGC GCTCGATGAA GCTGCCACCG CAGCAATCGC CGTCGAAGGG GAGGAACGAG CGTCGGTGAT TGCGGCACTG GCGGCAGCCG AAGCCGCCAG CGGGCAGTTC GAGGTCGGCA TGCGCATGGC GGCGCTGCTG CCCGAAGGCG AAGAGCGAGA TCGCGCCCAT TCGCGCACTG CCATCGCTCT GGCGCGTCGT GGCGACTATA CAACGGCTGA GACCGTGGCG CTGACCATTG CAGATGAGGA CGAACGCGGT TGGGCGCTCG ACGAACTGGC GCGCATCCTG GCAGCGAACG GGCGCCACCG CGAGGCATTC GCGTTGGCGG CGCAAATGAG CGACGATGCA GCACGGGCGC GGCTCGAAGC CGATCTGGCG ATCGCCTGGG CGCGATCCGG CGCCGCAGTG GCAGCTCATG CGCGCGCCGA GCAGATCGCT GTCCCCACTG AGCGCGCGCG CGCGCAGGCA GCGATTGCCC AACCGCTGGT CGAATCCGGC GCGCGCGCGC GCGCCTTCGT CAGTATCGCT GATGTGTTGC CTCCCGATGT GCGTAGTCGC TACCTGCTTG CTGCGGCGCA GGCGCTTGCC GCGCACGGAT TGCCCGACGA TGCCGAAGAA GTGGCGCGAC TCATCCCGCG CCCGCTGGAG CGCGCGCGCG CGCTCGTCGC CACGGCGCAC GCCATCATTA AACAGCAGGA TGTCCAACGC GCGCATCGTC TCCTGGGTCA GGCATTTCTG ACCGTCGCTC CGCTTGGGCG CACGGAAACC CTTCAGTGCA TCGGATGGGC AGCCGATGCG CTGGCGCTGA TCGGCGGCGC CGAACTGTTG CTGACTGTTG CCGCTGCCCT TGATGACATC GATTCCTGGC TCGTTGGGTA G
|
Protein sequence | MQRIAERCAS FIPRPSTVTA LEQSVRDCGG GVVALDCHDG SGATALICWL AATRRWAFWL PEDDAGGGLA ALCAQILALA DLPIPLVPPI ASRDALTLER LLAEAAEKRP TDDPLVVVVG RMPGDADVPV SPPLPVDLPP GVVVVLPATP DTQQPAIGAY FVDRVALEAD EAWLADVAFH LSGNRRLATL LASRSQGSPL YLRLVIGLLA GRTLDMKRLP RGLNDLHQRW WEGLDGQGQA LACVLTASRD PLPVSLAATL TGCDEGTIQR LVRRWGSLIE RTDAGIRLYH RATRAFITAT SGDRLASAHA RFVELTLTRS NGAPERPGWQ DEAALSHEFA RHSALGGATM QAAISRVVQR AWVRAQERRT GTLSDAAVDG LWALRAAVEN DSALQMVRAA ALTGGLAFLA RSLPPRAPAE VFAAAVDHGQ PREATMKRIR AMIDQLPDGR DKALALRNLG EVCYALRMRA PAMRMLSEAL DLEAPGPTRQ WRDLHEETLV ALARAAIGID APDTALGITT RIVHAERRGM IETEVVRWLI AHRRLSRAEE VAYAIAHPGN HDWAMAEVAI AHARAGDYER ASLVLDTLRT ATAVAWVTAE LACDAARRGN PRAADRVMLL PNPALRDRAL AQVARALIVQ VAPETALEVA RLIDDHETRA RCLIDMALAH PPVAAQALDE AATAAIAVEG EERASVIAAL AAAEAASGQF EVGMRMAALL PEGEERDRAH SRTAIALARR GDYTTAETVA LTIADEDERG WALDELARIL AANGRHREAF ALAAQMSDDA ARARLEADLA IAWARSGAAV AAHARAEQIA VPTERARAQA AIAQPLVESG ARARAFVSIA DVLPPDVRSR YLLAAAQALA AHGLPDDAEE VARLIPRPLE RARALVATAH AIIKQQDVQR AHRLLGQAFL TVAPLGRTET LQCIGWAADA LALIGGAELL LTVAAALDDI DSWLVG
|
| |