Gene Information |
Locus tag | Rcas_3069 |
Symbol | |
ID | 5540565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3973705 |
End bp | 3976053 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895188 |
Product | hypothetical protein |
Protein accession | YP_001433141 |
Protein GI | 156743012 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
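The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rcas_3069
length_bp = gene_length(3973705, 3976053)
print(length_bp)                  # 2349
print(protein_length(length_bp))  # 782
```

Under these assumptions both results agree with the Gene Length and Protein Length fields in the record.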
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACACG ACAGCCATTC AATCTACACC GACGCCGATT TTATCAAGGA CGATCTGCGG CGCTGCGATC TGGTGATGAA AGGCGGCATC ACCAGCGGCG TAGTCTATCC ACCGGCGATC ATCGAGCTTG CAACGCGCTA CCGGTTTGTC AACATCGGCG GCGCCTCGGC GGGCGCGATT GCAGCGGCAG CGGCAGCGGC GGCAGAGTAT GGGCGCGCCG TCCCCTATGC CGGGCATCGC AGCGGTTTTC AACGGCTCGA CCGATTGCGC GCCTGGCTTG GCGAGGGTGA AGGCAATCTG GCGGGGCTAT TCCAGCCATT CTCCCGCATG AAACCTCTGT TTCACGCGCT GTTCGACCTG GTGATCGCTT CCAGAGCAGC GCCCCCAAAA CGCCCATCTG CATCCGGGAG ATCGCCGTCT GTGTTTCGTC TTCCCGTTGC CGCGATCCGG TTTGTCTGGC GAGCGCTTTC CTTCTTCGTC CGCATTACTC GATTGCTGGC GCGCCATCAT CCATCGATAA CACTGGGGAT CGGCGTTGCT ATCGCACTGA TCGGTGCGGC GATCTGGATT CTCCCACCCT TCCTGAGCGA CTCGCCGGTG CAGCCGTTGG TCGTGATCGT GGGGGCAGTA CTGGCGATTC TTTCTGGTTT GATCGTCGGA ACACTGGCGG GGGCGACGCA CCTCGCGTGG ATTGCCGTTA CTGAACTGCC GCGCCATCTG TTTGGGTTGT GCAGCGGACA TACGGACGGC GCAACACCTG AAGGATGGCC GCCAACGCTG GGAGATGCCA GGGGAGGCGG CAAACCGCCC GCATTGACCG ATTGGCTTCA TGCGGTCATC AACGGACTGG CAGGTCGCGA CGCCGATCAA CCGCCGCTGA CCTTCGGCGA ACTGGCAAAC ACACCAGACG GACGGACGAT TTCACTACGC ATGATGACCA GCGACTTGAG CGAACACATG CCGTATGTGA TCCCTAAAGA CCTGGGGCGT TTTCTCTTCG ATCCGACCGA ATTCGCGCGC CTGTTCCCGA AGGTTGTTGT TGAACATATG CGCGCGCGAA GCATAGCCGC AGCGTACCAG GTTCTGGATG AGCGCGGCGC AGTGCGAATG CTGCTGCCCC TGCCCGCCTG GCGCGATCTG CCGGTCGTCG TTGGTGCGCG TATGAGCCTG AGTTTTCCGC TCCTGATTGC CGCCGTGCCG CTCTACACCA TCAGCGTTGC CGGGCAACGC GAAGCGAGCG CCGGAAGGAC GTTGCGCGTA GAACACCTTC AGCGCCATAT TTTCAGCGAT GGCGGGATTG CCAGCAACTT CCCGATCCAT TTTTTCGACC GCTGGCTGCC AACCCATCCG ACGTTTGGCA TCAATCTGGT GCAATTGCCC ACCGATGAAG CCGACAACGA ACGGTTTCTC CAGGCGCTGC TGACAAACAA TAGCGCAGAG TCGTCCGACG AAACGCTGAA ACCGGAGATT ATTGTCAAGC AGGCATACCT GGGACGCGCC GACTTTCGCG CCGCAGCGCC GGAACCACCG CCTGCCCGCG CCGCGACACC GTTCAGCGAT CCACAAGGAG AGGTCTATCT GCCGCCTGCC GGACCGGGTG AAGATTATGT CGAATGGCAG ACGATTGACA CTCTTCCGGC ATTCTTCCGG TCGATCTTCG GCACAGCGCA GAGTTACCGC GATACGGCAC AGGCGCGCCT GCCAGGGTAC AACGAGCGCG TGGTGCGGGT GCGCCTGCGA CCGGAGGAAG GCGGTCTCAA TCTACGCATG TCGCCACGGA TCATCGCCAA AATCGAGCGC 
AAAGGGCGAC TCGCCGGGCG CGCATTGCTG CCGCAGGAAC CCGAAACTGC CGGAAGAAGG CGCGGCGGCT TCCGTTTCGA CGATCATCGA TGGGTGCGCC TCGTGACGCT CCTTTCGGAG ATTGATCAGC AACTCCGCGA TATGCGCACC GCCTACGATG ATGTGCAGGC AGAGTACCCC ACGTTTTTGC GTCATGCGCT GACGAACGAC CATCTGCCGG TTTGCCCGCC CTACTACGCC GGCAGCGCCG AAGAGCGCGC ACTCCTTGCG CGGCGCATCG AGGCGCTGAT TGCGCTCTAC ACGCTGTGGA GCGATCTGGA CGAAGACACA ACAGCGCGCC TTAATACAAT CCTGCTGCGC ATGAACAAAC GGACGCTCCA GAACCTGGCG CAACTCCTCG AACAACCAGA CGATCAGGAA GCGTTGAACA GCCTGCACGA CTGGCTTGGC GCCCTGCACG CCGAGAAAGT GCGCGTCGCA AAAGTGGCAG AACGCGAGGC GTCTCCGGTT GAGGATATGA TGGACCTGCG GGTCATGCCG ACGGTGTGA
|
Protein sequence | MTHDSHSIYT DADFIKDDLR RCDLVMKGGI TSGVVYPPAI IELATRYRFV NIGGASAGAI AAAAAAAAEY GRAVPYAGHR SGFQRLDRLR AWLGEGEGNL AGLFQPFSRM KPLFHALFDL VIASRAAPPK RPSASGRSPS VFRLPVAAIR FVWRALSFFV RITRLLARHH PSITLGIGVA IALIGAAIWI LPPFLSDSPV QPLVVIVGAV LAILSGLIVG TLAGATHLAW IAVTELPRHL FGLCSGHTDG ATPEGWPPTL GDARGGGKPP ALTDWLHAVI NGLAGRDADQ PPLTFGELAN TPDGRTISLR MMTSDLSEHM PYVIPKDLGR FLFDPTEFAR LFPKVVVEHM RARSIAAAYQ VLDERGAVRM LLPLPAWRDL PVVVGARMSL SFPLLIAAVP LYTISVAGQR EASAGRTLRV EHLQRHIFSD GGIASNFPIH FFDRWLPTHP TFGINLVQLP TDEADNERFL QALLTNNSAE SSDETLKPEI IVKQAYLGRA DFRAAAPEPP PARAATPFSD PQGEVYLPPA GPGEDYVEWQ TIDTLPAFFR SIFGTAQSYR DTAQARLPGY NERVVRVRLR PEEGGLNLRM SPRIIAKIER KGRLAGRALL PQEPETAGRR RGGFRFDDHR WVRLVTLLSE IDQQLRDMRT AYDDVQAEYP TFLRHALTND HLPVCPPYYA GSAEERALLA RRIEALIALY TLWSDLDEDT TARLNTILLR MNKRTLQNLA QLLEQPDDQE ALNSLHDWLG ALHAEKVRVA KVAEREASPV EDMMDLRVMP TV
|
| |
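The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for stripping that display formatting and computing GC content follows. The helpers `clean_sequence` and `gc_fraction` are hypothetical names introduced here, and the short input strings are illustrative only, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used as a small example
blocks = "ATGACACACG ACAGCCATTC"
seq = clean_sequence(blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # fraction of G/C in these 20 bases only
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field in the record, assuming that field is computed over the same span.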