Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2521 |
Symbol | |
ID | 5540003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3252091 |
End bp | 3254931 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894652 |
Product | FHA domain-containing protein |
Protein accession | YP_001432619 |
Protein GI | 156742490 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1716] FOG: FHA domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.295094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCAC GACTGACCGG TATGCGCGGT CCGCTGACCG GACGCACTTT TGACATCGGC GATCAACCGC TCACCATCGG GCGCGCTGCT GACAATCATA TCGCCATCGC CAGTCCGCGC GCCTCGCGCC ATCACGCGCA GATTCGGCGC GAGGGCGCGT CGTTTGTCCT CTACGACCTC GGCAGCGCCA ACGGCACGCT CGTCAACGGG CAGCGCGTGC AGCGCGCCGT GTTGCAGCCC GGTGACCTGA TCGACATTGG TGATGAGGTC TTTCGATTCG AGGCTTCGTA TCAACAGGAT GCAACCGTGT TGAGCGCACC TCCACCACAT GCGTACCCGT CGCAACCTGC CGCGCCCGCT TACCCGCCAC CGTCTCCGGC GGCCCCGCCG CACCCGGCTG CACCGATACA CCCGCCTCAA TACCAGTCTC CGGCAGCTCC GCCGCCACAA CCGGCCTACC CGCCGCAACC GGTCTACCCG TCCCAGCCGC CTCCCATTTC GCCCCAGATG CCGCCTGCTG CTCAACCACC GATGTATGCG CCGCCGCGTT CCGGGAAAGG CGGTCGCTCC TGCCTGCTGG CAGTGCTGCT GCTGCTGGCG CTGGTCTGTG TGGCCGGCGC TGCGGGCGCT TTCATCTTCC GCGACCGCCT GCGCGACGTT CCCGGTATTG GTAATCTGCC GGGCATCGGC GGCGGCCCAA CCATGACGCG CCCGCCCATC CAGACCGGCA GCGCTGCGGT CGGCAGCGGA CAGGCTGCCG CCATTGCTAT GCCCGGCGGC GGACCGATGA TTGAAGTGCC ACCGGGCGCT GTGCCGACCA ACCCTGATGG CTCCCCCGGC ATGTTGACCT TCTCAGTCGC GCCCGCCCCC GACCAACCGG TCAACCTGTC GCCCGATATG GCGCTCAGTG GATCGCTCTA CCAATTCGAG CCGGAAGACG TGACCTTCGC CGTACCGGTG CGCATCACGT TGCCGATCCC CGCCGGAACC GACCCGGCGC GGGTGATGGG ACTGATCACG CGCGATCCGC AGAGCGGCGC ATGGGCGCCG GTCGCAGGAG TAGTCGACTT TGCCGCGCGC ACGGTCAGCG CCGATGTGAC CCACTTCTCG CCCTACGGTG TGTACAGCTA CACTGGCAGC GATCTTGACG CCTGGTATCG CGCCAACGGT GGCTGGTTCG TGATCGAAAA TCAATTGCTG AGCGGCGATA AGCCTTATCC AGGGTGCCGT AACCTGCCGC GCGCGCTGTA CGTCAACGTC TGCATTCAAC AGGCGAATCC AGGTGATCCG GGTCTCTCCT ACCTGTTGCC CGCCGACAAT CTGCTGGCGC GGGGACCACG CAATGACTTC GGCGCGCCCT ATCGCCCGCT GAAAACGTGG CTTCCTGCCG GCACGTACCG GGTTGTCCAC TATGTATTCA TGAGCGAGAT CAACACCGAT CCCATGTATG TGCCGTGCTT CGGCTGGTGG GTCAAACCGC CACAGATAAT CAACCTGAAG GCAGGGCAGA CGGTTACTTT CGGGCCTTTC AGTGAGCATG ACGGGACCTC CATGACCTCG TTCGATGTGA AGACCTGCAC CGGCATGCCT GCGTCAGGGA CGCCCGTGGT GGGGCAACCG ACGTCGCAAC CGCCGCCGCA ACCGCCTGTG GAAACGTCCG GCGTCTGCCC GGCGAAGATG AATGGCGAGT GGGATGCCAA CCTGACACTG CGTGAAACGA ACGATCCAGA TTTGCAGGAC GAGATTGGCA ACGTGGACAC GGGTATTTTT GTCTTTCAGA TCAACGGAAA CGATGCGCAG GTCCAGATTG TCGAACCAGA CGGTGCGCGC AGTGACCCTG CCAGCGGATC GTGCAGTGTG CAAAACGGGC GCTTTGTCAT CACAATATCG GAAACAAACA GCAACGCAGG GCTTGTGTTC AAACTCCAGT TCAACGGAGA TGATCGGATG ACCGGCGAAG TCACCGTTTC AGAAGGAGAG AAATACGCCA CCGGTGACAT CGATATGATC CGCCGAACCG GATCAGGGGA GTCATCTCCC GGCGATCAGC CTGTGGTCTG CACCCAGGAA GTCGAGATCA TCAACAACAA GTATCTCGGC GCGTGCGGCG TTAGTGATAC ACAGACGTTC GAGCTGGCCC AGCGTGCATT TGTCGCTCGC ATTCGAGTCT GGCACAATCC AGAGATCACC GAGACTGATA CGCCGTATGT GACGATTACC GGTCCCGATG GCTATAACTT CTCAGGAAAC ACGGCAAAAG GCGGTTGTTA CGCTGGTTGG TGCGAGGCGA TGGTCTCGCT CAATCAATAC CTGGACCCAG GAACCTACCA GTTGTCCATT CCGACTGCTT CGATCTGCGC AGATCCAAGC GGGAAGACCA CCCTGATCCT CTATGGATGC TTTATGCCCG GGCAATCCTC GGATTTTACC CCCGGTTGTG CGGCGATGAC TGGCGTCTGG AACACGACGA TGATGCTGCG TTCGTCGACC AACTCCAACG TTCCGACAGG AGGAACACGA CAGGGGGTGT TGACCCTGCG CGTCGAGGGC GATTCGGCTG AGGTGCAATG GACAGAGGGC GGCAGAAGCA GCCCGGTAGT TGGTGGAACG TGCGCAGTCC AGGGTGATCG ATATCAGATC ACTCTGAATC AGCAGGTGGA GGCGTTGCTC CCGCCCGGAG CGCCCCTGCC GCCGCCAAAT GCGCAGGCGC CGCTCGAAGA AGTCTTATTC CCCGTTACCT TTGATCTTCA ATTTGAAGGC GGTGATCGCC TGACAGGCAT CGTCACGTCA CGGGCTGATA GTTACGAATA TAAGAGTGAT GTTGTGATGT CGCGCCGATA A
|
Protein sequence | MTSRLTGMRG PLTGRTFDIG DQPLTIGRAA DNHIAIASPR ASRHHAQIRR EGASFVLYDL GSANGTLVNG QRVQRAVLQP GDLIDIGDEV FRFEASYQQD ATVLSAPPPH AYPSQPAAPA YPPPSPAAPP HPAAPIHPPQ YQSPAAPPPQ PAYPPQPVYP SQPPPISPQM PPAAQPPMYA PPRSGKGGRS CLLAVLLLLA LVCVAGAAGA FIFRDRLRDV PGIGNLPGIG GGPTMTRPPI QTGSAAVGSG QAAAIAMPGG GPMIEVPPGA VPTNPDGSPG MLTFSVAPAP DQPVNLSPDM ALSGSLYQFE PEDVTFAVPV RITLPIPAGT DPARVMGLIT RDPQSGAWAP VAGVVDFAAR TVSADVTHFS PYGVYSYTGS DLDAWYRANG GWFVIENQLL SGDKPYPGCR NLPRALYVNV CIQQANPGDP GLSYLLPADN LLARGPRNDF GAPYRPLKTW LPAGTYRVVH YVFMSEINTD PMYVPCFGWW VKPPQIINLK AGQTVTFGPF SEHDGTSMTS FDVKTCTGMP ASGTPVVGQP TSQPPPQPPV ETSGVCPAKM NGEWDANLTL RETNDPDLQD EIGNVDTGIF VFQINGNDAQ VQIVEPDGAR SDPASGSCSV QNGRFVITIS ETNSNAGLVF KLQFNGDDRM TGEVTVSEGE KYATGDIDMI RRTGSGESSP GDQPVVCTQE VEIINNKYLG ACGVSDTQTF ELAQRAFVAR IRVWHNPEIT ETDTPYVTIT GPDGYNFSGN TAKGGCYAGW CEAMVSLNQY LDPGTYQLSI PTASICADPS GKTTLILYGC FMPGQSSDFT PGCAAMTGVW NTTMMLRSST NSNVPTGGTR QGVLTLRVEG DSAEVQWTEG GRSSPVVGGT CAVQGDRYQI TLNQQVEALL PPGAPLPPPN AQAPLEEVLF PVTFDLQFEG GDRLTGIVTS RADSYEYKSD VVMSRR
|
| |