Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3721 |
Symbol | |
ID | 5541223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4878561 |
End bp | 4880177 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895832 |
Product | single-stranded nucleic acid binding R3H domain-containing protein |
Protein accession | YP_001433779 |
Protein GI | 156743650 |
COG category | [S] Function unknown |
COG ID | [COG3854] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0059695 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAATAC AACGGGATAT TACCAGCAAT ATCGAATTGC TTCTCGAGAC GCTTCCGCCG CCTATCCGTC AGGCAATTGA GGCGGGTCCC GACGATCAGG AGGACCTGCT GGAAGTGATT ATGGATCTTG GGCGCCTTCC GGAAGCGCGC TACCGCAACC ACGAACGTTT TCTGAGCAAC CACGAAGTGA CCCAGGAAGA TATCGACTAC GTGATTGCCC GCATCGGCGC CTTCGGTGAG GACAACCGCG CCGGGATTCC GCGCACACTG CACCGCATTT CGGCAATTCG CAATCGCACG GGGCGCGTGA TCGGGTTGAC ATGCCGCGTG GGGCGTGCGG TCTTTGGCAC AATTACCATT CTGCGCGATC TGATCGAGTC TGGGAAGAGC GTGCTCCTTC TGGGTCGTCC GGGCGTCGGC AAAACTACCA TGCTGCGTGA AACGGCGCGG GTGCTGGCAG AAGAGCTGCG CAAGCGCGTG GTGATTGTCG ATACGTCGAA TGAGATTGCG GGCGATGGTG ATATTCCGCA TCCCGGCATC GGTCGGGCGC GTCGTATGCA GGTGCCGCGC CCGTCGGAAC AGCATGCGGT GATGATCGAG GCGGTCGAAA ATCATATGCC GGAAGTGATC GTCATCGATG AAATCGGCAC AGAACTCGAG GCGTTGGCAG CGCGTACCAT TGCGGAACGT GGCGTGCAGT TGATCGGCAC GGCGCATGGT CAGACGCTCG AAAACCTGCT GTCGAACCCA ACCCTTTCGG ACCTGATCGG CGGTATTCAG GCAGTGACGC TTGGCGATGA GGAGGCGCGC CGTCGCGGGA CGCAGAAGAC GGTGCTCGAA CGCAAGGCGC CGCCAACGTT CGATATCCTG GTCGAAATTC AGAATTGGGA TCAGGTGACG GTCTATCCCG ATGTTGCAAG CGCCGTCGAT AACCTCTTGC GCGCCGAGTC GCCGCGCGCC GAGGTGCGCC GCCGCGCTGC CGATGGTCAG ATTGAGGTCG TTTCGGTCTC CGCCGTCGAG CAGCATATTC AGACGATGCC GGGCATGCGG CGTGGCGGCG GGCGTGAGCG CGGGGAGCGG GGCGCGCGCT CGTCGGCGAT GCTCGCACCG ACGCAGGCTG GCATGGCGCC GCAGCGCATC TATCCGTTTG GCGTCAGTCG CGATCGCCTG GAGCGGGCGA TTGCCACACT GCATGTGCCG GCGACGATTG TGCGCGATAT GGGCGAAGCG ACGATGGTTA TGACGTTGAA AAATTACTAT CGCCAGGGGG CGCAGCGCGT GCGTCAGGCG GAAGAGCGCG GTGTGCCGGT TTATGTGTTG CGCAACAATA CCCTGGCGCA GATGGAGCGT CAACTGGCTG ATGTGTTCAA TATCAGTCTG AACGACAATG GCGCCCAACG CTCTGCTGAG CGCGACGACG ATGAGGCGAT GACGGAAGCC TTGCTGGAGG TTGAAACTGC GATTACGCAG GTGCTGAACG GTGAGCGTTC GTCGGTCGAA TTACAGCCGC AGAGCAGCTA CGTTCGCCGT TTGCAGCATC AGATGGCGGA ACGGTACAAT CTCCAGTCGG AAAGTCGCGG GCGCGAGCCG AATCGTCGGG TGAAAATCTT TCGTTAG
|
Protein sequence | MAIQRDITSN IELLLETLPP PIRQAIEAGP DDQEDLLEVI MDLGRLPEAR YRNHERFLSN HEVTQEDIDY VIARIGAFGE DNRAGIPRTL HRISAIRNRT GRVIGLTCRV GRAVFGTITI LRDLIESGKS VLLLGRPGVG KTTMLRETAR VLAEELRKRV VIVDTSNEIA GDGDIPHPGI GRARRMQVPR PSEQHAVMIE AVENHMPEVI VIDEIGTELE ALAARTIAER GVQLIGTAHG QTLENLLSNP TLSDLIGGIQ AVTLGDEEAR RRGTQKTVLE RKAPPTFDIL VEIQNWDQVT VYPDVASAVD NLLRAESPRA EVRRRAADGQ IEVVSVSAVE QHIQTMPGMR RGGGRERGER GARSSAMLAP TQAGMAPQRI YPFGVSRDRL ERAIATLHVP ATIVRDMGEA TMVMTLKNYY RQGAQRVRQA EERGVPVYVL RNNTLAQMER QLADVFNISL NDNGAQRSAE RDDDEAMTEA LLEVETAITQ VLNGERSSVE LQPQSSYVRR LQHQMAERYN LQSESRGREP NRRVKIFR
|
| |