Gene Rcas_3721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3721 
Symbol 
ID5541223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4878561 
End bp4880177 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content61% 
IMG OID640895832 
Productsingle-stranded nucleic acid binding R3H domain-containing protein 
Protein accessionYP_001433779 
Protein GI156743650 
COG category[S] Function unknown 
COG ID[COG3854] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0059695 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAATAC AACGGGATAT TACCAGCAAT ATCGAATTGC TTCTCGAGAC GCTTCCGCCG 
CCTATCCGTC AGGCAATTGA GGCGGGTCCC GACGATCAGG AGGACCTGCT GGAAGTGATT
ATGGATCTTG GGCGCCTTCC GGAAGCGCGC TACCGCAACC ACGAACGTTT TCTGAGCAAC
CACGAAGTGA CCCAGGAAGA TATCGACTAC GTGATTGCCC GCATCGGCGC CTTCGGTGAG
GACAACCGCG CCGGGATTCC GCGCACACTG CACCGCATTT CGGCAATTCG CAATCGCACG
GGGCGCGTGA TCGGGTTGAC ATGCCGCGTG GGGCGTGCGG TCTTTGGCAC AATTACCATT
CTGCGCGATC TGATCGAGTC TGGGAAGAGC GTGCTCCTTC TGGGTCGTCC GGGCGTCGGC
AAAACTACCA TGCTGCGTGA AACGGCGCGG GTGCTGGCAG AAGAGCTGCG CAAGCGCGTG
GTGATTGTCG ATACGTCGAA TGAGATTGCG GGCGATGGTG ATATTCCGCA TCCCGGCATC
GGTCGGGCGC GTCGTATGCA GGTGCCGCGC CCGTCGGAAC AGCATGCGGT GATGATCGAG
GCGGTCGAAA ATCATATGCC GGAAGTGATC GTCATCGATG AAATCGGCAC AGAACTCGAG
GCGTTGGCAG CGCGTACCAT TGCGGAACGT GGCGTGCAGT TGATCGGCAC GGCGCATGGT
CAGACGCTCG AAAACCTGCT GTCGAACCCA ACCCTTTCGG ACCTGATCGG CGGTATTCAG
GCAGTGACGC TTGGCGATGA GGAGGCGCGC CGTCGCGGGA CGCAGAAGAC GGTGCTCGAA
CGCAAGGCGC CGCCAACGTT CGATATCCTG GTCGAAATTC AGAATTGGGA TCAGGTGACG
GTCTATCCCG ATGTTGCAAG CGCCGTCGAT AACCTCTTGC GCGCCGAGTC GCCGCGCGCC
GAGGTGCGCC GCCGCGCTGC CGATGGTCAG ATTGAGGTCG TTTCGGTCTC CGCCGTCGAG
CAGCATATTC AGACGATGCC GGGCATGCGG CGTGGCGGCG GGCGTGAGCG CGGGGAGCGG
GGCGCGCGCT CGTCGGCGAT GCTCGCACCG ACGCAGGCTG GCATGGCGCC GCAGCGCATC
TATCCGTTTG GCGTCAGTCG CGATCGCCTG GAGCGGGCGA TTGCCACACT GCATGTGCCG
GCGACGATTG TGCGCGATAT GGGCGAAGCG ACGATGGTTA TGACGTTGAA AAATTACTAT
CGCCAGGGGG CGCAGCGCGT GCGTCAGGCG GAAGAGCGCG GTGTGCCGGT TTATGTGTTG
CGCAACAATA CCCTGGCGCA GATGGAGCGT CAACTGGCTG ATGTGTTCAA TATCAGTCTG
AACGACAATG GCGCCCAACG CTCTGCTGAG CGCGACGACG ATGAGGCGAT GACGGAAGCC
TTGCTGGAGG TTGAAACTGC GATTACGCAG GTGCTGAACG GTGAGCGTTC GTCGGTCGAA
TTACAGCCGC AGAGCAGCTA CGTTCGCCGT TTGCAGCATC AGATGGCGGA ACGGTACAAT
CTCCAGTCGG AAAGTCGCGG GCGCGAGCCG AATCGTCGGG TGAAAATCTT TCGTTAG
 
Protein sequence
MAIQRDITSN IELLLETLPP PIRQAIEAGP DDQEDLLEVI MDLGRLPEAR YRNHERFLSN 
HEVTQEDIDY VIARIGAFGE DNRAGIPRTL HRISAIRNRT GRVIGLTCRV GRAVFGTITI
LRDLIESGKS VLLLGRPGVG KTTMLRETAR VLAEELRKRV VIVDTSNEIA GDGDIPHPGI
GRARRMQVPR PSEQHAVMIE AVENHMPEVI VIDEIGTELE ALAARTIAER GVQLIGTAHG
QTLENLLSNP TLSDLIGGIQ AVTLGDEEAR RRGTQKTVLE RKAPPTFDIL VEIQNWDQVT
VYPDVASAVD NLLRAESPRA EVRRRAADGQ IEVVSVSAVE QHIQTMPGMR RGGGRERGER
GARSSAMLAP TQAGMAPQRI YPFGVSRDRL ERAIATLHVP ATIVRDMGEA TMVMTLKNYY
RQGAQRVRQA EERGVPVYVL RNNTLAQMER QLADVFNISL NDNGAQRSAE RDDDEAMTEA
LLEVETAITQ VLNGERSSVE LQPQSSYVRR LQHQMAERYN LQSESRGREP NRRVKIFR