Gene Rcas_4287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4287 
Symbol 
ID5541798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5528871 
End bp5530622 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content64% 
IMG OID640896394 
Productfibronectin-binding A domain-containing protein 
Protein accessionYP_001434332 
Protein GI156744203 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0863996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTCG ATGCCCTGAC TCTTGCCGCT GTCGTCGATG AATTGCGCGC AACCATTGTT 
GGCGGTCGCG TCCAGCGTGT GTTGCTGCCC GGTGCGCTGA GCGTCGCGCT TGAAATCTAT
ACCGGTCGCC GCTGCTACCT GCTTCTCTCC GCCCATCCGC AGTTTGCGCG CGTCCAGTTG
AGCCCGGTGC GCATCTCACG CGGCACGGAT GCGACGCCGC CATTGTTGCT GTTGTTGCGC
AAGTATGTCA ATCGCGGGCG GATCACTGCC GTTGAACAGC CCGATCTGGA GCGGGTGTTG
CTGCTAAGTA TAGCCAAACG TCCCTTCTCG CGCAACTCCG ATCACCCAGC GGATCTCGAC
GATGATGACA CGCCGCCGGA GGACACCGCG CCGGAGGAGG AAACGCTGCG TTGTGAGTTG
ATTGTCGAAA TCATGGAACA GCGCAGTAAT ATTGTGCTGG TTGGCGACGA CAATATCATT
CTGGCGGCAG CGCGGCATGT GACGCCGCGC ATGAGCCGCC GACCTGTGCT GCCGCGTGAA
CCATACGAAC TCCCGCCGCC GCAGATCAAA TACGACCCAC GGCAGGCGAC GGCGGCTGAA
CTGCGCGCCG CCATCCCGGA TGGGCAACCT GATCTGGCGC GTGCTCTGGT GAGCGCCTAT
CGCGGTTTGT CGCCGCTGGC GGCGCGCGAA GCGGTGTATC GGGTGATGGG ACGCACCCTT
GTGCCTACCG GTCCCGATCT GCCATGGGAC GCGCTCGCCG ACGCGCTGTG CGCGTTGTGG
CACGCTTCCT GGTCGCCGCA TCTGGTGGTC GATGAGCGTG GCCCGATCGC GTTTGCGCCA
TATGAAATCA CACATCTGGC AGGGGCGCGC CCCTACGCCT CGATGAGCGC CGCACTCGAT
GCATACTACG CGGCGCGTGA GCGCCTGACG GCGCACCAGC AACGCCGCGA CGCGCTGCGC
GAGCAGTTGC ATGATACGCG CGAGCGCCTG GAACGGCAGC GATCCGCTTT GCACGCCGAG
TTGCAGCGTG CAGCCGACTT CGAGCGATTG CGCTGGGAGG GAGAAATGAT CTTCGCCTTC
CTCCATACGC TGACGCCGGG GCAGGAGCAT CTTGACGTCG AAGGACGCAC CATTACGCTC
GACCCCCATA AGTCGCCGGT CGAGTCCGCT CAGGAGCGAT TTCGCGCCTA TGACAAAGCC
AAAAGCGCGC TTACGGGAGT CCCCGAACGA TTGCGCGCCG TCGAGTTGCG TCTGGCAGGT
CTTGACGAAA CGCTGGCGCT GCTGGACGTA GCGGAGCGGT TCGAGGAGAT TGAGGCGATT
GCGCGTGAAG CGGAAGCAGA AGGGTATCTT GGACCCGCAT CCACAGAACG CACCCGGAAG
CGCCAGGCGC GTCCTATGCC TCCCCTGCGT CTTGAGTCGA GTGACGGATT CACGATCTAT
ATCGGGCGAA CCGCGCAGCA GAACGAGCAG GTCACCTTCC GTCTCGGCGC GCCCGATGAT
CTCTGGCTGC ACGCGCGCGG CGCGCCTGGC GCCCACGTCA TTATCAAAAG CGGCGGACGG
GAGGTTCCAG AACGCACAAT TGAGGAAGCA GCGGCGCTGG CGGCATACTA CAGCGCTCTG
CGTTCATCCT CGAGCGTTGA TGTCGAGATT GCGCGACGCC GCCACGTGCG CAAGGTGCGT
GGCGGACCGG CAGGTCTGGT GACGTACCGG GCGGAGCGGA GCGTGCGGGT GGCGCCGCGA
CCGCCATGGT AG
 
Protein sequence
MYFDALTLAA VVDELRATIV GGRVQRVLLP GALSVALEIY TGRRCYLLLS AHPQFARVQL 
SPVRISRGTD ATPPLLLLLR KYVNRGRITA VEQPDLERVL LLSIAKRPFS RNSDHPADLD
DDDTPPEDTA PEEETLRCEL IVEIMEQRSN IVLVGDDNII LAAARHVTPR MSRRPVLPRE
PYELPPPQIK YDPRQATAAE LRAAIPDGQP DLARALVSAY RGLSPLAARE AVYRVMGRTL
VPTGPDLPWD ALADALCALW HASWSPHLVV DERGPIAFAP YEITHLAGAR PYASMSAALD
AYYAARERLT AHQQRRDALR EQLHDTRERL ERQRSALHAE LQRAADFERL RWEGEMIFAF
LHTLTPGQEH LDVEGRTITL DPHKSPVESA QERFRAYDKA KSALTGVPER LRAVELRLAG
LDETLALLDV AERFEEIEAI AREAEAEGYL GPASTERTRK RQARPMPPLR LESSDGFTIY
IGRTAQQNEQ VTFRLGAPDD LWLHARGAPG AHVIIKSGGR EVPERTIEEA AALAAYYSAL
RSSSSVDVEI ARRRHVRKVR GGPAGLVTYR AERSVRVAPR PPW