Gene Rcas_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0044 
Symbol 
ID5537502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp57963 
End bp58976 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID640892209 
Productintegrase family protein 
Protein accessionYP_001430200 
Protein GI156740071 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.381266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAA TGACCATCGC CGGCGCCGCC GACGCCTGGG CCACTGCTCA ATACTCACGC 
CGCAAACCGC CGGCGCCCTC CACAATTGCC ACGTACTGTA ATGCATGGCG CTCGTTTGCA
GCCTGGGCGC ACTGCGAAGG GAAGCGTGTC GTTGCCGACA TCGCAGCCTC CGATCTGGGT
GCATGGATCG ATTCGCTCAA TGGCATGGCG GATGGCACGG TGCTCACGTA TGGTCACGGC
GCTCTGGCAA TCTGCAAGTT CCTTGCCGAT CGCGGTCACC TGCGCTGCGA TCTGGCGCTG
CTACGCCTGC ACCTGCGCGA TGCACTGCCG CGCGCCTATG CCGGCCGCGC GCCCGATGTC
CCCGATCTGC GCAGATTGGT GACGTTCTAC GACGGTGAAT TGCCCGCCGG CGAACCAGGG
AGCCGCGCAG AGCGCAACCG CCTGAATATG CTGCGCAACG CTGCACTGTT GCACACCCTC
TTTTCCACTG GCGCGCGAAT CTCCGAGGTG TTGAGCCTGA ATGTCGGTGA TGTGCGCGCC
GATAACGGCC GTATTGTTCC TCGCGCTTTC GTGATCGGCA AGGGACAGCG CCGTCGCGCT
GTTTTTCTCC GTGCGCACGC ACAACAGGCA ATTCTGCGCT ACCTCCACGC GCGGCATGCC
GCGTTTCCGC GCGCCGATGC CCTCTTCATC TCTCACGGTC CACGTGGCGC CGGCGGACGG
TTGGGGCGCA TCGCAGCGTG GGAGATCGTG ACCGGCGCCG CTCATCGCGT GGCAGACCAG
ATCGAATGCG AGGGACGCAT TCGAGAAGCG CGCGCGTTGC GGACCGTGAC CCCCCACACC
TTCCGCCACT TCGTTGCCAC CTGGCTCCTC AACGAAGGCG CTCAACTCTC CGAGGTTTCG
GCGATTTTGG GTCACGCCAA CACCCGCATT ACCGAGCAGT ACTACGCGCG TCACACGGAT
GAACAGCTGC AAGAACTGCA CGATCAATTC GCGCCCGATC CGGAATCGGA ATGA
 
Protein sequence
MSEMTIAGAA DAWATAQYSR RKPPAPSTIA TYCNAWRSFA AWAHCEGKRV VADIAASDLG 
AWIDSLNGMA DGTVLTYGHG ALAICKFLAD RGHLRCDLAL LRLHLRDALP RAYAGRAPDV
PDLRRLVTFY DGELPAGEPG SRAERNRLNM LRNAALLHTL FSTGARISEV LSLNVGDVRA
DNGRIVPRAF VIGKGQRRRA VFLRAHAQQA ILRYLHARHA AFPRADALFI SHGPRGAGGR
LGRIAAWEIV TGAAHRVADQ IECEGRIREA RALRTVTPHT FRHFVATWLL NEGAQLSEVS
AILGHANTRI TEQYYARHTD EQLQELHDQF APDPESE