Gene Rcas_3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3651 
Symbol 
ID5541153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4780587 
End bp4781729 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content60% 
IMG OID640895771 
ProductIS605 family transposase OrfB 
Protein accessionYP_001433718 
Protein GI156743589 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.432603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGG CGTTCGTCTA CCGTCTCTAC CCATCGAAGC CCCAGGAACG GCGGCTGGAA 
TTCACCCGCG AGACCTGCCG GCGCTTCTAT AACGACTGCC TTGCAGAGCG CAAGACCGCT
TACGAGGAAC GCGGTGAGAC GATTCGCAAA TCTGCGCAAC TGCGGCGCGT GAAAGAGCGC
AAAGCGACCA ATCCCGACGC TAAAGACGTG CATAGCCACA TCCTGCAGAT CGTGGTAGAC
GACCTGGATA AGGCATTTCA AGCCTTCTTC CGTCGTGTGA AAACGGGCGA GAAACCAGGA
TACCCCAGGT TCAAGGGACG CAATCGCTGG AAGGGGTTTG GTCTGAAAGA GTACGGCAAC
GGCTTTAAGA TCGATGGGCG GCGGCTCCGT ATCACCGGCA TTGGTCGGAT CGCGGTTCGC
TGGCATCGTC CCCTTGAAGG GCGGATCAAG ACCCTCCGGA TTGTGAAACG GGCTGGCAAG
TGGTACGCCG CCTTTTCGTG CATCACGAAC GAAGTGCCGG CTCCGCCCGC TCAGCGCGAT
GTGGGGATTG ATGTCGGCGT CGCCTGCCTG CTCACGACAA GCGATGGGGA GAAGGTGGAC
AACCCGCGCT GGCATCGCGT CGAGCAGCGG AAATTGCGCG TGTTGCAGCG ACGGGTTGCC
CGTCGCAAGA AAGGCGGCGG CAACCGCAAA AAGGCCGTGC TTGCGCTGCA ACGCCAACAC
GAACGGATCG CCAATCGGCG GAAGGACTTC TTGAATAAGC TGGTACACAA CCTGATCACA
CGCTACGACC GGATTGCGCT GGAAGACCTG CGCATCACGA ACATGGTGCG CAACCCGCGC
CTGGCGAAGA GCATTTTGGA CGCTTCGTGG GGCTGCCTCA TACGACGCCT GATGAGCAAG
GCTGCAAGCG CTGGCCGTCT GGTGCTGCTG GTAGACCCAC GGAACACGTC GCAGGACTGC
TCTCAGTGTG GGCGGCTCGT TGCGAAGGAT TTGAGCGTGC GCGTGCATCA GTGTCCACAT
TGCGGCGTTG TGCTCGATAG AGATCACAAC GCGGCGATCA ATATCCTGAA AAGGGCTGGA
CAAGTCCTTT GGGGCATAAG CTCGTCACCA GGCGGGTTGC CCCAAGAAGC TGCCGGGTTT
TAA
 
Protein sequence
MRKAFVYRLY PSKPQERRLE FTRETCRRFY NDCLAERKTA YEERGETIRK SAQLRRVKER 
KATNPDAKDV HSHILQIVVD DLDKAFQAFF RRVKTGEKPG YPRFKGRNRW KGFGLKEYGN
GFKIDGRRLR ITGIGRIAVR WHRPLEGRIK TLRIVKRAGK WYAAFSCITN EVPAPPAQRD
VGIDVGVACL LTTSDGEKVD NPRWHRVEQR KLRVLQRRVA RRKKGGGNRK KAVLALQRQH
ERIANRRKDF LNKLVHNLIT RYDRIALEDL RITNMVRNPR LAKSILDASW GCLIRRLMSK
AASAGRLVLL VDPRNTSQDC SQCGRLVAKD LSVRVHQCPH CGVVLDRDHN AAINILKRAG
QVLWGISSSP GGLPQEAAGF