Gene Rcas_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1697 
Symbol 
ID5539175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2190868 
End bp2192532 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content65% 
IMG OID640893836 
Productpeptidase M9A collagenase domain-containing protein 
Protein accessionYP_001431807 
Protein GI156741678 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.367258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00906719 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCCGCA CTACGCCTGC AATCCTGCTC TTGGTCCTCG TCCTCACCAC TACCGCGCCG 
TCGCCGCTAC AACCGCCGCC GACGCCCGCC TGTCCGCGCA CCTACCACAT CGGACCGCAC
TCGCTCATCG TGATGCTCTA TGCGTCGGGA TACGACTGCA CTGAAGAGAT TGCGCACGCG
CTTGCCGCTA TTGCCGATGA TCAGATTCTC GAACGTCTGA TCGCCCTGAC CGCTCCCGAC
TTTCACAGTC TGACCCGCCG CAACGCCCTG CGCGTCATCG GTCGCATGGC GGAGCGTCCG
CCGCGCGAAC CTGCGCACCG CGTTGTGGCG CGCGCCGCGC CCGCGCTCCG CATTCATCTG
TTGACCCTGC TGCACACCGA TCCCCACGAT GATGTGCGGG CCGACGCAAT CTGGATTCTC
GACACCTTTT TCTTCCCGGC GTATGACGCA CAACCCGCTT TTACCACCAT TGCGCTCACG
CCTGGCAGCG GGGCAAATCT GCGCACCCGC GCTGCATACG CTGCCGCGCG GTTGATTGCC
ACCCGCGTCG GACCGCTCCA CGACGACGAT CTGGCGTTTC TTCTGGCAGG ATTGCAATCC
GACGAGCCAG GCGTGCGCGC GCGCGCCGCC GACGCCATTG CACGCCTGCG CGACGATCAA
CTCGACGTGG CGGCGCGCGC GCGCGTCACC GCAGCACTGG AAGACGCCTG GACATCCGCC
GCGCGCCTGG CGCCAGCCGA TGCGCTCCCC AGCGCGCCTC CCGCCATCCA TGCCGGGCGC
TTCATCTCCG GCATTCCTGA AACCTCGCCG GGACCATTCG CTGCGCGCGC GGCATTGGCG
CGCGCCCTCG ACCGCTTCGG CGGCGAGCGA TTCGCCGTGC TGCGCGCAGA GTTCGAGACC
ATTCATCTCT CTTCGTGCCT CGATCAGCGC ACCGTGCGAA TCTGCACCGG ATCATCCACA
GCCAACCTTT CCGCCATCAC CGCGCAACTG GAACGCCTGC GCACCTTTTT CTTCGACCTG
ACCGGCATCA CCGATCCTGT GCCTGGCGAT CCGCACGAGC AACTCACGAT TAAGATATTT
GCCAACCGCA GCGTCTTTCG AGAGTACATG CTGGCATTCG TCGGCTTCGG CGCCGATGTT
GACGGCATAT ACGTCGAGCG CGACGGCGTT CTCTTCACCT ACGAGCGTAA CGCTGCCGAA
AGCGTCAACA CCACCGACGA GACTATCGCC CATGAGTTCG GTCACTACCT GAACGGACGA
TACCTCTTCC CCGGCGTCTG GCACGATCCT GGCTACCATG CCGAGCCGAA AGGATGGTTC
GACGAAGGCA TCGCCGAATA CCTCGCCGCC CTCAGCGACT CTGGCGCCGG ATCGCGCGCC
GCACTCGACC GCCTCTGCAC TCATTCCGTT CCACCCGATC TGGCGCGCCT GCTGGGGCAG
CGCGATGGAT ATGATCAGTA TGGCACATTC GCGTATGATG AGGCGTGGGC GTTCGTGACA
TTTTTGCATC AGGAGCGACC ATCCGCCATC CGCGCGATTG CGGCGGCTTT TCGCGCCAAC
ACCTACCGCC AGAGCGACGT GCCGGATCTC GTCGGCGCAC CGTCGCTCGC CACAATCGAG
ATGGAATGGC ACGCCGCACT GGTACGCTGG TGCCGTCAGC GATGA
 
Protein sequence
MPRTTPAILL LVLVLTTTAP SPLQPPPTPA CPRTYHIGPH SLIVMLYASG YDCTEEIAHA 
LAAIADDQIL ERLIALTAPD FHSLTRRNAL RVIGRMAERP PREPAHRVVA RAAPALRIHL
LTLLHTDPHD DVRADAIWIL DTFFFPAYDA QPAFTTIALT PGSGANLRTR AAYAAARLIA
TRVGPLHDDD LAFLLAGLQS DEPGVRARAA DAIARLRDDQ LDVAARARVT AALEDAWTSA
ARLAPADALP SAPPAIHAGR FISGIPETSP GPFAARAALA RALDRFGGER FAVLRAEFET
IHLSSCLDQR TVRICTGSST ANLSAITAQL ERLRTFFFDL TGITDPVPGD PHEQLTIKIF
ANRSVFREYM LAFVGFGADV DGIYVERDGV LFTYERNAAE SVNTTDETIA HEFGHYLNGR
YLFPGVWHDP GYHAEPKGWF DEGIAEYLAA LSDSGAGSRA ALDRLCTHSV PPDLARLLGQ
RDGYDQYGTF AYDEAWAFVT FLHQERPSAI RAIAAAFRAN TYRQSDVPDL VGAPSLATIE
MEWHAALVRW CRQR