Gene Information |
Locus tag | Rcas_1697 |
Symbol | |
ID | 5539175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2190868 |
End bp | 2192532 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640893836 |
Product | peptidase M9A collagenase domain-containing protein |
Protein accession | YP_001431807 |
Protein GI | 156741678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
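The coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    # Assumption: the CDS ends with a single stop codon that is not translated.
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2190868, 2192532)
print(length, protein_length(length))  # 1665 554
```

For Rcas_1697 this reproduces the listed 1665 bp and 554 aa.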
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.367258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00906719 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCGCA CTACGCCTGC AATCCTGCTC TTGGTCCTCG TCCTCACCAC TACCGCGCCG TCGCCGCTAC AACCGCCGCC GACGCCCGCC TGTCCGCGCA CCTACCACAT CGGACCGCAC TCGCTCATCG TGATGCTCTA TGCGTCGGGA TACGACTGCA CTGAAGAGAT TGCGCACGCG CTTGCCGCTA TTGCCGATGA TCAGATTCTC GAACGTCTGA TCGCCCTGAC CGCTCCCGAC TTTCACAGTC TGACCCGCCG CAACGCCCTG CGCGTCATCG GTCGCATGGC GGAGCGTCCG CCGCGCGAAC CTGCGCACCG CGTTGTGGCG CGCGCCGCGC CCGCGCTCCG CATTCATCTG TTGACCCTGC TGCACACCGA TCCCCACGAT GATGTGCGGG CCGACGCAAT CTGGATTCTC GACACCTTTT TCTTCCCGGC GTATGACGCA CAACCCGCTT TTACCACCAT TGCGCTCACG CCTGGCAGCG GGGCAAATCT GCGCACCCGC GCTGCATACG CTGCCGCGCG GTTGATTGCC ACCCGCGTCG GACCGCTCCA CGACGACGAT CTGGCGTTTC TTCTGGCAGG ATTGCAATCC GACGAGCCAG GCGTGCGCGC GCGCGCCGCC GACGCCATTG CACGCCTGCG CGACGATCAA CTCGACGTGG CGGCGCGCGC GCGCGTCACC GCAGCACTGG AAGACGCCTG GACATCCGCC GCGCGCCTGG CGCCAGCCGA TGCGCTCCCC AGCGCGCCTC CCGCCATCCA TGCCGGGCGC TTCATCTCCG GCATTCCTGA AACCTCGCCG GGACCATTCG CTGCGCGCGC GGCATTGGCG CGCGCCCTCG ACCGCTTCGG CGGCGAGCGA TTCGCCGTGC TGCGCGCAGA GTTCGAGACC ATTCATCTCT CTTCGTGCCT CGATCAGCGC ACCGTGCGAA TCTGCACCGG ATCATCCACA GCCAACCTTT CCGCCATCAC CGCGCAACTG GAACGCCTGC GCACCTTTTT CTTCGACCTG ACCGGCATCA CCGATCCTGT GCCTGGCGAT CCGCACGAGC AACTCACGAT TAAGATATTT GCCAACCGCA GCGTCTTTCG AGAGTACATG CTGGCATTCG TCGGCTTCGG CGCCGATGTT GACGGCATAT ACGTCGAGCG CGACGGCGTT CTCTTCACCT ACGAGCGTAA CGCTGCCGAA AGCGTCAACA CCACCGACGA GACTATCGCC CATGAGTTCG GTCACTACCT GAACGGACGA TACCTCTTCC CCGGCGTCTG GCACGATCCT GGCTACCATG CCGAGCCGAA AGGATGGTTC GACGAAGGCA TCGCCGAATA CCTCGCCGCC CTCAGCGACT CTGGCGCCGG ATCGCGCGCC GCACTCGACC GCCTCTGCAC TCATTCCGTT CCACCCGATC TGGCGCGCCT GCTGGGGCAG CGCGATGGAT ATGATCAGTA TGGCACATTC GCGTATGATG AGGCGTGGGC GTTCGTGACA TTTTTGCATC AGGAGCGACC ATCCGCCATC CGCGCGATTG CGGCGGCTTT TCGCGCCAAC ACCTACCGCC AGAGCGACGT GCCGGATCTC GTCGGCGCAC CGTCGCTCGC CACAATCGAG ATGGAATGGC ACGCCGCACT GGTACGCTGG TGCCGTCAGC GATGA
|
Protein sequence | MPRTTPAILL LVLVLTTTAP SPLQPPPTPA CPRTYHIGPH SLIVMLYASG YDCTEEIAHA LAAIADDQIL ERLIALTAPD FHSLTRRNAL RVIGRMAERP PREPAHRVVA RAAPALRIHL LTLLHTDPHD DVRADAIWIL DTFFFPAYDA QPAFTTIALT PGSGANLRTR AAYAAARLIA TRVGPLHDDD LAFLLAGLQS DEPGVRARAA DAIARLRDDQ LDVAARARVT AALEDAWTSA ARLAPADALP SAPPAIHAGR FISGIPETSP GPFAARAALA RALDRFGGER FAVLRAEFET IHLSSCLDQR TVRICTGSST ANLSAITAQL ERLRTFFFDL TGITDPVPGD PHEQLTIKIF ANRSVFREYM LAFVGFGADV DGIYVERDGV LFTYERNAAE SVNTTDETIA HEFGHYLNGR YLFPGVWHDP GYHAEPKGWF DEGIAEYLAA LSDSGAGSRA ALDRLCTHSV PPDLARLLGQ RDGYDQYGTF AYDEAWAFVT FLHQERPSAI RAIAAAFRAN TYRQSDVPDL VGAPSLATIE MEWHAALVRW CRQR
|
| |
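The gene sequence above is given in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly. The following is a minimal sketch, not the database's own method: it uses only the standard library, relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    # Remove the spaces that separate the 10-base blocks in the record.
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    # Translate codon by codon and stop at the first stop codon.
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    # Fraction of G and C bases; multiply by 100 to compare with "GC content".
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGCCCCGCA CTACGCCTGC AATCCTGCTC"
print(translate(first_blocks))  # MPRTTPAILL
```

The first ten residues match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the listed protein sequence and GC content.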