Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1687 |
Symbol | |
ID | 5539163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2172628 |
End bp | 2174382 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640893824 |
Product | PA14 domain-containing protein |
Protein accession | YP_001431797 |
Protein GI | 156741668 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000321103 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCACGAC GATTTCAACC TGGACAGGCG CTCGTCGAGT TCGTGATCGC CTCGACGCTC ATCCTGCTGC TGCTGGCGGC TGCGGTCGAT ATCGGTCTGC TCTTCTTCAA TATGCAGGGG CTGACCACCG CAGCGCAGGA AGGCGCCACC TACGGCAGCC GCTACCTGGT CGTTCAGCCG AACGGGACGG TTGACCTCGA CTACACTATG ATCCGCGCGC GCGTGCGCCA GGAAGCCGGC ACAACCGGCG GCATCAACTT CGTCAACATG TACGACCTGA ACAGCGATGC CATTCCCGAC GCCGAAGATA CCAACGGCGA CGGCGTGCTC GATCATTTTC AGTACTTCCT CGATCAGAAC GGTGATGGGC GCGCCATTGG CGACCCGGTC ACCGGACTGA TCCCCGAAGG AACGCAGCCT GCCGGGTATG TCCGGTTGAT CGACCAGTAC ATCCTGGTGC AGGCGATTGA AGACGTCTTC CCCTTCAACG GCAATCCGCT GGGACCGGAG GACGCCAACG GCAATCGCGC CGACGACCTG GTTGCCGATG TCGATACCAC GCCATGCGCA AATCTGGCGG ATCCGAACCG GCAGTGTTAT GTCTTCGTGA TTGTGAAGTC CGACTACAAC ACGGTCTTTG GCTTCACGCC CGCATTCGGC GACAAAGTGC CGCTCAGTTC GCGCTTTGTG ATGCCGCTGC GCGCTGGCTT TGTCTCGCCG GGCGCGCCGA CGAACACGCC GGTGGTGCAG ACCAATACGC CAACACCAAC GCCAACCGAT ACGCCCACTC CAACTGCAAC GAATACGCCG ACGCGCACGC CAACCAATAC GGCGTCGCCG ACGACAACCA ATACACCAAC CAACACGCCG ACGCGCACCA ATACGCCGAC GCGCACCAAT ACGCCGACGA ACACCAATAC GCCGACGAAC ACCAATACGC CGACGCGCAC GCCAACGTTT ACGCCCACAC CGACGCCATG TGCTGGCGGC ACGGGCAACG GTCTGCGCGG TGACTACTTC ATCTACACGC CGGGCGGCAG CGTTGGCACG ACCAACTTCT TCCCCGGCGC GCCGGTCGCC AGTCGCCTCG AGAATATCAA TATGGCGCGG AGCGATACCT CGCCCATCGC CGGCGTCGGC AACGACTACT TCTCGGTGCG CTGGACGGGG CAGGTCGAGC CGTTGTTCAG CGGCGAGTAC ACCTTCTACG CTAACACCGA CGATGGCGTG CGCGTGTGGG TGAATGGCGT GCAGATCATC AATGACTGGC GCACCAAGAA TAGCGAAACC AACGGGAGGA TCACTCTGAC CGCCTGCCAG CGCGTGAACA TTACGGTCGA GTACTTCGAG TGGACCGGTA GCCAGAACGC AATCCTCTCA TGGCAGCACG CGAACGTGCC GAAGCAGGTC ATCCCCATCC AACGGCTCTA TGCGAGCGGC TCACCGCCGG CAACCGCGAC GCGCACGCTG ACGCCGACGC GCACCGATAC GCCGACGCCG TCGCGCACGC CGACGCGCAC GCTGACACCG ACGATTACCA ATACGCCGAC GCCGTCGCGC ACGCCGACGC GCACGCTGAC ACCGACGATT ACCAATACGC CGACAAATAC GCCGCCGGCG ACGAACACGC CAACCCGCAC GCCGACGCTC ACACCGTCGC ACACGCCGAC GCTCACACCG TCGCGCACGC CGACGAATAC GCCAACCCGC ACGCCGACGC TCACACCGTC GCGCACGCCG GACACCGGAA CGTAA
|
Protein sequence | MARRFQPGQA LVEFVIASTL ILLLLAAAVD IGLLFFNMQG LTTAAQEGAT YGSRYLVVQP NGTVDLDYTM IRARVRQEAG TTGGINFVNM YDLNSDAIPD AEDTNGDGVL DHFQYFLDQN GDGRAIGDPV TGLIPEGTQP AGYVRLIDQY ILVQAIEDVF PFNGNPLGPE DANGNRADDL VADVDTTPCA NLADPNRQCY VFVIVKSDYN TVFGFTPAFG DKVPLSSRFV MPLRAGFVSP GAPTNTPVVQ TNTPTPTPTD TPTPTATNTP TRTPTNTASP TTTNTPTNTP TRTNTPTRTN TPTNTNTPTN TNTPTRTPTF TPTPTPCAGG TGNGLRGDYF IYTPGGSVGT TNFFPGAPVA SRLENINMAR SDTSPIAGVG NDYFSVRWTG QVEPLFSGEY TFYANTDDGV RVWVNGVQII NDWRTKNSET NGRITLTACQ RVNITVEYFE WTGSQNAILS WQHANVPKQV IPIQRLYASG SPPATATRTL TPTRTDTPTP SRTPTRTLTP TITNTPTPSR TPTRTLTPTI TNTPTNTPPA TNTPTRTPTL TPSHTPTLTP SRTPTNTPTR TPTLTPSRTP DTGT
|
| |