Gene Information |
Locus tag | Rcas_3437 |
Symbol | |
ID | 5540936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4484968 |
End bp | 4486968 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895555 |
Product | hypothetical protein |
Protein accession | YP_001433505 |
Protein GI | 156743376 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
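The coordinate and length fields in this record can be cross-checked against each other. The sketch below is one way to do that, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4484968
END_BP = 4486968
GENE_LENGTH_BP = 2001
PROTEIN_LENGTH_AA = 666


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for the values listed here: the span covers 2001 bp, which is 667 codons, of which the last is the stop codon.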
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.54517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0867765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTT TCCCCGCCCG ACCGTTGCCG ACCCTGCCCG GCGCTCTTGC ACTGCTCGTG ATTGCACTCC TGTCTGCCTG CGCCGGCGCC GTGCCGTCAT TGCCCCCTTC CCATCTTCCT GTGATTGCGT CGCCGACGGT TGCGCCGTCT GTCACTCCGT CATCCGTGCC TGTCGCTTCT ATCCGCCCCA CCGCATCTAT CGTGCCCGTT ACCGCTCCCA TCGACGAACT GACGACGATT GCCGCTGCCG TCCCTGCTCC TCGTGATCAG CGCGCGATCA GTGCGGCGTT CCATGGCGGC GACATCCCCT ATGTGGCCCG GACGATGCCG CTCGACGTTC GGATTGGCGC AACCGAGACC TTCTGGGTGG CTGATGTCTC GAACAATGTG AACTATACTG TCACAGCACA ACTGCGTTAC GCCGGTCCGG TTGTGTTGAT GTATATTGAT ACGACGCTCG ATGTCCCGCA ACATCTGATC GAGCAGTCGG CGCAGGTCTT CGAGGAACGG ATCTACCCGC GTAATCGCTT GTTGTTCGGT GAGGAACGCA TCCCCGGCGT CGATGGCGAC GCGCGACTGA CGATTCTCAA TACCCGCATT CGCGGAGCAG GCGGGTATTT TTCGTCAGCC GATGGCGTGA CGCGCGCGGT CAATCGTTTC AGCAACGAGC GTGAGATGTT CGTCATCGAC GCAGTCGCCT TCCCTCCCGG CAGCGAGACC TACAACGCAA CGCTGGCGCA TGAGTTTCAG CATATGATCC ACTGGCACCG TCAGCCACGC AGCCCAACAT GGTTTAACGA AGGTCTCTCG ATGCTCGCCG AGGACCTGAA TGGATTGGGC GACAATGGCG CGGCATTGGC GTATCTCCGC AATCCCGACA CGCAACTGAC GACGTGGGCG CCGGGAAGCG GCGTTACGCG CCACTACGGT GCAGCGCAAC TCTTCATGCG CTATCTGTAT GAACAGTATG CTGGCGACAG TCGCCCCGCC GACTGGATCG ACGCTGATGC GGGCAACAAT GTGCATGTTC TGGCAAATCT CGCCGCTTAC CGTCGCCCCG ATATTGTCAC CTTCGCGGAT CTGTTTGCCG ATTGGGCAGT CGCCAATGCC TTGAATGATC CATATGTGGA CGATGGACGC TATGCGTATC GTGGCATTCC GACGCGCGCC GCAACGATGC GCCTTGAACC GGGAACAACC TCCGCTACAG TGCGTCAGTT TGGAGTGGAT TACATGGGTC CGCTCGACGG TCCGCTGGCA ATCGATTTCG ATGGCGCCGA TACGGTGCAG TTGGTTGGAG TGTTGCCGGC CGAAGGGCGC TTCGCCTGGT GGAGCAATCG CGGCGATGAA AGCGTCTCGA CACTGACGCG ACACCTCGAT CTGCGCAGTG TGTCGCGCGC AACGCTCACG TTTCGTCTCT GGCACGAACT TGAGCGCGAC TACGACTATG CCTTCGTCAC CGTCTCCAAC GACGGCGGTA CGCACTGGCA GACGCTCCCC GGCATCACCA CTCGTGCCGA CGATCCGCAG GGGCACAACA TGGGGTACGG ATTCACCGGC GTCAGCGGCG CGCCGGATGT CGCCCTCGGC GGCGTGCGCG GACGCTGGAT CGACGAGCGC ATCGACCTGA CGCCGTTCGT CGGTCAGGAC GTTCTGCTGC GCTTCTGGGT CATCTCCGAT GCGGCGATCA ACGGTCCTGG CATGCTGATC GATGATATTC GAGTTCCGGA GATTGGCTTC GCCGATGGCG CCGAAACCGA TGACGGCGGA TGGGACGCGA TAGGGTTCGT GCGCACATCC 
GGCATTCTTC CGCAACGCTG GGTCGTGCGG TTGCTGTTGT TCGACAGCGA TGAAACGCGG GTGATCATTC CAGAGATCGA TAATCAGGGG CGCATCAGCC TCCGGGTCGC TGCCGGGCAG CGCGCAATAC TGCTGGTTAG CGGCGCGACT CATTTCACGA CTGAACCGGC TTCGTACCGA GTCAATCTGT ATCAACCGTG A
|
Protein sequence | MNRFPARPLP TLPGALALLV IALLSACAGA VPSLPPSHLP VIASPTVAPS VTPSSVPVAS IRPTASIVPV TAPIDELTTI AAAVPAPRDQ RAISAAFHGG DIPYVARTMP LDVRIGATET FWVADVSNNV NYTVTAQLRY AGPVVLMYID TTLDVPQHLI EQSAQVFEER IYPRNRLLFG EERIPGVDGD ARLTILNTRI RGAGGYFSSA DGVTRAVNRF SNEREMFVID AVAFPPGSET YNATLAHEFQ HMIHWHRQPR SPTWFNEGLS MLAEDLNGLG DNGAALAYLR NPDTQLTTWA PGSGVTRHYG AAQLFMRYLY EQYAGDSRPA DWIDADAGNN VHVLANLAAY RRPDIVTFAD LFADWAVANA LNDPYVDDGR YAYRGIPTRA ATMRLEPGTT SATVRQFGVD YMGPLDGPLA IDFDGADTVQ LVGVLPAEGR FAWWSNRGDE SVSTLTRHLD LRSVSRATLT FRLWHELERD YDYAFVTVSN DGGTHWQTLP GITTRADDPQ GHNMGYGFTG VSGAPDVALG GVRGRWIDER IDLTPFVGQD VLLRFWVISD AAINGPGMLI DDIRVPEIGF ADGAETDDGG WDAIGFVRTS GILPQRWVVR LLLFDSDETR VIIPEIDNQG RISLRVAAGQ RAILLVSGAT HFTTEPASYR VNLYQP
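The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, so a standard codon table is sufficient here. The gene sequence is listed in coding orientation (it begins with ATG), so no reverse complement is taken. The `translate` helper is hypothetical, and only the first 30 and last 21 bases of the record are used.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    print(translate("ATGAATCGTT TCCCCGCCCG ACCGTTGCCG"))  # MNRFPARPLP
    # Last 21 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GTCAATCTGT ATCAACCGTG A"))  # VNLYQP
```

The two outputs match the first ten and last six residues of the listed protein sequence.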