Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3712 |
Symbol | |
ID | 5541214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4867409 |
End bp | 4869403 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895823 |
Product | hypothetical protein |
Protein accession | YP_001433770 |
Protein GI | 156743641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0157248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000528393 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGACG ACACATCACC ACATCCATCG AAATTGCAAC CGACAGGGAT CACGGCGGTG CTCAGCGCTA TTCTGGCATC CAATCCGGCT TCTGCGGTTG TGCCGGTTGT TCCTGCCGAT ACGCTCGATG GTATTGTCGA ACGGGTGCGC GCTGCACGCG CCGGCAATGT GCAGTTGCTG GTGCCGGAAG GGATTCCCGC GCTCCGGGGA CGGCGAAGTT TTACGGCGCT GCGGTTGATT TTGCAGCGCG ACGGTATCAG TATCACGGTC ATCAGTCCCG ACGCGGATGT GCTCGAAGCG GCGCGCGCCA GCGGTCTGGA GACGATGGAA GTGGGCGCTC CATCACCGCC GCGCCCGGTT GCCACAACAC GCGCCGCGCC GCCCCCCGCT CGTCCGGTGA TCGATGAACG CGACGCTGAG TTTCTGCGCG TGTTGAATCA GGTTCCATCA CAGGATCAGT ATGCTGAATT GTCGAAGGCG GATGCGGACT TTGCGGCATC GCTCGACGAT CTCGCCGAGG CGGTTGCGAC GGTTTCACCC GCTGCGGCGG AAAAGACGAC GCCCCTGCCA GGCGCTGCCG CTTCTCAGCG GGTCAACGCC GCCGACATCC GCCTCAGCCC GGAAGAAGAA CGTCGCGCCG CCGTCCACGA AACGGGACGC CGCAGTGAAG CGACGCCACG GCGCACTGCC ATGCCACGCC GCGCCGCCGC GCGCACCGCG CAACGTCCGG CAACTCCGGC GCGCACCGGC GACCGCACTA TGATTATTGG CGTCGCTGTG GCCGTCCTGA TGGTCGCCCT GCTGATTGCG TTCGGCTGGT ACCAGGCGAA TCGGGTGTCA ATTCGTGTCG GACCGCCGGT GATGCAGAGT CGCAGTCAGC CCTTCCGCGA TGAGATTATT CCAATTACAA CTGCCGATCC CGGCGCCAAT CCATCGTCGA TCCAGGCGGC AGTCGTGCGG GCGGACGCCA CCTTCACGGT GCAAGGACAG GTGACCGGCG AAACGCTGGC GCCTGTCGGT CGCGCAACGG GTCAGGTGCG TATCGTTAAC GTGATTGAAC AGCCCTTTCC GATTCCCGAA GGCGCAGAAC TGCTCGGACT GAACCCGAAT GGCGCCGAGG TGCGCTTTGC TATCGAAGGA CCGGTCACCG TGCCTCCGGC TGTCACCACG GTGAGCGATC GTGGTCGCAG CACAACATTT GGTGAGGTCG TCGTTAATGT TGTTGCGCGC TCTCCCGGCA GCGCATCGAA TGTTGGGGCG AATGCGCTGA CCCAACTGCT CATTCCTGGC ACGCAGCCAA TCATCAGTGA TCGCGGCAAT CTCCTCATTC GCCACGATGC GATTGGCGGC GGCGGTGAAG AGATGCAGCG GATCGTCACC GAAGCCGAAG TGCAACGGGT GCTTGGCGAG GCGCTGACCG GTCTGTACAA TACCGGAATG CAGCAACTTG CCCGTCAGAT CGACCAGAAT GTGCTGGCAA TCGACCCGAC GACGATCTTC CCCAGTTCCA TCGATCTGGC GCAACCGGAA GCGTATGATC CGCCGCTCAT CGAACCACCC ATCGGTCAAC CGGTGGACCC TGCCAATCCG GTCTTTCGCC TGACGGTGAG CACTCGCTTC AGCGCGCTGG CAACCCCCCG CGAACGTCTG GTGAGCCGGC AACTCGAACT GGTCGTGCCA CAGCATTTCT TGCAACGCAC TGCGCTCTGC AACCCCAATG AGCGTGTCGG GTTCGATGTC GCCGGATGGC GCTGGGATGG CTCGAAACTG ACGATCAACG GCGCAGTGAC ATGCACTGAG TATGGGGTCA TCACGACGGA TACGCTCGAT CAGATCAGGC GAGCGCTGGT TGGCGCTTCA CGCAGCGATG CGGAAACCAT CCTCCAGCAG TTTGCGCAAC AGGGGTTGAT CAGCGATTAT AGCCTCCCTG CGGTCCAGAC GCTGCCAGGG TTCGATTTCC TGATCGATGT GCGTCCCACC AGCACGACAT CGTAA
|
Protein sequence | MSDDTSPHPS KLQPTGITAV LSAILASNPA SAVVPVVPAD TLDGIVERVR AARAGNVQLL VPEGIPALRG RRSFTALRLI LQRDGISITV ISPDADVLEA ARASGLETME VGAPSPPRPV ATTRAAPPPA RPVIDERDAE FLRVLNQVPS QDQYAELSKA DADFAASLDD LAEAVATVSP AAAEKTTPLP GAAASQRVNA ADIRLSPEEE RRAAVHETGR RSEATPRRTA MPRRAAARTA QRPATPARTG DRTMIIGVAV AVLMVALLIA FGWYQANRVS IRVGPPVMQS RSQPFRDEII PITTADPGAN PSSIQAAVVR ADATFTVQGQ VTGETLAPVG RATGQVRIVN VIEQPFPIPE GAELLGLNPN GAEVRFAIEG PVTVPPAVTT VSDRGRSTTF GEVVVNVVAR SPGSASNVGA NALTQLLIPG TQPIISDRGN LLIRHDAIGG GGEEMQRIVT EAEVQRVLGE ALTGLYNTGM QQLARQIDQN VLAIDPTTIF PSSIDLAQPE AYDPPLIEPP IGQPVDPANP VFRLTVSTRF SALATPRERL VSRQLELVVP QHFLQRTALC NPNERVGFDV AGWRWDGSKL TINGAVTCTE YGVITTDTLD QIRRALVGAS RSDAETILQQ FAQQGLISDY SLPAVQTLPG FDFLIDVRPT STTS
|
| |