Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4420 |
Symbol | |
ID | 5541933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5680114 |
End bp | 5681841 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640896518 |
Product | hypothetical protein |
Protein accession | YP_001434454 |
Protein GI | 156744325 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.187576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.765326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCGCA AAAACACCCG TCACTGGCAA ATTCCGCTCA CCAGCCATGT GGATGAACTG CCCGAACCGC TGCGCCGCCG CCTGGAGAAC TCCTGGGCCG GCACCTTCTA CCGCGAATTC TTCTCCCGTC TGGACGAAGG CCCCTTTGCC GTGTTGTACA GCGACCTGAT TTCACGTCCC AACGTGCCGG TGAATGTGCT GGTCGGGCTG GAATTTCTCA AAGCCGCCAA CGGCTGGACG GATGAAGAGA TGTACGATCA CTTCTGTTAT GACGTCCAGG TGCGCTATGC CCTGGGCTAC CGCCAATTGA GCGAAGGCTG CTTCGACTTG CGCACGCTGT ATTATTTTCG AGAACGGCTG GCACAACATG CCCAGGAGAC GGGCGAAAAC CTGCTGGAAC GCGCCTTTGA GCAAGTGACG GGCGAACAGC TGCGCGCCTT CTCCATCAAG AGCGGTAAAC AACGCATGGA CAGCACCCTG CTGGCCTCCA ACATCCGCCA GATGGGGCGC ATCCAGCTGC TGGTGACAGT GCTGCAGCGC GTCTGGCGCA TGCTCAGCGA AGCAGACCAG CAGCGCTATG CCCAACGCTT CGAGCGATAC ACCAAAGGCC ACCCCGGGCA GTATATCTAT CGGCTGAAGA AGGAAGAGTG GCCGGAGCAT CTGCAGCGCA TCGGGGAAGA CATGCGCACC CTGTTGCAGG AACTGCAGGC CGCCTACGGA GAACAGCCGA CCTATGCGGT GCTGGCGCGG GTGTTTGCCG AGCATTTCCG CCTGGAGAAG GAGAAACTAC AGGTCAAAGA AGCCAGCGAA TTGAGCGCCC GCAGCCTGCA ATCGCCGGAC GACCTGGAAG CCACCTACCG CGAAAAGCAC GGCAAATCTT CACGCGGGTA TGTGGTCAAC CTCACCGAAA CCTGCGACCC GGACAACCCG CTGCAACTGG TGACCAAAAT CCAGGTCGCG CCCAATGTCA CCGATGACAG CGCCCTGCTG GCCGAAGCCT TGCCCGACCT GAAGGAACGC ACCGGGCTGG AAGAACTCTA CACCGATGGC GCCTACGGCA GCGCCGAGAA CGATAAGCGT TTGGCTGAAC AGGAGGTGAC GCTGATCCAG AGCGCCATCC GCGGGCGCAA GCGAAAGGAA GAGCGGCTGT ACCTGGATGA TTTTACGCTG CCAGGCGACA CTCACAACGG AGCGCTGAAC CTGACCTGTC CACACGCTCA GCAAGCGCCC GTGAGAGGCG CCAAACAGGG CAAATCGTAT CGCGCCACCT TTGACGCGCA GGTTTGTGGG AACTGTCCCT TGCAGCCCCG GTGTCCGGTG CAGCCTCGCA AAAATGGAGA AGCTGTACTG CTCTTCACGG AGGAAGACTT GCGCCGGGCG CAGCGGCGGC GCAGGATGCG TCAGGCGGAC TCGGGAGAAC GGAACCTGCG TTCTGCCACT GAAGCGAGCA TCCGCAGTCT CAAGCATCCC TTCCCGGCGG GCAAGTTACC GGTACGGGGA CGTTTTCGAG CCGCCTGTCT GCTGATTGGT TCCGCCGCCG TGATGACCGT GCGGCGGATA CACCGTTACC TGCAGAGCCA GATAGCAGGA AATCGGCCAG GAGAGCAGGC AAAAAGGATG ACAAAACGCC TGGCAGAACA GGCGGAACAT GTTTTTTTTT TTGGCCGGAC GCTTTTGCAG GCCTTTGGAC TTTACCGCCG AATCAACAGC CCGGTTTTGA CCTGGTAA
|
Protein sequence | MFRKNTRHWQ IPLTSHVDEL PEPLRRRLEN SWAGTFYREF FSRLDEGPFA VLYSDLISRP NVPVNVLVGL EFLKAANGWT DEEMYDHFCY DVQVRYALGY RQLSEGCFDL RTLYYFRERL AQHAQETGEN LLERAFEQVT GEQLRAFSIK SGKQRMDSTL LASNIRQMGR IQLLVTVLQR VWRMLSEADQ QRYAQRFERY TKGHPGQYIY RLKKEEWPEH LQRIGEDMRT LLQELQAAYG EQPTYAVLAR VFAEHFRLEK EKLQVKEASE LSARSLQSPD DLEATYREKH GKSSRGYVVN LTETCDPDNP LQLVTKIQVA PNVTDDSALL AEALPDLKER TGLEELYTDG AYGSAENDKR LAEQEVTLIQ SAIRGRKRKE ERLYLDDFTL PGDTHNGALN LTCPHAQQAP VRGAKQGKSY RATFDAQVCG NCPLQPRCPV QPRKNGEAVL LFTEEDLRRA QRRRRMRQAD SGERNLRSAT EASIRSLKHP FPAGKLPVRG RFRAACLLIG SAAVMTVRRI HRYLQSQIAG NRPGEQAKRM TKRLAEQAEH VFFFGRTLLQ AFGLYRRINS PVLTW
|
| |