Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0050 |
Symbol | |
ID | 5537508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 64741 |
End bp | 66081 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640892215 |
Product | hypothetical protein |
Protein accession | YP_001430206 |
Protein GI | 156740077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0359798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATCAA CTCAACGCTT GTTGATGGTT GCTGCGCTTG TCGCTATCTT CGCATCGTTT CCGCTTGTCG ATCAAGCCCA CGCTGAAGAG ACGTCTCCGT CTCCTTCTTC AGACTCTCCC TTTGCAACCT CACCGATCGC TCCGACGTAT CGGGTGTTCG CCACGCGCCA GGGGCGCGTC GGTCGCCGCA CTGCCAATGG GCACATCATT CAACCACGCG ACCGGTTCGT GGCGCTTCCG TCGTGGAGCG CGCTTTCGAG CCGTGGTGGA TCGGAGTATC AGGTGCGCGT CACCTATCGC AACCGCAGCG TCGTGTTGCC GGTGTGGGAC GTCGGTCCGT GGAACACGCG CGATGATTAC TGGTCACCGA ATCGACAGTA TGGCGACCTT CCTGTCGGAC TGCCAATGGC GCAAGCCGCG CGCCAGCAGG GGTACAACAA CGGACGCGAT GAGTTTGGGC GCCGTATCCG TCAACCGAAC GGCATCGATA TTGCCGATGG CGCGTTCTGG GATGATCTTG GTATGGTCGA TAGCGATTGG GTCGAGGTGA CGTTTTTGTG GCTCGGCGCC GACCCGTTCG TTGCATCAGA TGATGCCTCC TCAACGACTG ATCGCGCGGC GGTCGAACCG GAGGCGATTG TCGTGGATGA TGGCACTTCA GAATACGCTG CAACCCATGG AAGAAACTGG CAACACGCCG ACTGCGGATT CGGCGGCGGA CACGCCTGGA GTTACGACAC GCCGCAGGCA ACGGTGCGCT CTCAGCACCG CGCCGTCTGG TCGCCCGATC TCCCAGGAGA AGGCTTCTAC GAGGTCATGG CATTCATTCC AACATGCGGT CCAACACCGA CGAGCCGGGC GCAGTATTCG GTGGTGCATA GCGGCGCGGT GTCTGATGTG GTTATTGATC AGGGCGCAGC ATCCGGTGGA TGGGTCTCGC TCGGGGTCTT CCACATGGGA CCCGGTAGTT CGGTGACGCT GACCAATCAG ACCGGCGCCG ATGGTCGCGC GGTGCATTTT GATGCGCTTA AGTGGGTTCC GCGCAACGAC CAGGCGCCAC CGGACGCTTC GGTGATTGAG GCGACGCTCC TGCCCGAAGG CGGCATTCTG GTGCGCTGGG ATGGACAGGA CGACGTCAGC GGCATTGCGT CGTTCGATGT GCAGGTGCGT CGCGCCCCCG ACGGCGAATG GATCGATTGG CGTAGTCGGG CGACCGATCG GGAAGCGCTC TTCGTTCCCT CTGAGCCGGG CGCCTATGCC TTCCGCGCTC GCGCCCGCGA TTGGACCGGC AAAGAACAGC CCTGGCCCGA TCTGGATGAT GTTCAGATCG TTGTGCCATA G
|
Protein sequence | MLSTQRLLMV AALVAIFASF PLVDQAHAEE TSPSPSSDSP FATSPIAPTY RVFATRQGRV GRRTANGHII QPRDRFVALP SWSALSSRGG SEYQVRVTYR NRSVVLPVWD VGPWNTRDDY WSPNRQYGDL PVGLPMAQAA RQQGYNNGRD EFGRRIRQPN GIDIADGAFW DDLGMVDSDW VEVTFLWLGA DPFVASDDAS STTDRAAVEP EAIVVDDGTS EYAATHGRNW QHADCGFGGG HAWSYDTPQA TVRSQHRAVW SPDLPGEGFY EVMAFIPTCG PTPTSRAQYS VVHSGAVSDV VIDQGAASGG WVSLGVFHMG PGSSVTLTNQ TGADGRAVHF DALKWVPRND QAPPDASVIE ATLLPEGGIL VRWDGQDDVS GIASFDVQVR RAPDGEWIDW RSRATDREAL FVPSEPGAYA FRARARDWTG KEQPWPDLDD VQIVVP
|
| |