Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0738 |
Symbol | |
ID | 5538204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 964763 |
End bp | 966070 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640892894 |
Product | CBS domain-containing protein |
Protein accession | YP_001430877 |
Protein GI | 156740748 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGACGTTG AAATCGGCCT GGTTCTGGCA GGCATTCTTC TCTGTCTCTT CGTTCTGGCA TTCACATCGG CAATCGATGC AGCAATGACG GCGATCAGTC GCCATCGTCT GGGGTTGCTG CACGAGACCG ACGCGCGGCG TGCGCAGGTG ATTGACCGCC TGCTCGCCGA GCCATATCGC TTCAAAGCAA CGGTTCTGCT GCTCAATAGC ACGGCGACCA TAACGGCAAC AGCACTGACC TTGCGCCTGT GCGACGATCA GATGTGGCAA TGGCGCCTTG CTGCGCTTAC CGGGTTGTTG CTCGGCATTC TCATCTTCGC CGAAGCATTG CCAAAAGCGC TGGCAATCGG TAACCCTGCG GCAACTGCAC GCGCGCTCGC CAGTCCAATG GCGCTGATTG CGCGCCTTCT GGCGCCGTTC ATCTGGGTGA TCGGATTGCT CACCCGCCCG TTCCTTCGTG TCGCCAGCGG GCAGACCACC TCGTCGATGC CGCTGGTGAC TGAAGAGGAA CTCCGCATGC TGGTGAATGT CGGCGAAGAG GAAGGGTTGA TCGAACCAGA AGAACGGGAG ATGATCGAAG GAATCTTTTC CTTCGGTGAT ACAACGGTGC GTGAGGTGAT GATTCCGCGC GTGGATATTG TGGCACTTGA GGAAACTGCC TCTATCGATG AGGCGCTCAA CATCATCATT ACGACCGGTC ACTCACGTAT TCCGGTGTAC CGCGAGACGA TCGACCACAT TGTGGGCATT CTCTATGCGA AGGATTTGTT GCTCTGGTTG CGTTCCGGGC AACGTGATGC ATCGATTGGC GCGCTGCTCC GCACTGCGCA CTTCGTGCCC GACACGATGA AAGTGGATGC CCTTCTGAAA GACCTTCAGG CGCGCAAAAT CCATCTGGCG ATCGTTGTCG ATGAGTATGG CGGCACTGCC GGTCTTATCA CTATCGAGGA TGTGATCGAG GAAATCGTCG GTGAGATTCA AGATGAGTAT GATGTGGACG AGCAGCCGAT CCGGGTGCTT GCGCCTGGGG ATATGGAGGT GGATGCGCGT GTGCCGATCG ATGATATCAA CGATCTTACC GGGTTGCGTC TGGCGTCGGA GGAGTCGGAT CGGATTGGGG GGATGGTGTT CGAGCGCCTT GGTCGGGTGC CGAAAGTCGG TGATACGGTG CAGATCGCCG ATGGCGTGAC CGTTGTGGTC CTCTCGATGG ATGGTCTGCG CCTGCGGAAG TTGCGCCTAC AGTATCGGTT GCCTCAGGAA GATGCAATTT CTGCCACGCC TGGCGAAGAT CGAAAGAATC ATGAGTGA
|
Protein sequence | MDVEIGLVLA GILLCLFVLA FTSAIDAAMT AISRHRLGLL HETDARRAQV IDRLLAEPYR FKATVLLLNS TATITATALT LRLCDDQMWQ WRLAALTGLL LGILIFAEAL PKALAIGNPA ATARALASPM ALIARLLAPF IWVIGLLTRP FLRVASGQTT SSMPLVTEEE LRMLVNVGEE EGLIEPEERE MIEGIFSFGD TTVREVMIPR VDIVALEETA SIDEALNIII TTGHSRIPVY RETIDHIVGI LYAKDLLLWL RSGQRDASIG ALLRTAHFVP DTMKVDALLK DLQARKIHLA IVVDEYGGTA GLITIEDVIE EIVGEIQDEY DVDEQPIRVL APGDMEVDAR VPIDDINDLT GLRLASEESD RIGGMVFERL GRVPKVGDTV QIADGVTVVV LSMDGLRLRK LRLQYRLPQE DAISATPGED RKNHE
|
| |