Gene Information |
Locus tag | Rcas_3601 |
Symbol | |
ID | 5541102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4699923 |
End bp | 4701266 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895720 |
Product | hypothetical protein |
Protein accession | YP_001433668 |
Protein GI | 156743539 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
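
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4699923
END_BP = 4701266


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1344, matching "Gene Length"
print(protein_length(length_bp))  # 447, matching "Protein Length"
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length rows.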
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.413111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAGGG CGCCTGAAAC GCAACGACAA CTCGAAGAGT TGCGCCGGGT GTTGCTCAAG CCGGAAGCGC TGGTTGATCG CATCAGCCCG GTTATTGCCG ATATTCTGGC GGAACAGATC AATCAGTCGC GTGATGAAAT TGCACACGCC ATAGCGCCTG CTATCGGTGA AGCCATTCGC CACCAGGTCT ATCAGGCGCG TGAAGATATT GTCGATGCGC TCTACCCCGT GGTTGGGCAG ATGATCACCC GCGCCGTCGC CGAAGCAGTG CGCAATCTGG CGCAATCGAT CGATGAGCGC GTTCGTCAAA GCACCTCGGT GATGCTCAGC CCTCGCTACT GGCAGGCGCG CGTGCAGGGG GTCTCACATG GCGAGTACGC CTTGCGCGAA GTCCTTCCCT TTACTATTCA TGAACTCTTC CTGATCCAAC GCGAGTCGGG GGTGCTGATC TGCCACTATT CCGCCGGACC AGAACGCCCG GACCGCGATG TCGTCAGTGG GATGCTCACT GCTATTCGTG ACTTCGCGCA GGAAGCGTTT GGGCGCGAAG AAAGCGGAGA ACTCGGCGCC ATTACCTATG AGTCGCGCCA GATCATCCTC GAGACCGGAA GCGCCGCATA CCTGGCAGTG GTCATCAGCG GTGTTGAGCC GCCAGACTTC CGCGAACGCC TGCGCGAAAC GCTCTTTGCC ATTCACGAAC ATCGCTACGA GCGTCTCCGC GCCTTCGACG GAACCGATGC CCGACTGATC CAGGAAGCGC GTCAGACACT GCGGCAACAT CTGGTTCCGC AGCAGGAAGA CCACCCTCCG CGACGTCTCT CGATGCTTCA GCGCGTAATT GTGGTTGTGA TCGGATTGTT TGTGCTGTCG CCGTTGCTCC TCTGTGGCGC CTGGATCTGG CATGTCGAAA CGCGAATGGC GATGCTCATG ACGCCGCCGA TTGCAGCGCC AACGGCGACC GCCACGCCAA CGGCGACCGC CACGCCGACG CCTACCAGCA CGCCGACGGC AACCGCCACG CCGACGCCTA CCAGCACGCC GACGGCAACC GCCACGCCGA CGGCAACCGC CACGCCGACG GCGACGCCTT CGCCATTCAA TGGTGTCATG ATCGGAAACG TGTATCTGTA CAGCACGCCG GACGAAGCCA GTACACGCAC CGGTATCGTT GCACCGCTCG GCGCGCCGGT CGAAGTGCTG GCACAGCGAG GTGATTGGTA CCGAGTGCGG GTAGCGCTGC CGCAAAACCC GCAGGTCGAA CTGATCGGAT GGATCCCGGC GCGTTGGGTC AGCCTGCTCA AACCGGTGCC GCCCGAAGTA ATTACGCCGA CTGCAACACA GTAG
|
Protein sequence | MVRAPETQRQ LEELRRVLLK PEALVDRISP VIADILAEQI NQSRDEIAHA IAPAIGEAIR HQVYQAREDI VDALYPVVGQ MITRAVAEAV RNLAQSIDER VRQSTSVMLS PRYWQARVQG VSHGEYALRE VLPFTIHELF LIQRESGVLI CHYSAGPERP DRDVVSGMLT AIRDFAQEAF GREESGELGA ITYESRQIIL ETGSAAYLAV VISGVEPPDF RERLRETLFA IHEHRYERLR AFDGTDARLI QEARQTLRQH LVPQQEDHPP RRLSMLQRVI VVVIGLFVLS PLLLCGAWIW HVETRMAMLM TPPIAAPTAT ATPTATATPT PTSTPTATAT PTPTSTPTAT ATPTATATPT ATPSPFNGVM IGNVYLYSTP DEASTRTGIV APLGAPVEVL AQRGDWYRVR VALPQNPQVE LIGWIPARWV SLLKPVPPEV ITPTATQ
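
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be handled, the code below strips the spacing, computes a GC fraction, and translates the coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares for internal codons; alternative start codons are not handled. All helper names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGGTCAGGG CGCCTGAAAC GCAACGACAA"
print(translate(first_blocks))  # MVRAPETQRQ, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence rows.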