Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2837 |
Symbol | |
ID | 5540326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3677271 |
End bp | 3678461 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640894966 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432926 |
Protein GI | 156742797 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000153831 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAAG GACGCACCAG GTTCATTGCG CTGTTGCTCG CCGTGCTGCT GACACTCGCA GCATGCGGCG GGCAGCCGAC CGGCAGCCCC GGCAATGAAT ACGGCAGCGG CGGCGCGACA ACCGAGCCGA CGACTGCTCC TCCGGCGCAA CCGTCTACAG GTGACGAGTT GCAGGTTGAT CGCTCGCGGC TCTCGAGTGA ACTCAGATTC TTCAACTGGA CGGATTATGT CGATCCTTCA ATCCTGGAAG ATTTTGAGAA AGAGTATGGC GTCAAAGTGA TTGTTGACCT CTTCGACGCT AACGAAGATA TGCTCGCCAA GGTGCGCGCC GGGCGCTCCG GGTACGACAT TGTGACGCCA TCGGACTATG CGGTTGAGAT CATGTGGCGT GACGGACTCA TCGCAAAACT TGACAAGTCG CTGCTGCCCA ATTTGAAGAA CATCGATCCC GACCTGCTCA ATAAATATTT CGATCCGGGG AATGTCTATT CTGTGCCGTA CATGTACGGC ATTACTGGAA TCGCCTACAA CCGGCAATCC TTCCCCAACG GCGTCGAGAG TTGGGCGGTG CTGTTCGACA CGGCGGAGAT CGCGCGCTAT CGCGGTCAGT TCAGCATGCT CGACGATGAA CGCGAAACAC CCGGCGCGGC GCTGAAATTC CTGGGCTACT CACTGAATGA AACCAGCCCG GAGGCGCTGA AAAAAGCGCA GGACCTGTTG ATCGCCCAGA AACCGTTCCT GGCCGGGTAC AACAGCAGTG ATGTCAACCG GAAACTGGCG AGCGGCGAAT ATGTGATTGC GCATGCCTGG AGCGGTTCGG CATTGCAGGC GCGCAACGGC TTGGGCGACG AGTTCTCCGG CAACCCGGAT ATCGCCTTTG TTATTCCAAA GGAAGGCGGC ATGATCTGGA TGGACAATAT GGTCATCCTG GCCGACTCGC CGAACGCTTA TACGGCGCAT GTGTTCATGA ACTTCCTGAT GCGCCCTGAT ATTGCTGCGC GCAACGCCGA ATACATTGGC TATCTCTCGC CGAACGTCGA GGCGATCAAA CTGCTGCCGC AGGAGATTAT CGACCTGTAT AACGAAGGGT TTGCCCCGAA CGATGAGGTT CTGAAACGGT TGGAATGGGC AATACGCAAT GATCAGACTG CCGCCTTCAC CGACCTGTGG ACGGCGGTGA AAGGGGAGTA G
|
Protein sequence | MLKGRTRFIA LLLAVLLTLA ACGGQPTGSP GNEYGSGGAT TEPTTAPPAQ PSTGDELQVD RSRLSSELRF FNWTDYVDPS ILEDFEKEYG VKVIVDLFDA NEDMLAKVRA GRSGYDIVTP SDYAVEIMWR DGLIAKLDKS LLPNLKNIDP DLLNKYFDPG NVYSVPYMYG ITGIAYNRQS FPNGVESWAV LFDTAEIARY RGQFSMLDDE RETPGAALKF LGYSLNETSP EALKKAQDLL IAQKPFLAGY NSSDVNRKLA SGEYVIAHAW SGSALQARNG LGDEFSGNPD IAFVIPKEGG MIWMDNMVIL ADSPNAYTAH VFMNFLMRPD IAARNAEYIG YLSPNVEAIK LLPQEIIDLY NEGFAPNDEV LKRLEWAIRN DQTAAFTDLW TAVKGE
|
| |