Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1724 |
Symbol | |
ID | 5539202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2219309 |
End bp | 2220238 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893863 |
Product | periplasmic solute binding protein |
Protein accession | YP_001431834 |
Protein GI | 156741705 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.147475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACT TCGGGTTGCT GCTGCTGATG CTGGTACTGG TTGCGTGTGC ATCGCCCGGT GCGCCCGACG GCAATCTGTC CGACCGTCCC ATCCGCGTGG TGACAACCAC GGGCATAGTT GGCGATCTGG TGCAAAACAT CGGCGGCGAG CGCGTCAGCG TGATCAGCCT GATGGGTCCG GGGGTTGATC CACACGTGTA CAAAGCGAGC GCCAGGGATG TGATCCGACT CCAGGATGCG GACATTATCT TCTATAACGG GCTGCACCTC GAAGCGGCAA TGGGCGACGT GCTCGAACGC ATGCAGGGTC GGGTCAGAAC AGTCGCGGTG ACGGCCGGCA TCCCGCGCGA ATCGCTGCTA CCGTCGGACA AGTATGAAAA CCTGTACGAT CCGCATATCT GGTTCGATGT GACGCTGTGG AAGCGAGCGG CGGAGCATGT GCGCGACGCG CTGATCGATC TCGATCCTTC CAATGCATCG GTCTATCGCG CGAATGCTGA CAATTATATT CGTCAACTCG ATGCGTTGCA CACCTACGCG CTGGAGCAGT CGGCGACCAT TCCCGCCGGG CAACGGGTGC TGATCACTGC GCACGATGCC TTTCGCTACT TTGGTCGTGC CTACGGGTTC GAGGTTCGTG GATTGCAGGG CATCAGCACG GCAACCGAAG CAGGCGCTGC TGATGTACAG GCGCTGGCTG AATTTATTGC AACGCGCCAC ATTCCGGCAA TCTTCGTCGA GACCTCCGTG CCACAACGCA CCATCGAGGC GGTGCAGGCA GCAGTGCAGG AGCGAGGTTT TACGGTAGCG ATTGGGGGCG AACTCTTCTC CGACGCCCTG GGGACACCCG GAACGTCGGA GGGAACCTAC ATCGGCATGG TGCGCCACAA TATCGACACG ATTGTACGCG CATTGCGTGG CGCCGCATGA
|
Protein sequence | MKHFGLLLLM LVLVACASPG APDGNLSDRP IRVVTTTGIV GDLVQNIGGE RVSVISLMGP GVDPHVYKAS ARDVIRLQDA DIIFYNGLHL EAAMGDVLER MQGRVRTVAV TAGIPRESLL PSDKYENLYD PHIWFDVTLW KRAAEHVRDA LIDLDPSNAS VYRANADNYI RQLDALHTYA LEQSATIPAG QRVLITAHDA FRYFGRAYGF EVRGLQGIST ATEAGAADVQ ALAEFIATRH IPAIFVETSV PQRTIEAVQA AVQERGFTVA IGGELFSDAL GTPGTSEGTY IGMVRHNIDT IVRALRGAA
|
| |