Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1578 |
Symbol | |
ID | 5539054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2027368 |
End bp | 2029155 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640893716 |
Product | extracellular solute-binding protein |
Protein accession | YP_001431689 |
Protein GI | 156741560 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.259952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGCA GAATCTTCCC CGTGAGTAAT TCGCATCTAT CGCCAGTCCA TTCCACTCCT GTGCTCCGGT TGCCGTTGCG CCGATTGTTT GGCGTCAGCA CATTTGTTGC AATTGCGTTG CTGGGGGTGC TTGTCGCCTG TGGTGCGCCG GAACAACCTG CCACACCCGC GGGCGTCTCG CCGATGCCTG CGCCAACCAC AGCGCCGCCG ACTGCGACAG CGCTGCCGCG CGGCGGCAAT CTTACAATCC GGTTGGCTGC CGATGTTGCT GAGTTGCGCC CCTGGCATCC ACGTACACGT GGCGAGGAAC AGCTCATCGC GCTGCTCTAC AGCGGTCTGA CCCGCCTGGA CGGCACGCTG GCGCCGCAGC CGGATCTGGC GGCAGGCTGG ACGGCTTCTG CCGATGGGCG GACAATTACA TTGACGTTGC GCCACGATGC GGTCTGGCAC GATGGACAGC CGGTGACGGC GGATGATGTG GTCTTTACTC TCAACGAATT GCGCGCACTC GAACCAACCA CGGCGTTGCT CGCCGGGTTA CGGCGTATGA CGGAGGTTAC AGCGCCGGCA ACCGATACCG TCGTGGTGCG TCTCGATGAA CGCTACGCGC CGATCTTTAG CCTGTTGACG GCGCCGGTGT TGCCGCGCCA TGCCCTGAGC GGCAGAAGTT TGGCGGATCT CAACGCCTGG GAAGCGCCGG TCGGCAGCGG TCCATTTCGC CTGGAGCGGC GTGAGCCGGG CACGGCGATC ACGCTGGCGG CCAATCAATC GTTTTACCGG GGCGCGCCGT TGCTCGACCG GGTGGTGTTC GTGGTGGCGC CCGATGCGCA GGTGGCGGCG TCCGCGCTCC AGAATGGGCA GTTGCTCTTG GCGGAACTGC CCTGGAGCGA AGGGCGCGCG CTCACCGAAA CAATGCCAAT GCTCCAAACC GGCGCATATG CCGAGAATGG CTACTACTTC CTGGCATTCA ACCTGCGCCC CAACCGTATC TTCAGTGATC TGCGGCTGCG TGAGGCGCTG GCGCTCACTA TCGATCTGCC GCGCATGATC CGGGAAGCGA CTAACGGGCA GGGCATGATC ATCGGGAACA GCGCCGCTCC TGGCTCCTGG GCAGACCTGA CGCCGCCGTC AACGACGACC GTGGACCTGG ATCGCGCGCG CGCGCTGCTC GATGAGGCAG GGTGGCGGCT CCCGTCGGAT GGCGTGGTGC GGCAAAAAGA TGGTGTGACG CTCACCGCGC AACTCTTTGT GCGCGCTGAC GATCCGCGTC GGGTGCGCGC TGCCGAACTG ATTGCTGGCG CCGCCGAGCA GATCGGGATG GACATTGTGG TGCAACCCGC TGACTTCGCA ACGGTGATTC GCTCGAAGTA TGCGCCACCC TATGATTTCG ACATGCTCCT CGGCAGCTGG ATCAATGGCG TCGCCGACCC GACTTTCGGT GACTACGCCT ACTACGATCC AGACGATTTT GCGCTGTTTC ATTCGAGTCA GATCAACCAG GGGGTGGCGG ATACGCGCCC TGTGTTGAAC TTTGTCGGGT TTAGCGACCC GGTGTACGAT GATCAGGCAG GCGCAGCGCG GCAATTGTAC GATCTGACAG AACGGGCGCA GGCAATCCGG CGGGCGCAGG AACGAGTGGC GCTGCTGCGT CCCTACCTGT TCCTGTGGAC GGATCGGTTG CCGGTGGCAT GCAGCGCGCG CCTGACAACG CTGGACGGAC CGATTAACCT GGCGACGCCG AACTATCTGT GGAATATCGA ACGGTGGTAT GTCACAGGTG AGGGTTGA
|
Protein sequence | MSSRIFPVSN SHLSPVHSTP VLRLPLRRLF GVSTFVAIAL LGVLVACGAP EQPATPAGVS PMPAPTTAPP TATALPRGGN LTIRLAADVA ELRPWHPRTR GEEQLIALLY SGLTRLDGTL APQPDLAAGW TASADGRTIT LTLRHDAVWH DGQPVTADDV VFTLNELRAL EPTTALLAGL RRMTEVTAPA TDTVVVRLDE RYAPIFSLLT APVLPRHALS GRSLADLNAW EAPVGSGPFR LERREPGTAI TLAANQSFYR GAPLLDRVVF VVAPDAQVAA SALQNGQLLL AELPWSEGRA LTETMPMLQT GAYAENGYYF LAFNLRPNRI FSDLRLREAL ALTIDLPRMI REATNGQGMI IGNSAAPGSW ADLTPPSTTT VDLDRARALL DEAGWRLPSD GVVRQKDGVT LTAQLFVRAD DPRRVRAAEL IAGAAEQIGM DIVVQPADFA TVIRSKYAPP YDFDMLLGSW INGVADPTFG DYAYYDPDDF ALFHSSQINQ GVADTRPVLN FVGFSDPVYD DQAGAARQLY DLTERAQAIR RAQERVALLR PYLFLWTDRL PVACSARLTT LDGPINLATP NYLWNIERWY VTGEG
|
| |