Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0320 |
Symbol | |
ID | 5537782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 392797 |
End bp | 394467 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640892484 |
Product | extracellular solute-binding protein |
Protein accession | YP_001430471 |
Protein GI | 156740342 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.144423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGCC GCATTCGCTG GCAGATTCTG ATAGCTGCGA TCAGTTCCCT GACCGTGCTC CTCCTCATGA GTTACCTGGC GCTGACGCGC GCATCGGTCG CCCGTCCGCT GACAGGCGGT GATTATGTCG AAGGGGTCGT CGGCGCACCG CTGCATCTCA ATCCACTGCT GGTCGATCCT GCCGCCGATC CGGTTGCTGC CGATCTACAC CGCCTGATCT TTGAGGGATT GACACGCCCC GGTCCTGATG GATTGCCCAT GCCGGCGCTT GCCGAGTCCT GGACGGTCGA TGACAGTGGG ACGGTCTACA CCTTTACGCT GCGCAGCGGC GTCGCCTGGC ACGATGGCGC GCCGGTGACG GTCGATGATG TGCTGTTCAC GATGCGCGCC GTGCAGGGTC CTGCGTTTGC CGGTGATCAG AATGTCGCCG CTTTCTGGCG CACGGTTCTG GTTGATCGCG CCGGAGAGCG CAGCGTCAGT TTCCGCCTTG ACTCTCCATT TGCGCCTTTC TTAAGGTTGA CTGGTTTTCC GATCCTGCCG GCGCACCTGT TGCGTGATAC GCCGCCGGAA CAGTGGGAGG CGATGGCATT CAACCGCTTG CCGGTTGGCG CCGGTCCGTA TCGATTGATC GAACGAGATG ATCAACATGC TCTTTTGCGC GCCAACTCGT CCTACTTTGG TTCGCCGCCG TTCATTGAAA CGATTGAACT GCGGTTCTTC CGCACCGAAC AGGATGCGCT CGCAGCCCTG ACGCGCGGTG ACATTCAGGG TCTTGCTTTC CTGAGCACCG GCGCCCTGGC GGATGTGAAC CTGCCGCGCA ATGTGGTGCG CCGACAGGCA TTGCTGGATG CCTGCACCGT CCTGACCTTC AATCTACGCG ATGGTCCGCT CACTGAGATT GGCGTGCGCC GCGCATTGGC GATGGCGCTC GATAAGGATG CGTTGATCGC CGGAGTGCTG AACGGTCAGG TCGCACGGCT TGATACGCCG ATCCTGCACG GATGGTGGGC TGAAGCGCCT GATGTGCAGT GGTACGAACC CGATCCCGCA CGAGCCGTAT CCGCACTCGA TGCGCTGGGA TACGTCGCCG GCGCCGATGG CATTCGCGCA CGCAACGGTC AACGACTTGC CTTTACCCTC CTGACCGACA GTTCTCCCAA ACGGCGCGCG GTAGCAGAAG AGATCGCCCG CCAGTGGAGC GCCATCGGCG TGCGCATTGT GATCGAGCAG GTTGAGTCCG GCGATCTCCA GCGCCGTTTG GAAACCCACG ACTTTACTAT GGCGCTGCAC GGCTGGCAAC GGCTCGGTTC CGATCCCGAC GTCTTTGAAC TCTGGCATTC GAGCCAGGCG GAACGCGGAC GGAACTACAC AGGTCTGACC GATGCTGACA TTGATGAACT GCTCTACAAC GCGCGTAAAA TCTACGACAT CGCTGACCGC GCCGCGCTCT ACAACGAATT TCAGCAACGC TGGGTCGATC TGGCGCCGGG GATTATGCTG TATCAACCGT TCCTGATTCA TGCCACCGTT GCCGACCTTG GCAGCATCAT CGCTGTTGCG CCTGATGAAG CCGTTTCGCC GCGCCTGATC ATGGGGCGCG AGGGGCGCTT CGCCGACGTC AACCGTTGGT ATCTGCGCAG CGATCGCGAA ATCCGCGGCG ATTTGCGGTG A
|
Protein sequence | MARRIRWQIL IAAISSLTVL LLMSYLALTR ASVARPLTGG DYVEGVVGAP LHLNPLLVDP AADPVAADLH RLIFEGLTRP GPDGLPMPAL AESWTVDDSG TVYTFTLRSG VAWHDGAPVT VDDVLFTMRA VQGPAFAGDQ NVAAFWRTVL VDRAGERSVS FRLDSPFAPF LRLTGFPILP AHLLRDTPPE QWEAMAFNRL PVGAGPYRLI ERDDQHALLR ANSSYFGSPP FIETIELRFF RTEQDALAAL TRGDIQGLAF LSTGALADVN LPRNVVRRQA LLDACTVLTF NLRDGPLTEI GVRRALAMAL DKDALIAGVL NGQVARLDTP ILHGWWAEAP DVQWYEPDPA RAVSALDALG YVAGADGIRA RNGQRLAFTL LTDSSPKRRA VAEEIARQWS AIGVRIVIEQ VESGDLQRRL ETHDFTMALH GWQRLGSDPD VFELWHSSQA ERGRNYTGLT DADIDELLYN ARKIYDIADR AALYNEFQQR WVDLAPGIML YQPFLIHATV ADLGSIIAVA PDEAVSPRLI MGREGRFADV NRWYLRSDRE IRGDLR
|
| |