Gene Information |
Locus tag | Rcas_3842 |
Symbol | |
ID | 5541346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5021678 |
End bp | 5023384 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895952 |
Product | extracellular solute-binding protein |
Protein accession | YP_001433897 |
Protein GI | 156743768 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
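The coordinates and lengths in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(5021678, 5023384)
print(length_bp)                  # 1707
print(protein_length(length_bp))  # 568
```

Both values match the Gene Length and Protein Length fields listed for Rcas_3842.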
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0450891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.108057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACGACC GATGCTCTGA TCAGGGAGAA CAAAAGGTGC GTCGAATCGA TCGGCGCGCG TTCCTGCGTC TTGTGGCGGC AGGCGGAGGT GCGCTGGCAT TGCAGGCTTG CGGTGGCGGC GCTCCACCCG CTGCTGCGCC GACGACCGCC CCCGCTGCAC CGACGGTTGC CCCGGCTGCG CCGACGGCTG CTCCTGCCGC GCCGACGGCT GCTCCTGCCG CGCCGACGGC TGCACCCGTT GCATCGAACG CGCGCGGAGG CAAAGTCACC TGGGCGATGC TTGGTGATCC GGTGTCGCTC GAACCATACG GAATCAACAT CACGGGCCAG TACAATTACG AAGCGCGCGA GCCGATGTAT GACTCGCTGC TCGTGTGGGA TCGCGATCTG AAAGTACAGC CATCGCTGGC AGAGTCGTTC GAGACGCCGG ACGATACGAC CTATATCTTC AATCTGCGTT CGGGCGTGAA ATTCCACAAC GGCAGGGAAC TCACCGCCGA GGATGTCAAA TTCTCGCTCG ATACCATCAT CAATCCGCCT GACGGACGCA ACCCCGGTGC AGCATTCTTC GCCAATTTCG ACACGATTGA GGTCGTTGAC CCATTGACGA TTCGCATCAA TCTGAAGAAA ATCGACCCGA CCATTCCTGG GTTGTTCGCC TGGTCGCGCT ATACAAACAT CTTTCCCACC GACATGCCGT CGCAGATCAA TCCCGTGACC CAGGCGATCG GCACCGGTCC CTTCCGATTG GTCAGTTATA CGCCCAACGC CGAAATCGTC TATGAGCGCT TCTCCGATCA CTGGAACCCC GAACAGCCGA ACATCGATCA ATTGATCTAT CGCGTCATCC CTGAGGAAGA TGCCCGCATT GCGGCACTGC GCTCCGGCGA TATCGACGGG ACGGACGTTA CGCCGTTAGG TGCGCGGCGA CTCCAGAACG ACTCCGACAT CACCATTCTG AAAGGTCTCT ACTCGCAGCC GAAGGTGCTT CAGTTTACAC TGAAGGGCGG GAAACCGTGG GACATCAAAG AAGTGCGCCA GGCGATCAGT CTGACCATCG ACCGCCAGGA ACTGATCGAC AAGGTGATGG AAGGCGAGGC GGAACTGACG GGACCGGTCG TGCCCGGCTA TGGCGACTGG CCCCTCAGCC AGGATGAACT GCGCGCCGCG TACCAGGTTG ATGTCGAAAA GGCGCGCCAG TTAATGGCGC AAGCCGGTTA TGCCGACGGC TTCAAGGTGA CGGCGATGAC CTTCGCCAAC TACTCGAACG ATAATGCCAT CATTGTACAG GAACAACTGC GCCAGTTGAA CATCGACATG CAGATTGAAC AGATCGAGTT TGGCACATTC GCGCAGCGCG TCACCAATGG CGAGTTCGAG TGGTGCTTTA CGGCGCGCGG CATGCGCGCC GACGTGAGCG GATACCTGAA CGACTTCCGT CGTCTGGGCA TTGCCGAGAA GAACTGGTTT CCCGCCTGGG AGAACGCTGA GCTTAATGAG GCGTATGATG CAGCAATGGC GACGTTCGAT CAGGCAAAAC GACGCGAATT GATGCAGAAA GTGCAGCGCA TCGTCATTAA TGAAGCGCCA CACATCTATC TCTACCAGGA CTACCGTTTC TCGGCGGTTC GGAAGCGGGT GCAGAATTAC TACGTCGCCT TCACGACGTT CCGCCCCGCG CTGCGCGAAA TCTTTGTGAC CGCGTAA
|
Protein sequence | MNDRCSDQGE QKVRRIDRRA FLRLVAAGGG ALALQACGGG APPAAAPTTA PAAPTVAPAA PTAAPAAPTA APAAPTAAPV ASNARGGKVT WAMLGDPVSL EPYGINITGQ YNYEAREPMY DSLLVWDRDL KVQPSLAESF ETPDDTTYIF NLRSGVKFHN GRELTAEDVK FSLDTIINPP DGRNPGAAFF ANFDTIEVVD PLTIRINLKK IDPTIPGLFA WSRYTNIFPT DMPSQINPVT QAIGTGPFRL VSYTPNAEIV YERFSDHWNP EQPNIDQLIY RVIPEEDARI AALRSGDIDG TDVTPLGARR LQNDSDITIL KGLYSQPKVL QFTLKGGKPW DIKEVRQAIS LTIDRQELID KVMEGEAELT GPVVPGYGDW PLSQDELRAA YQVDVEKARQ LMAQAGYADG FKVTAMTFAN YSNDNAIIVQ EQLRQLNIDM QIEQIEFGTF AQRVTNGEFE WCFTARGMRA DVSGYLNDFR RLGIAEKNWF PAWENAELNE AYDAAMATFD QAKRRELMQK VQRIVINEAP HIYLYQDYRF SAVRKRVQNY YVAFTTFRPA LREIFVTA
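The sequences above are displayed in space-separated blocks of ten characters, so the blocks have to be joined before any computation. The following is a minimal sketch of that clean-up step together with a GC fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above (illustration only).
example = "ATGAACGACC GATGCTCTGA"
seq = join_blocks(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Applying the same two functions to the full Gene sequence field would give a value to compare with the GC content field of the record.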