Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2424 |
Symbol | |
ID | 5539905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3117599 |
End bp | 3119785 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894554 |
Product | extracellular solute-binding protein |
Protein accession | YP_001432522 |
Protein GI | 156742393 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.935396 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAACG CTCGAAAGCT CAACCGGCGC ACATTCCTGC GCCTGTCCGC TGTGACGGCG GCAAGCGCAG CGATTGCCGC CTGTGGCGGC GGCGGTCAGC CTGCCACCGC ACCGACGACA GCGCCCGCCG CGCCTCAACC GACCACAGCG CCAGCGGCGC CCGCTCCAAC CACTCCTCCA GCCGCAACAG CTGTTCCCCC GGTGACGACC CAATACAAAG AAGCGCCGAT GCTGGCAAAA CTGGTGCAGG AAGGTAAACT GCCGCCGGTG GACGAGCGCC TGCCGAAGAA CCCCTACACT CCGCCACACT CCTGGCTCAC AGTTGGCAAG TACGGCGGCG TGCTGAAGAA GACCTACAAC AACAACTGGG GTATCACCGG CTTCATCCAC GAGATGCAGT ATGGCTCGTC GCCGCTGCGC TGGCTTAAGG ATGGTCTGGC GATTGGTCCA GGCTTCGTCG AAAGCTGGGA ATCGAATGCC GACGCGAGCA AGTGGACGTT CAAGATCCGC GAAGGGATCA AGTGGAGTGA CGGTCAACCC TTCACCACCA AAGACATTAT GTACTGGTGG GAGTACACGG TCGGCGGCAA CGGCAAGGAG AAGGAGTACC CCGCCGGTCT CAAGCCGATC AACAGCCCGC CTGACGAGGC GCGCTCCGGC ACCGGCACCC TGATGACACT CAATGCGCCG GATGATTACA CCTTCGAGAT GGTGTTCGAC GCGCCGGCGC CGCTCACGGC GGACCGTCTG GCAATGTGGG TCAACATGTT CATCGGTCCG GCCTGGGTTA TGCCACGTCA CTACATGGAG CAGTTCAACC CGGTGCTCAA CCCCGATAAG TATAAGGACT GGGAAGAGCA TCAGCGCAAG TTCAACCACA ACAACCCCGA CTGCCCGCGC CTGACCGGTT GGAAACTGGA TATCTTCGAG GAGGGCGTCC GCGCTGTCTG GTCGCGCAAC CCGTACTACT GGGCGGTCGA TAAGGAAGGC AACCAGTTGC CGTATATCGA CCAGATCATT GTGACGGCGG TCAAGGACAA GGAGATCGAG AAACTGGCGT ACACCGAGGG GCGCGCCGAT CATGCGCACT TCCACAGCCA GGGTCTGGCG GATGTGCAGT CGTTGCGCGA CGCCGAAGCC AAGAGCGGTC TCGAGGTGCG CTTCTGGGAC TCGGGTTCGG GCACCGGTTC GCTCTACTTC TTCAACATGG ACTTCAAGGA CCCGAAGATG CGCGCCGTGT TCCGCGATCC GAAGTTCCGC CAGGCGCTGT CACACGCTTA CAATCGCGCC GATGTGCAGA AAGCGGTCTA TTTCGGGTTG GGCGAACTGA CGACCGGCAC CTTCAGCCCC AAGGCCATCG AGTACAACAT CAACGATCAG GGCAAGCAGG TGTATGCCGC CTGGCGCGAT AGCTACGTGA AGTACGATCC GGCGCTGGCT GAACAGATTC TGGACGAAGC CGGCTACAAG AAGGGACCTG ATGGCAAGCG CACGATGCCG GACGGCAGTC CGCTTCAGAT TCAGATCACC TATGGCGCCG ATCAGGCGCC CGGCGGTGAG CACCTGTCGA AGAACGAGCG CCTGGCGCGC GACTGGCAGG CGATCGGGAT CGATGCGGTG CTGACACCTA TTCCGGGTGA GGGCGCCGAC GAGAAGTGGC GCGCCGGCGA GTTGCCGATG AAGACCACCT GGGAGGTCGG CGACGGTCCC AACCACCTGG TCTTCCCCTC CTGGCTGGTG GCGGATGAGA CCGAGCGCTG GGCGCCGCTG CACGGTCGCG GGTATACGCT GCGCGGCACT GCGTCGGAGA AGGAGGAACT GGATAAGAAC CCATGGGACC GCAACCCGCC GCGCATCAAT CGTGGCGAGC CGGACTATAT GCCGGCGATT GGCAAGCTTC ACGAACTGTT CGACAAGAGC AAGGTGGAGC CAGATGCGAT GAAGCGCCAC CAACTCGTGT GGGATATGAT CAAAGTCCAC ATCGAGGAGG GTCCGTTCTT TACCGGGACG ATCGCAAACC CGCCGCGCAT CATTTTGGTG AAGAAGGGGT TGATGAACGT GCCGACCCGC GATGACCTTT TGAAGGAAGG GTTGGGCGGT TTCGTCAATC CGTGGATCAT CCCCTCTCCG GCGACCTATG ACCCGGAGAC CTGGTACTGG GATAATCCTG AGGCGCATAC GGCGTAG
|
Protein sequence | MSNARKLNRR TFLRLSAVTA ASAAIAACGG GGQPATAPTT APAAPQPTTA PAAPAPTTPP AATAVPPVTT QYKEAPMLAK LVQEGKLPPV DERLPKNPYT PPHSWLTVGK YGGVLKKTYN NNWGITGFIH EMQYGSSPLR WLKDGLAIGP GFVESWESNA DASKWTFKIR EGIKWSDGQP FTTKDIMYWW EYTVGGNGKE KEYPAGLKPI NSPPDEARSG TGTLMTLNAP DDYTFEMVFD APAPLTADRL AMWVNMFIGP AWVMPRHYME QFNPVLNPDK YKDWEEHQRK FNHNNPDCPR LTGWKLDIFE EGVRAVWSRN PYYWAVDKEG NQLPYIDQII VTAVKDKEIE KLAYTEGRAD HAHFHSQGLA DVQSLRDAEA KSGLEVRFWD SGSGTGSLYF FNMDFKDPKM RAVFRDPKFR QALSHAYNRA DVQKAVYFGL GELTTGTFSP KAIEYNINDQ GKQVYAAWRD SYVKYDPALA EQILDEAGYK KGPDGKRTMP DGSPLQIQIT YGADQAPGGE HLSKNERLAR DWQAIGIDAV LTPIPGEGAD EKWRAGELPM KTTWEVGDGP NHLVFPSWLV ADETERWAPL HGRGYTLRGT ASEKEELDKN PWDRNPPRIN RGEPDYMPAI GKLHELFDKS KVEPDAMKRH QLVWDMIKVH IEEGPFFTGT IANPPRIILV KKGLMNVPTR DDLLKEGLGG FVNPWIIPSP ATYDPETWYW DNPEAHTA
|
| |