Gene Information |
Locus tag | Rcas_1058 |
Symbol | |
ID | 5538524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1373498 |
End bp | 1375459 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893194 |
Product | ABC-type dipeptide transport system periplasmic component-like protein |
Protein accession | YP_001431177 |
Protein GI | 156741048 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
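The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon which is not counted in the protein length. The record states neither convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1373498
end_bp = 1375459
gene_length_bp = 1962
protein_length_aa = 653


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions the fields agree: 1375459 - 1373498 + 1 = 1962 bp, and 1962 / 3 = 654 codons, which is 653 residues plus one stop codon.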
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGATTC TCGACAGGAA ACTTGCGCTG GCGATGCTGC TGGCGTTGAT AGTTCCAATT CTGGCAGCCT GTGGCGGCGG CGCGGCACAG GCGCCGGTCC GCGAAACGGT CGTGGTGACC GCCGAGCCAA TCCGCGAAAC CGTGGTGGTC CGCGAGACGG TGGTGGCGGA AGCTCCTACG GCGGCGCCAG CAGGGGCATT CACCACACCG CACCCGATTC TTGGCGACGT GCGCGTGCGT CAGGCGATTG CCTACTGCAC CAACCGACCG GAACTGATCC AGTCGGTCTA TAGCTACCTG ACGCCGGAGC AGCAGCAGGG GCTGCTGATG GACACCAACC TGCCCAAGAC GCACTGGGCG GCGGCGAGCG AGGCGGACGG CATCGTGACC TATCCGTTTG ACCCGGAGAA AGGCAAGGCG CTGCTCGATG AGGCGGGCTG GAAATTGGCA GAGGGCGCGA CTGTGCGCTC GAAAGATGGC GTGCCGCTGT CGCTCGAGTT CACGACGACC AACGCACAGT TCCGTATCAC CTGGGCGACG GTGCTGGAAA AACAGTTGCT GACCAACTGC GGCATCCAGA TCATCCGCAA GCACGCCCCG GCATCCTGGT GGTTTGGTGG CGCCTCCGGT CTGCGTCGGC GCGATTTTGA GCTTGGCGCG TTCGCATGGG TCGGCGAAGC GGATCCGGGC GGCCGCACGC TCTATGCCTG CGACCAGATT CCGCTGCCAG AAAACAACTG GAATGGCCAG AACTACATGG GTTGGTGCAA TGAGACCGCC AGCAAGGCGA TCACTGCAGC CAACAATACG CTGAACCGCG AGGAGCGCAT CAAGAACTAC AAGGCGTTCC AGGTTGAGTT CACGAAGGAT ATGGTCAGCC TGCCGTTGTT CCAGCGCCTG GAGGCGTATG CCTGGAATAA GGCGCTGAAG GGGTTGAAGC CTGACCCGAC CGAGTATATC ACGGCCAACG CCTACGAGTG GGAGCGCAGC GATAATGGCG ACACCATTAT TCTGGGTTTC ACCCAGGAGC CGGCTTCGAT GTTTACGCTG GTCGAGAGCG CGGCGGTGCA GCGCCAGGCG GCGCAACTCG TCGAGGGCGT GTTGACGACC CAGTATAGCT ACGACTTCCA GGCGGTGCTG CAGGATGGTC TCTCCACCAT TGAGAGCGGC AAGGCGAAGA ATGAGGTCGT TGAGGTCAAA GAAGGCGATA AGGTCTGGGA TGTCACCGGC GCCGCAGTCG AACTCAAGCC GGGAGTGGAA GTCGTCAACT CCGATGGCGA GACGGTCAAG TATGAGAGCG GCACGGTCAA GATGAACCAG CTCACCGTCA CCTACGACCT CATCAAGGGG ATCAAGTGGT CCGACGGCGA GCCGCTGAAG AAGGCGGACC TGGAACTGGC GGTGAAGATT GTCTGCGATC CTGAGTCGGG CAACGTCAGT CTTACCTTCT GCGAGTCGCA CGACAATGCC AACGGTGTCA CCTTCAACAG CGACACCAGC TACACGATTA AGTTCCTACC CGGTGTCCAG TGGCCAATCT ACTTCACGGC GCCGTATGGC GGCTACCCCT CGCATGTGAC CATTTCGGAC GGGCGGAAGC TGGCGGATGT GCCGGCCAAG GAGTGGGCGA CGCTGCCGGA GGTTGCTGAG TTGCCGCTGG GGTATGGTCC CTACATTCTG AAGGAGTGGA ACAAGGGCCA GAATATGAAG TTCGAGGCGA ACCCGAACTT CGTCCTCGGC GCTCCGAAGG TCAAGAATAT CGTGATCCAG TTCTATGCCG ACACCAACGC TGCCGTGGCG 
GCGCTGCTGA CCGGTGAGGT CGATATTCTG GAGAAGGCGA CGCTCGGCGC CGGTCCTGAG GTGGAGACGG TGCTCAAGGC GGCGGCGGAG GGCAAGATTG AAGCCAAGAC CGATGCCAGC CCAACCTGGG AGCACATGGA TATGAACCTG TTTGTCCGGT AG
|
Protein sequence | MKILDRKLAL AMLLALIVPI LAACGGGAAQ APVRETVVVT AEPIRETVVV RETVVAEAPT AAPAGAFTTP HPILGDVRVR QAIAYCTNRP ELIQSVYSYL TPEQQQGLLM DTNLPKTHWA AASEADGIVT YPFDPEKGKA LLDEAGWKLA EGATVRSKDG VPLSLEFTTT NAQFRITWAT VLEKQLLTNC GIQIIRKHAP ASWWFGGASG LRRRDFELGA FAWVGEADPG GRTLYACDQI PLPENNWNGQ NYMGWCNETA SKAITAANNT LNREERIKNY KAFQVEFTKD MVSLPLFQRL EAYAWNKALK GLKPDPTEYI TANAYEWERS DNGDTIILGF TQEPASMFTL VESAAVQRQA AQLVEGVLTT QYSYDFQAVL QDGLSTIESG KAKNEVVEVK EGDKVWDVTG AAVELKPGVE VVNSDGETVK YESGTVKMNQ LTVTYDLIKG IKWSDGEPLK KADLELAVKI VCDPESGNVS LTFCESHDNA NGVTFNSDTS YTIKFLPGVQ WPIYFTAPYG GYPSHVTISD GRKLADVPAK EWATLPEVAE LPLGYGPYIL KEWNKGQNMK FEANPNFVLG APKVKNIVIQ FYADTNAAVA ALLTGEVDIL EKATLGAGPE VETVLKAAAE GKIEAKTDAS PTWEHMDMNL FVR
|
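The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so it does not recompute the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display, then upper-case."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence, used as a short example.
example = "ATGAAGATTC TCGACAGGAA ACTTGCGCTG"
cleaned = clean_sequence(example)
print(len(cleaned), cleaned[:3])
print(round(gc_percent(cleaned), 1))
```

Applied to the full gene sequence, the cleaned length would be expected to match the Gene Length field, and the GC percentage could be compared with the reported value after rounding.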