Gene Information |
Locus tag | Rcas_3845 |
Symbol | |
ID | 5541349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5026004 |
End bp | 5027698 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640895955 |
Product | extracellular solute-binding protein |
Protein accession | YP_001433900 |
Protein GI | 156743771 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
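The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5026004
END_BP = 5027698


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 1695 564
```

Both results match the Gene Length (1695 bp) and Protein Length (564 aa) fields listed in the record.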
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.267838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0712114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGC GCGATTTATT TGTCCGTCCA TCATCGATTC TGATGGTCCT GCTGATCGCT GGCTCGTTGA TTCTGGCTGC ATGCGGCGCA ACGCCGCAGC AGAGCGCTCC TGTTCCCACT CAGGCGGGCG CCGCACCGCC GCCGACATCG GTTCCTGCCA CCCCTGCCGA CGCCAGACCA CAGGCGACCG GTCGGACGCG CCTGGTTTTC TTCGGCAATC TGGAACCAGC CAGTATCGAT GTCATCACAA TGTCGGGCAA CGTCAATGAA CGTCAGGTCG CCGCACAGAT TTTCGAAACG CTGGTGTACA TGGATCAGAG CCAGCAGATA TACCCCGGTC TGGCGCGCGA ATGGAGCCGT TCCGATGATG CGACGACGTG GACGTTCAAG TTGGTTTCGG GAGTAAAATG GCACGACGGC ACGCCGTTCA CGGCTCAATC GGTGGCGGAC TATTTCGACT ACATTCGCGA TAATCCCGTC GGAAGCGGAC CTGGATCGCT CAAAGCCGTA ATCGGCAGCG CCAAAGCAGT CGATGAAACG ACGGTCGAAC TCACGCTCAC TGCGCCGCGT CCCGATTTGC TGATTGAGCT TGCGGATCCG GGGATGGGCA TTGGAAATGC CGCCTACCTG AAACGGGTGG GCGCCGATGC CGGTTTCAAT CCGGTGGGAA CCGGTCCATT CAAGTTCAAG GAATGGGTGC GCGGCAGCCA GATCGTGCTG GAGCGCAACC CTGAATGGAC CTGGGGGTCA CCCTTGTTCA AGATGAGCGG TCCTCCTTTG ATCGAAGAAG TGGTCTTTCG CTTTTCTTCT GAAGCGCAAA CACGACTTGC AGCGCTGGAA GCCGGCGAAG TCGATTTCGT TGACCTTCTG CCATTCCAGG ATGTCGTGCG TGTCCGCACC GATCCGCGCT TTACCGTCAC CGGCGTGCTG CTACCGGGCA TGCCGCAAAT GAACTATCTC AACACCAGCC TGGCGCCGAC CGATGACATA AATGTGCGCA AAGCGATTAT CTATGCCACG GACAAGCAAG GTATCATCGA GAGCGTTTAT TTCAACATGG TCGAACCGGC ATACGGACCG CTGTCGCGCG TCTTCCCAGA GTACGAACCG GCGCTGGAGC AGATGTACGA GTACAATCCT GAGAAAGCCG CGCAATTGCT CGAAGAAGCC GGCTGGCTTC CCGGTCCCGA CGGCGTTCGT GTCAAGGACG GTCGGCGCCT TGAAGTGACG ATCGTTGAGA ATAAAGGCTG GAACGATTGG GTCTATGTGC TGCAAGCCAA TCTCCAGGCT ATCGGCTTTG ACGCAAAAGT GCTCACCACC CAGGGCCCGT CGAATACGGA AGCGATTGCC AGCGGCAAAT ACCACGTTCC GGCTATGGGA GACGTTTTCG CTTCTGCCAG CCAGATGACC CGTGACTGGC ACTCAGAGGG GTACGGAACC TTTCCTTCCG GTCATTTTCT GAAAGGCGAA GACGGCGCCA GATTGGACAG GATGCTGAAA GAGGCGGAGA CGGAGATCGA CCCGGAAAAA CGCATCGAAA AGTATCGTGA GATTCAGAAG TTCATCATGG AGCAAGCGTT GATGGTGCCG ATCTTCGAAC TGTACTTCTA TGTCGCCCAC GCGAACAATC TGAAAGGCTT CGTCGTCGAT GGCACGGGTT TCTACAAGTA CTTTGCGCCA GCGTACTTTG AGTAA
|
Protein sequence | MALRDLFVRP SSILMVLLIA GSLILAACGA TPQQSAPVPT QAGAAPPPTS VPATPADARP QATGRTRLVF FGNLEPASID VITMSGNVNE RQVAAQIFET LVYMDQSQQI YPGLAREWSR SDDATTWTFK LVSGVKWHDG TPFTAQSVAD YFDYIRDNPV GSGPGSLKAV IGSAKAVDET TVELTLTAPR PDLLIELADP GMGIGNAAYL KRVGADAGFN PVGTGPFKFK EWVRGSQIVL ERNPEWTWGS PLFKMSGPPL IEEVVFRFSS EAQTRLAALE AGEVDFVDLL PFQDVVRVRT DPRFTVTGVL LPGMPQMNYL NTSLAPTDDI NVRKAIIYAT DKQGIIESVY FNMVEPAYGP LSRVFPEYEP ALEQMYEYNP EKAAQLLEEA GWLPGPDGVR VKDGRRLEVT IVENKGWNDW VYVLQANLQA IGFDAKVLTT QGPSNTEAIA SGKYHVPAMG DVFASASQMT RDWHSEGYGT FPSGHFLKGE DGARLDRMLK EAETEIDPEK RIEKYREIQK FIMEQALMVP IFELYFYVAH ANNLKGFVVD GTGFYKYFAP AYFE
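The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC percentage could be computed; the helper names `strip_blocks` and `gc_percent` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def strip_blocks(seq: str) -> str:
    """Join space-separated sequence blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    clean = strip_blocks(seq)
    if not clean:
        raise ValueError("empty sequence")
    gc = clean.count("G") + clean.count("C")
    return 100.0 * gc / len(clean)


# First two blocks of the gene sequence listed above (illustration only).
prefix = "ATGGCGCTGC GCGATTTATT"
print(len(strip_blocks(prefix)))  # 20
print(gc_percent(prefix))         # 50.0
```

Applied to the full gene sequence, the same function could be compared against the GC content field of the record, and `strip_blocks` could be used to confirm that the sequence begins with `ATG` and ends with the stop codon `TAA`.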