Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_1015 |
Symbol | |
ID | 4081703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 1044845 |
End bp | 1047895 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638009375 |
Product | TonB-dependent receptor |
Protein accession | YP_616065 |
Protein GI | 103486504 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.501659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTTT CGCATGAAAT TCGGAATAAG GGCGCTCGCC GGTTGCGTGC GCGTTCGCTC GCGGCTGTGC TGGCATGGAG CAGCGCGGCT GCGGCCATGG CCGTCGCCAT CGCCACCCCC GCCCATGCCC AGGTTTCGAA CGCTTCGCTG CGCGGCACGG TAAAGGCCGA AGGCGGCGTG AGCCAGGTGA CCGCCATCAA CGTGAACACC GGTTTGACAC GCAGCGTCGC GGTCGGCGAA AACGGCAGCT ATAATATCGC CTCGCTTCCC GTCGGCACCT ATCGTCTTGA ACTCACCACG CCGGGCGGCG TGCGCCGCAC CGACGAATTC ACCCTGTCGG TCGGGCAGAG CGCCGTGCTC GACTTCGATT TCTCGCAGCC CGACATCGCT TCGGATGACG GCGCGATCAT CGTCACCGGC ACGCGCCTCC GTTCGATGGA AGGCGGCGAG GTGGGCACCA ATATCAGCCA GCGCCAGATC GAGGTGCTGC CGCAGAACAA TCGCAACTTC CTCGCCTTTG CCGACCTGGC TCCGGGCGTG CAGTTCGTGA CCGCTGGCAA CGATCAGTCG CGTCTGCAGG GCGGTGCGCA GAACAGCAGC ACCGTGAACG TCTTTATCGA TGGCGTCGGG CAGAAGGATT TCGTGCTCAA GAACGGCATT TCGGGTCAGG ATTCGACCCA GGGCAACCCC TTCCCGCAGC TTGCGGTCGG CGAATATCGC GTTATTTCAT CGAACTACAA GGCCGAGTTT GATCAGGTAA GCTCGGTTGC GATCACCGCC GTGACGCGGT CGGGGACCAA CGAGTTTCAT GGCGAAGCCT TCATCGACTA TACCGATCAG GGATTGCGCG ACCGTCGCCC CAACGAACTC ACCGGGACCA AGATCAAGAC CAAGGATTTT CAGTTCGGTG GTGCGTTGGG TGGGCCGATC ATCAAGGACA TGCTGCACTT CTTCGTCACT TATGAAGGAA AGCGCCAGGA GAATCCGCGC GACATCCGCC CTGGCTTCAA CCTGCCGCTC GACTTTTTCC CGGCCGAATA TCGGGGCGTT TTCGGTCCGA CGAATGCGAC GTTCAACGAA GATCTCTATT TCGGCAAGCT CAGCTTTCAG CCGACGTCGA GCGATCTGAT CGAATTGTCG GGACGGCACC GGCGCGAGAG CGGCGAGTTT CTGAGCAGCG GGATCAATGC GCGTGAGACG ATCAGCGCGC AAAAAGTGAT CGAATATCGC GGCACGGCGC GCTGGGAGCA CACCGCCGAC AACTGGATCA ACGATCTGAA ACTCACCTAT GAGGATGTGC GCTGGGCGCC GACCCCGGTC GTTTTCGGCA ATGGCAGCCT GTTCGCTTAT GCCGCACCGA ATGAGAGCAA CCCCGCGGTC ATCGACCGCG CCGATATTCT GCGCATCGGG GGCGGCGCCA ATTATCAGGA CAAGGGCCAG AAAGGTTGGG GCATCCAGAA TGATTTCACC TGGACCGGAT TCGAAGGACA TACGATCAAG TTCGGGGTCA AGTCGAAATG GGTCGAGCTG AACACGCTTC AACTCAACAA TTTCAACCCG CTCTATACCT ATAACGTCGC CTTCAATCCC AACGGCGGCA CATTCAATGA CGAAATCCCC TATCGGCTCC AGTTCGGGGC GCAGACCGGA ACCGGCAACC CCATCGTCAG CTCGGACAAC TGGCAGTTCG GCATCTATCT TCAGGACGAC TGGGAGGTCA CCGATCGCCT GACCCTCAAT CTTGGTGTCC GGTGGGATTA TGAGCGGACG CCGTCCTATC TCGATTTCGT CCATCCCGCC GATGCGGTGA ACGCCGTTTC GCCCGCCAAT TACCCCAATC TGGTCAACGC CGACTATGAC ATCAACGATT TCATTTCGAC CGGATCGGAG CGCAAGGCGT TCAAGGGTGC CTGGCAGCCG CGGATCGGTT TCAGCTATGA GCTCGACGAC GACGCGCGCT TTGTGCTCTT CGGCGGCTTC GGCCGCTCCT ATGACCGCAA CCAGTTCGAT TTCCTCCAGC AGGAAATCAG CGTCGGCTCC TTTGCAACGC GCACCTTCAA CTTCAACACC GGCGATCCTT TTAACATCTG CGCACCGGGG CCCACCTGCG TCACCTGGGA TCCCATCTAT CTGACCGAAG CCGGCCGCCA GCAGTTGCTG GCGCAGGCGG GACCCGGCGG TGGCCGCGAA CTGCGTTTCA TAACCAACAA GCTGAAAGTT CCCTATTCGG ATCAGTTCAG CCTGGGCCTC CGCGCCCGCG TGACGCCGCT CTTCGAAGCC GAAGTCGGTT ACAGTCATGT CGAGAGCAAG GACGGTTTTG CCTACCTGCT CGGCAACCGG CGCCCCGATG GCAGCTTCTT CCCGCCGGCA CCCGCCGCGC CGAGTTCGCC CTTCGGCTTT GCGCCGCCGG GTTTCGGGTC GATCATCATC GGCACCAACG GGCTCGAAAC GCGCGCCGAC ACGGGCTATC TCAAGCTGGT GAAAAACTAC ACGGCGGCAT CACCGTGGAG CCTTGCGGCG ACCTATACCT ACACCGAGGC TGAGGAAAAC AGGAACTTCG GTGAGGTCTT CAGTCTCGAT TTCCCGTCAA TCGAAGATTA TAATTTTGCG CGGTCGGCGG GGGTGCGCAA ACATCGCTTC GTCGCCGCGG GGTCGGTCGA CCTGCCGATC GGGGTGACGC TGTCGGGCAA GTTCACTTTG GCGTCGCCGC CCTATCTCAA GGCGTTCGTA AATACCGGCG GGGACAATCC GTCGCGCACG GTCATCTCGA ACGAGGCGAA GGGTAATGGC GACCGCTGGG GCCTGCGCCA GTTCGACCTT GCGATCATCA AATATATCCC GTTCCGCTTC ATCAGCGACG AATCGCGGCT TCGCCTGCGT CTCGACATCA TCAACCTGTT CAACGACCGC AACTTTGTGG ATTACAACAA TAATCCGGCG GACAATACGC GAACGCCGTC CAGCCCCACC ATCTATCGCG AGATTTCGGG GATCGGCGTT GGCGGCAACC CGCCGCGTAC GGTAAAGGTC TCGGCCGGTT TCTCTTTCTG A
|
Protein sequence | MTFSHEIRNK GARRLRARSL AAVLAWSSAA AAMAVAIATP AHAQVSNASL RGTVKAEGGV SQVTAINVNT GLTRSVAVGE NGSYNIASLP VGTYRLELTT PGGVRRTDEF TLSVGQSAVL DFDFSQPDIA SDDGAIIVTG TRLRSMEGGE VGTNISQRQI EVLPQNNRNF LAFADLAPGV QFVTAGNDQS RLQGGAQNSS TVNVFIDGVG QKDFVLKNGI SGQDSTQGNP FPQLAVGEYR VISSNYKAEF DQVSSVAITA VTRSGTNEFH GEAFIDYTDQ GLRDRRPNEL TGTKIKTKDF QFGGALGGPI IKDMLHFFVT YEGKRQENPR DIRPGFNLPL DFFPAEYRGV FGPTNATFNE DLYFGKLSFQ PTSSDLIELS GRHRRESGEF LSSGINARET ISAQKVIEYR GTARWEHTAD NWINDLKLTY EDVRWAPTPV VFGNGSLFAY AAPNESNPAV IDRADILRIG GGANYQDKGQ KGWGIQNDFT WTGFEGHTIK FGVKSKWVEL NTLQLNNFNP LYTYNVAFNP NGGTFNDEIP YRLQFGAQTG TGNPIVSSDN WQFGIYLQDD WEVTDRLTLN LGVRWDYERT PSYLDFVHPA DAVNAVSPAN YPNLVNADYD INDFISTGSE RKAFKGAWQP RIGFSYELDD DARFVLFGGF GRSYDRNQFD FLQQEISVGS FATRTFNFNT GDPFNICAPG PTCVTWDPIY LTEAGRQQLL AQAGPGGGRE LRFITNKLKV PYSDQFSLGL RARVTPLFEA EVGYSHVESK DGFAYLLGNR RPDGSFFPPA PAAPSSPFGF APPGFGSIII GTNGLETRAD TGYLKLVKNY TAASPWSLAA TYTYTEAEEN RNFGEVFSLD FPSIEDYNFA RSAGVRKHRF VAAGSVDLPI GVTLSGKFTL ASPPYLKAFV NTGGDNPSRT VISNEAKGNG DRWGLRQFDL AIIKYIPFRF ISDESRLRLR LDIINLFNDR NFVDYNNNPA DNTRTPSSPT IYREISGIGV GGNPPRTVKV SAGFSF
|
| |