Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3119 |
Symbol | |
ID | 5199574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 3426449 |
End bp | 3428194 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640582667 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_001263606 |
Protein GI | 148556024 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.83327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGAA CCCGGCCGTC TTCGATCAGA TGCGCGGCGC TGTCCAGCCG TTCCGGCGAC GGCGACCGGA GGGGAATTGA CGGCCGGCGA TCGATCCCGG ATGGATCGGG ACTGGTTTTC CGCGCCCATG AGGTATCCAT GCGCCTTGCC CTGATCGCTG CCCTCGCCGC GCTGGCGACG ACTGCCCTGC CCGCCGCCCT TTCCGCCGAG ACGCCCGAGG AGAAGGACGC GGCGGCATCC GCCGCGCCCG CGCGCTTCCA GCCCGACGAG GTGCGATCGA CCGGATCGGT CGCCGTCGGC GGCGTGCGCA TCCCCTATCA GGCGGTGGCG GGTACGCTGG TCGTCCATCC CAAGGGCTGG GACGACGTCC CCCCGCGCGA CGAGGACGGC CGCAAGGCCG CGAAGGAGCA GGCCGAGGCG TCGATGTTCT ACGTCGCCTA TTTTAGGACC GGCGCCCCCG CAGCCGGACG GCCGATCACC TTCCTGTTCA ACGGCGGGCC GGGATCGTCG AGCATCTGGC TGCACATGGG CGCGTTCGGC CCGGTGCGGG TCGAGGCCGG CGACGCGGGC CAGGCGTCGG CGGCGCCCTA CCGCGTCGTC GCCAACGACA TGGCGCTGCT CGACGCGTCG GACCTCGTCT TCATCGACGC CCCCGGCACC GGCTTCAGCC GGGTCGCGGG GAAGGACAAG GACAAGGCCT TCTACGGCGT CGACCAGGAC ATCCACGCCT TCGCCCGCTT CATCGTCCAG TTCCTGTCGA AGCACGGCCG CTGGGCGTCG CCCAAATATC TGTTCGGCGA AAGCTATGGC ACGATGCGCG CCGCCGGCCT CGCCAGGGCG CTGCAGGACG AGGATGTCGA CCTGAACGGC GTGATCCTGC TGTCGGACAT ATTGAACTGG GACCTGATCC CCGATGATCC GCAGCTCAAT CCGGGGGTCG ACCTGCCCTA TGTCGTCTCG CTGCCGACCT ATGCGGCGAC CGCCTGGTAT CACCGTCGCG TCGCCAACCG GCCCGACGAC CTCGCAAGCT TCCTGGCCGA GGTCGAGCGC TTCGCCACCA CCGACTATGC ATTGGCGCTG ATGCAGGGCA ACGCGCTGCC GGCGGCCGAG CGGCAGGCGA TCGCGGAGAA GCTGTCGGGC TATACCGGGC TGCCGGCCGC CTATCTGCTG CGGAGCAACC TGCGGATCGA ATATGGCGCG CTGCAGAAGG AGCTGCTGCT GGACCGCGAC CTGACCACCG GCACGCTCGA CACCCGCTTC ACCGGCTGGA CGATCGACCC GCTGAGCAAG GTCGCCGGCT ATGACCCGCA AGGGTCGGCG ATCGGCGCGG CCTATGCCGG CGCGTTCAAC GACTATGTGC GCGGCACGCT GCGCTATGGC GAGGGGCGGC ACTACCAGAC CAGCCTCGAC GTCTATGGCA GCTGGGACTA CCGGCACCAG CCGCCGGGCG CCGACAAGCC GCTGATCGCG CTGCCCAACG TCCTGCCCGA CCTGGCGGTG GCGATGAAGC GCAACCCGAC GATGAAGGTG ATGGTCAACG GCGGCTATTT CGACGTGTCG ACGCCCTATT TCGCGGGACG CTACGAGCTG CGGCACCTGC CGGTCCCGGC CGCGCTGACC GGCAATATCG AGTATCGCTA TTATCCGTCC GGCCACATGG TCTATCTGCA CCGGCCGTCG CTGAAGGCGC TGCACGACAA TGTCGCCGAC TTCATCCGCC GGACCGACAA CGCCGCCGCG AAATAA
|
Protein sequence | MVGTRPSSIR CAALSSRSGD GDRRGIDGRR SIPDGSGLVF RAHEVSMRLA LIAALAALAT TALPAALSAE TPEEKDAAAS AAPARFQPDE VRSTGSVAVG GVRIPYQAVA GTLVVHPKGW DDVPPRDEDG RKAAKEQAEA SMFYVAYFRT GAPAAGRPIT FLFNGGPGSS SIWLHMGAFG PVRVEAGDAG QASAAPYRVV ANDMALLDAS DLVFIDAPGT GFSRVAGKDK DKAFYGVDQD IHAFARFIVQ FLSKHGRWAS PKYLFGESYG TMRAAGLARA LQDEDVDLNG VILLSDILNW DLIPDDPQLN PGVDLPYVVS LPTYAATAWY HRRVANRPDD LASFLAEVER FATTDYALAL MQGNALPAAE RQAIAEKLSG YTGLPAAYLL RSNLRIEYGA LQKELLLDRD LTTGTLDTRF TGWTIDPLSK VAGYDPQGSA IGAAYAGAFN DYVRGTLRYG EGRHYQTSLD VYGSWDYRHQ PPGADKPLIA LPNVLPDLAV AMKRNPTMKV MVNGGYFDVS TPYFAGRYEL RHLPVPAALT GNIEYRYYPS GHMVYLHRPS LKALHDNVAD FIRRTDNAAA K
|
| |