Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_1076 |
Symbol | |
ID | 5196801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 1202508 |
End bp | 1204124 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640580621 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_001261580 |
Protein GI | 148553998 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.231006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG TCAGGCGCTT GTCGCTGCTC GTATCGCTGT CGAGCCTGCT CGCCATGCCG GCCCTGGTAC AAGCGCAAGA TGCAGCATCT CCAGCGCTCT CCCCAGAGCC TTCCAGGTCG ACATGGCCTG CCGTCACCCG CCCGGCCGAC CGGGGGCCGC GCCACTATTC GGCGGTGCGC AGCGGCACCT TTGGCGGCAA GAGCCTGCGC TATCGGGCCG TGCTGTCCGA GATGCTGGTC AGGGACCGCG CGGGCAAGCC GGCATCGAGC CTGTTCGTCA CCGCCTTCGT CGCCGGAATC GACCAGAGGC TCGCGGCCCA GCGACCGGTC ATCTTCATCT TCAATGGCGG GCCAGGCGGC TCCTCCAACA CATTGATGTT CGGCGCCATG GGGCCTTCAC GGTTGCAGGC GTTCGACGTG GCCGCGATCG GCAATGCCAA GACACCGGTG GTGCCGAACG AGGATGCCGT CCTCGACATA GCCGATCTCG TCTTCATCGA CGCGCCTGAA ACGGGTTATG GGAGGCCCCT GCCGGGGAGC GACGAAAAGA CCTTTCGCTC GAACGACGGG GATTCCAATG CCTTCGCGCA GGTGATTCTG CGCTGGCTCA CCGACAATGG CCGCATGGCC TCGCCCGTTT ACATCATGGG CGAAAGCTAT GGGTCCATCC GCGCGGTGCT GCTCGCGCGC GATCTTCGAG TGGCGACGCC GCGTGTCGAG CCCGCAGGCC TGATCCTGGT ATCCCAGGCT CTGTGGTACA ATGGGCCAGA GACCGGCATG ACGGTGCTTC CCGATCCCGT CCGTGCGGTC AACAGCCTGC CCGATATTGC GGCGCTGGCC TGGCATCATG GTCTCATCGA CAACAGGACG CAGACGCTCG AGCAGGCGGT GCGGGCCGCC CAGACCTTTG CACTCCAGGA TTATGCCAAG ATACTGATCG CGGGAAACCG CGCTCCCGAA ACGGAGCGGG CCTGCGTTGC CGAGCGGCTC GCACAGTTGA CCGGCGTGCC GGCCTCGACC TGGCTCGCCG GCTCGTTGCG GCTGTCCAAT ATCCGACGGC AGATGCTTGC CGGCCGTAAC CTGGCGCTGG GACAGTTCGA CGGCCGCGAG ATCGAGCCGT TGCAGGGTAT CGTCGACGAT TCACATCGCG ACTTCAAGGC GATGATGGCT GGGCTTACCG CGGCAACAGG GCAGTTGCGT CGAGACCTGT TCCACGCCGA AGGACTGCCC GACTATCGTT CGACCGTCGA CAGTCCGCCG GCGTTCGAGG AGACCTGGAC GTTCAACAAG GCACCGATGC CCGGCACCGA GATCATCCTG CGCGAGCAGA TGGCGGCCAT GCCCAAGATG CGGTTGATGG TGACGCAAGG CGTCTTCGAC ACCACAACCA CCATGGGCGA GACCGATTAT CAGTTCGCCC AGATCGCCGC GCCGCAGGAC CGTACGACCT TCGCTTATTA TCCCGGCGGA CATATGCTCT ATTCCGAGGA TGAGGGGCGA CGCGCCTTCC TGAGCGACGT TCGAACCTTC ATCAGCGGCC AGGTGCTGGC TCGGAGGCCC TTCCCGCACC CGGCACCAGG CCAGATCGGC GGGCGCAAGC AGGCGACCGG GCAATGA
|
Protein sequence | MKIVRRLSLL VSLSSLLAMP ALVQAQDAAS PALSPEPSRS TWPAVTRPAD RGPRHYSAVR SGTFGGKSLR YRAVLSEMLV RDRAGKPASS LFVTAFVAGI DQRLAAQRPV IFIFNGGPGG SSNTLMFGAM GPSRLQAFDV AAIGNAKTPV VPNEDAVLDI ADLVFIDAPE TGYGRPLPGS DEKTFRSNDG DSNAFAQVIL RWLTDNGRMA SPVYIMGESY GSIRAVLLAR DLRVATPRVE PAGLILVSQA LWYNGPETGM TVLPDPVRAV NSLPDIAALA WHHGLIDNRT QTLEQAVRAA QTFALQDYAK ILIAGNRAPE TERACVAERL AQLTGVPAST WLAGSLRLSN IRRQMLAGRN LALGQFDGRE IEPLQGIVDD SHRDFKAMMA GLTAATGQLR RDLFHAEGLP DYRSTVDSPP AFEETWTFNK APMPGTEIIL REQMAAMPKM RLMVTQGVFD TTTTMGETDY QFAQIAAPQD RTTFAYYPGG HMLYSEDEGR RAFLSDVRTF ISGQVLARRP FPHPAPGQIG GRKQATGQ
|
| |