Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_2313 |
Symbol | |
ID | 5200134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 2573379 |
End bp | 2574440 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640581859 |
Product | amidohydrolase 2 |
Protein accession | YP_001262810 |
Protein GI | 148555228 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0234215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCT GCCCCCGCTG CCGGATGGCG AGACGATCCT TCCTCAAGGC CGCGGCGGGC GGGGCGGCGG CCGCGCTCGC GGCGGGCCTG CCCGGATCGG CGTTGCTTGC CGCCGGCGGC ACCGCCGGGG CGATCGACTT CCACACCCAC ATGATCGATC CCGACCTGCC CAACCTGATC ACCGGGCAGA TGGTGAGCAC CGATCTCGCC GCCTGGATGC AGGGCTTCGC CTCGCCCGAC GTCCATGTCG CGCACATGGA CAAATATGGC GTGCAGGCGC ACGTCGTCGG TCATTCCAAC GCCACCCAGG GGATCAGCTG GGGTGACGCC CGGCACGACC TGGGCGTCCA CCAGCGGGTC AACGACCGGA TCGCGCGCGA ATGGGTGAAG GCCCATCCCG GCCGCTTCCA CGGCGCCTTC GGCCTGCCGA CCCAGGACCT CAAGCTCGCC ATTCCCGAGC TCGAGCGCGC GGTGACGCAA CTCGGCATGA AGGTGCTGCA GCTCTCCTCG CAATCGCCCG ACGGCGCCTA TTTCGGCGAT CCGCGCTTCG ACCCGCTGTG GGAGGCGGTC CAGCATTTCG GCGTCACCGT GTTCATCCAT CCGCACGGCC AGGGCAAGGA GCCGCCGCTC GACGGCTTCG CGCTGGCCAA TTCGGTCGGG CAGGGGGTCG AGGAGGTCAA GGTGATGACC TCGATCATCT ACAATGGCGT GTTCGACAAA TTCCCGCGCG CCAAGATCGT CATCGCGCAT GGCGGCGGTT TCCTGCCGCA TTATTACGGC CGGCTCGACC GCAACGCGCA CGAGCGCCCC GATACCAGCC GCAACATCAG CAAGCTGCCG AGCGCCTATC TCAAGGACTT CTACTACGAC AGCTGCGTCT ACGGGCCCGA GATCCTCGCC GCGCTGATCC GGGTGGTCGG CGTCGACCGG ATCGTGCTCG GCAGCGACTT CCCGGTGGGC GAGGCCGACG GCCTGTCGGC GCTGCGGGCG ACGCCCAACC TCTCGGCCGC CGACGTGACC CGGATCGCGC GGACCACGCC CGCCGCTCTG CTCGGCCTCT GA
|
Protein sequence | MMSCPRCRMA RRSFLKAAAG GAAAALAAGL PGSALLAAGG TAGAIDFHTH MIDPDLPNLI TGQMVSTDLA AWMQGFASPD VHVAHMDKYG VQAHVVGHSN ATQGISWGDA RHDLGVHQRV NDRIAREWVK AHPGRFHGAF GLPTQDLKLA IPELERAVTQ LGMKVLQLSS QSPDGAYFGD PRFDPLWEAV QHFGVTVFIH PHGQGKEPPL DGFALANSVG QGVEEVKVMT SIIYNGVFDK FPRAKIVIAH GGGFLPHYYG RLDRNAHERP DTSRNISKLP SAYLKDFYYD SCVYGPEILA ALIRVVGVDR IVLGSDFPVG EADGLSALRA TPNLSAADVT RIARTTPAAL LGL
|
| |