Gene Information |
Locus tag | Swit_3091 |
Symbol | |
ID | 5196988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3391484 |
End bp | 3393823 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640582639 |
Product | TonB-dependent receptor |
Protein accession | YP_001263578 |
Protein GI | 148555996 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
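The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene span includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Swit_3091.
length = gene_length_bp(3391484, 3393823)
print(length)                              # 2340
print(expected_protein_length_aa(length))  # 779
```

Under these assumptions the computed values match the reported Gene Length (2340 bp) and Protein Length (779 aa).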
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.308245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGT TCAAAGCGCT CGGTTCGACG GCCATCATTG GATGCATGGC CCTCGCTCTC ATAAGCCCGG CCCATGCGCA GGACGCCGCC GCTCCAGCCG CCGACGGCGA CGAGATCATC GTCACCGCCC AGGGGCGCGA GCAGAATCTT CAGGACGTGC CGGTCTCCGT ATCGGCCGCC ACCGGCATCA GCCTTCAGAA GCAGGCGATC ACGACGCTCG AGGCGCTCTC GGTGCGCCAG CCGAACTTCC GCATCTCGCA ATCGCCGGCG TCGGACTATG TCGTGATCCG CGGCATCGGC TCCTCGGCCA ATATCGGCTT CGAGCAGTCG GTCGGCACCT TCGTCGACGG CGTCTATCGT GGCCGCTCCC GTTCGACCCG CGCCGCCCTG TTCGACCTGG AGCGCGTCGA GATCCTCAAG GGTCCGCAGA CGACCTATTT CGGCAACAAC TCCATCGCCG GCGCGCTCAA CATCACCACC CGCAAGCCCG GTCGCGATCT CGCGGTGAAC GGCACCGCCT CCTATTTCCC GAACACCGGC GAATATCTCG TCGAGGGCGG GATCACCCTG CCGGTCACCG ACCGCCTTTC CCTGCGTCTC GCCGCCCGCC AGTCGGGCAT GGACGGCTAT ATCAAGAACA TCCAGACCGG GAAGGACGGG CCCCATCTCA ACGACAGGAT CGGTCGCGTC TCGATGGCCT GGGCGCCGAC GGACGGGATC GAGATCGACG CCCGCCTCGA TGTCGGGCGG ATGCGCGATA CCTCGGTGTT CAACGTCGAA CTGCTCGACT GCCCGACGTC CGCACCCTTC GCCGGTCCGG CGGGGCCGTG CGCCCGCTAT CTGAACGCCA GCGGAGGCTC GGTCGACAGC AAGCTCGATC GCGTGAGCGG CGCCAATCCC TCCTACTTCG ACTACGACAT GGTCGAAGGC GTGTGGGGGA TGAAGATCGC GGCGGGCGAC AACACGCTCA CCCTGACGAC CGGATATTTC CACCACAAAT ATCACCTGCT CAACGACCCG GTCCCCGTGC CCGGGACGCG CGGCGGCAGC GCCGTCGGCA CCACCACCGC GCTTCCGATC GCGCTGTTCG AGAAATATGA CCAGGTGAGC CAGGAGCTTC GCTTCGCCTC GCCCGAGGAC CGGACGATCA GCTACATGTT CGGCGCCTAT TACCAATATG GCAAGCTGAC GACCAACCTC ATCCAGGGCT TCTATTTCGC TCCGGTCGCC GCGGCCACGG GCGGCCTCAT CCCGCTGGCC ACGCCGGTGG CCGGCAGCAT CACCACCACC GAGCGCAGCG ACGTCTTCTC GGGCTTCGCC GCCGCCACCT GGCGCGCGAC CGACGCCCTG CGCGTCAATG TGGGGGGCCG CTTCTCGCTG GTCGACAAGC ATGATGCGCG CGCTACCCAG ATGGGGACGG CGGCCTCGAT CCCGTCGCTC GCCAATTTCG TGCCGTTCAC CGTCGCGCTT ACCCAGCTCT ACGCGGCGAG CGGCATCAAT CCCGGCAATT ACGCGGTGCC GAACCGCTCC GACCATGCGT TCCTGCCGAG CGCGAGCATC CAATATGATC TGTCGCGAAA CGCCATGGCC TACGTCTCCT ATGTCGAAGG CTTCAAGGCT GGCGGCTACT CGATCGGGAC GACCAATTCG AGCTTCGATC CCGAACGGGT GAAATCCTAC GAGCTCGGGA TCAAGGCCGA CCTTCTCGAT CGCCTGCTGA CGGTCAACCT GGCCGGCTTC TACAGCCGTT ACCGCAATCT CCAGGAGACC GCGACCGTCA CCTCGGGCAC GGTGGTGCGC 
CAGTTCGTGA CGAACGCCGC CAAGTCCAAG GTCAAGGGCG TCGAGCTCGG GCTGACCGTG CGGCCGAGCT CGAACGTCAC CCTGACCTCG AACGTCGCCT ATCTGTCGTC GCGCTACGCC GATTATCCCA ATGCGCCCTG CACGACGACG CAGCAGGCGC TGGCGCCGGT GTGCGTGCAG GACCTGTCGG GAGCGCGGCG GGCCTTCGCG CCGAAGCTGA GCGGAAATGT CGGCATGAAC GTCACCCAGC CGCTCGGCCG CTACCAGCTC AGCTTCGACG CCAATGGCTA CTTCACCACC CGCTTCCTCC AGATCGCGAC CGGCGACTAT CGCATCTCGC AGGAGGGCAA CGTCAAGATC GACGCGCGGA TCGGCTTCGG TCCGGCCGAC GGCCCATGGG AACTGGCCGT CGTCGGCAAG AACCTCACCA ACAAGCTGAC CGCGGGCTAT CGTCAGGTGG TGCCGGGCGC GACAGGCAGC CTCGCCGCCA TGGCCGATGC CCCGCGCTCG ATCGGCCTGC AGGGTTCGTT CCATTTCTAA
|
Protein sequence | MTKFKALGST AIIGCMALAL ISPAHAQDAA APAADGDEII VTAQGREQNL QDVPVSVSAA TGISLQKQAI TTLEALSVRQ PNFRISQSPA SDYVVIRGIG SSANIGFEQS VGTFVDGVYR GRSRSTRAAL FDLERVEILK GPQTTYFGNN SIAGALNITT RKPGRDLAVN GTASYFPNTG EYLVEGGITL PVTDRLSLRL AARQSGMDGY IKNIQTGKDG PHLNDRIGRV SMAWAPTDGI EIDARLDVGR MRDTSVFNVE LLDCPTSAPF AGPAGPCARY LNASGGSVDS KLDRVSGANP SYFDYDMVEG VWGMKIAAGD NTLTLTTGYF HHKYHLLNDP VPVPGTRGGS AVGTTTALPI ALFEKYDQVS QELRFASPED RTISYMFGAY YQYGKLTTNL IQGFYFAPVA AATGGLIPLA TPVAGSITTT ERSDVFSGFA AATWRATDAL RVNVGGRFSL VDKHDARATQ MGTAASIPSL ANFVPFTVAL TQLYAASGIN PGNYAVPNRS DHAFLPSASI QYDLSRNAMA YVSYVEGFKA GGYSIGTTNS SFDPERVKSY ELGIKADLLD RLLTVNLAGF YSRYRNLQET ATVTSGTVVR QFVTNAAKSK VKGVELGLTV RPSSNVTLTS NVAYLSSRYA DYPNAPCTTT QQALAPVCVQ DLSGARRAFA PKLSGNVGMN VTQPLGRYQL SFDANGYFTT RFLQIATGDY RISQEGNVKI DARIGFGPAD GPWELAVVGK NLTNKLTAGY RQVVPGATGS LAAMADAPRS IGLQGSFHF
|
| |
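The relationship between the Gene sequence and Protein sequence fields can be illustrated with a small translation sketch. It uses only the Python standard library and builds the codon table by hand. It assumes the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in which start codons it permits), and it strips the 10-base spacing used in the record before translating. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a forward-strand CDS in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGACCAAGT TCAAAGCGCT CGGTTCGACG"))  # MTKFKALGST
print(translate("CATTTCTAA"))                         # HF
```

The first call reproduces the first ten residues of the protein sequence, and the second shows that the final codon of the gene is a stop (TAA) preceded by the C-terminal residues HF. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.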