Gene Information |
Locus tag | Swit_2179 |
Symbol | |
ID | 5200635 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 2440866 |
End bp | 2443892 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640581724 |
Product | hypothetical protein |
Protein accession | YP_001262676 |
Protein GI | 148555094 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
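The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the 3027 bp reported in the record) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Swit_2179.
length = gene_length_bp(2440866, 2443892)
print(length, protein_length_aa(length))  # prints "3027 1008"
```

Both results agree with the Gene Length and Protein Length fields, which suggests the record is internally consistent under these assumptions.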
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0315308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000146786 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATCG TCGCTCGCCT GAAGATGAAT GGGCAGAATT TCTCGGCGGA GTTGGGCAAG GTGCTCGGTG ATGCTGAAAG GAAGTTCGGC AACTCCGGCA GCGTTATCGG CCGCAACCTC GGTGATGGGA TCGGCGGCGG ATTGCAGACG GCTGCCTCGC GCGTGCCTGT TCTTGGCAAT GCACTCGCCG GGCTGTCGGG CACAGCTCTC GTAACTGCGG CGGGCTTGGG GGCGGTAACC CTCGCGATGG CGCACGGTGT CACTGGCATC GAAGAGTATG AGGGGGGTGT CCGCCGGCTA GATGCCGCGC TACGGGCTAC CGGAAATACT ACCGGTCTGA CGCGCGATCA GCTGATCGCC CTTGCCAATG ACATGGAAGG CGCCTGGGTG ACGCCCGCCG AAGATATCAT GGCGGCCGAG CAGGTATTGG CCTCGTTCGG CGGTGTGGCC GGGAGCGTGT TCGAGCGCGC TATCGCAGGG GCGGCGGACC TTTCGGAGGT GTTTGGCGGA GACCTTTCGT CGAATGCTGA AAAGGTCGGA ACTGTTCTCC AGAACTTGGC GCAGGGTGAG GTCAAGGGAC TTGAGCGGGG CTTTAAGTTC CTAGGAACGG AAACACTGAA TTCGATCGGT CATCTGGCGA AGGTGGGGAA AACTGCCGAG GCGCAGGAAG CGCTACTAAC CGCGCTGGAA GAGAGGATCG GCGGGTCGAA GGAGGCTGGG GGCAAAGGCC TGAACGGAGC CTTCTTCCGG CTCGGCGAGT CGATCAATGG TGTCATTGAG AACCTCACGC GCAGTACCGG TGTTTATGAA GGTAGCATCA CGTGGGTTGA TGCTCTGACC GTGAAGGTCA ACAAGCTCGC GGACAGCATC GACCGCGTGG GTGCGGGGCG GACGCTCGGC TCTCTTTGGA TGGGCTATGG TCTCCCCGAC GGGCCAGGAG CGGGCGCGCC GGAAAAACTG ACGGACGATC CCTATGGGCT CAATCGGCCC GGTCTATCCC TAGGCGATAT CATCGCGCGA AAGGCCGCAG GCGATGCATA CGAATCGGCG GAAGCGCAGC GCAAAGCGGA GGACGCCCGC GAAGCGGCTG AGAAGGCGTC GAAGAAGGCG CGGGAGGATC GGGAGCGCGA AGCGGAACGG GAAGCCAGCG ACTTGGCTCG GCGTCTGGAA GCGGTTCGCG ACAAGTACGG GACCACCACC ACGGCGGCGC GCGACTATGC TGACGCCCTA GCCGACATCA ACCGACTGAC CGACGCCGGC AAGCTCGACG GCACCGAAGC CGGGTTGCTC CGCCTCGAAG CCTACAGGCG CAAGGCAGCG GCTGACACCG AAGCCATGCG CCGTGCGATT TCGGCAACCG GGGCGACTGA TCTCCGCGAT CTCGTCGAGA ATGGCGATGT TGCCGCAGGT ATCGTAGGCG AAAAGGGCAA GGCGGCGATC GACGATTTCG TTGATTACTA CCATGACCGA CATATCGCCG CGATCGAGGA TATGGCCGAC ATCATGACCG ACCTGATCGG CGGCAAGGCC GGTCGGCTGA TCGGCGACCT GATCTCCGTT GCCGGCGGCG GCCGGGCCAA CAACAGCGCG CTCAACCTGT TGCTGAAATG GAAGGGGACG CCCCCGATCG TCGGCGAGCC CGGCGACAAC GGCGAGGTCG CGGGATCGGT CGACAAGCTC GGCAAGCACA TGGAGGGTAT CTTCGGGCTG TCGGGCGAGT TCACGCAATC GCTGGGCCGG ATGCTCGCGG GCGCCGGCCT CGGTGCGGCA GCGGGTAGCC TGGTCGGATC GTCGCAGGCT 
TCGCAGTTCG GGGCCATGGC TGGCGGCGCG CTGGGCGAGA AGGCCGGCGA CATGCTCGGC AAAACGTTCG GGAAACAGCT GGGGAAGCTG GCGGGCTTCG CCGGTCCGCT CGGCTCGATC GCGGGCGGCC TGGTCGGCAG CGCGATCGGC GGCTTGCTCG GCGGCGTCAA ATGGGGCGCG TCGACCATCT CGTTTTCGGG CGGAGAGTTC TCGGCCGGCA AGGCGACGGG CAATAGCGGC AGCGCCCGGC AGAACGCCAG CGCCAGCGCG AACAGCGTCG TCAACAGCCT GGAACGCATC GTCGAGCAGC TCGGCGGCAC GGTGCTGTCG TCGCCGAACA TCACCATCGG CCAGCGCCAT GGCGACTATC GCGTCAATAC CGGCGGCACG TCGTTGAAGA TCAAGAAAGG CGCCGTCGAG TTCGACGACG ATCAGCAGGG GGCGGTCGAA TATGCGATCC GCCAGATGCT GGCCGGCGCG GTGATCAACG GGATTTCGCA GGCGGCGAAG AATGTCCTGA ACAATCCCGA CAATGATCTG GAGGAAGCCG TTGCGAAGGC CGGCTATATC GAGGCGATCC CCAAGGCGCT GAAGGCCCGG CTCGATCCGG TAGGCGCCGC GATCGACGCA CTCAACGACA AATGGGAAAA GACGGTCGAG GCGCTTCGCG AGGGCGGCGC CGGTGCCGAG CAGATGGCCG AGGCGCAACG GCTCTACAAT CTTGAGCTGG AAGATGTGAA GATCAACACC AAGGCGGCCT CGGCCAGCCT CAACGAGTTC CTGAAGGGGC TCAAGGTGGG ATCGTCGTCG CCGCTGTCGC TGCGCGATCA GGAGGCCGCC GCCAAGGCCG AGTTGCAGCC GTTCCTCGAT CGGATCGCCG CCGGCAGCTC GATCGATCAG GACGCCTATC AGTCGGCCGC GCAGACCTTC CTCGATATCG AGCGGCAGTT GTACGGGTCG ACCACCGCCT TCTTCACGGC GTTCGATCAG ATACAGGCGG CGACCGCCAA GGCGATCGCG ACGATCGACA ATGCCGTGCC GATCAGCGAC CCGGCCGCCG ATCCGTTCGC GGAGAAAACG GCGACCGCGA CCCAGACCAG CGCCCAGCTG CTCGACCAGA TCAGCGGGCA ATTGCAGGGG CAGAGCGCGC AGCTCGCGGC GCTGCTGGAG GCATTCAAGG CCAATGGGGG CGGCTTCGGC TTCATCGGTG GCGGGAGGGC CTTCTAA
|
Protein sequence | MDIVARLKMN GQNFSAELGK VLGDAERKFG NSGSVIGRNL GDGIGGGLQT AASRVPVLGN ALAGLSGTAL VTAAGLGAVT LAMAHGVTGI EEYEGGVRRL DAALRATGNT TGLTRDQLIA LANDMEGAWV TPAEDIMAAE QVLASFGGVA GSVFERAIAG AADLSEVFGG DLSSNAEKVG TVLQNLAQGE VKGLERGFKF LGTETLNSIG HLAKVGKTAE AQEALLTALE ERIGGSKEAG GKGLNGAFFR LGESINGVIE NLTRSTGVYE GSITWVDALT VKVNKLADSI DRVGAGRTLG SLWMGYGLPD GPGAGAPEKL TDDPYGLNRP GLSLGDIIAR KAAGDAYESA EAQRKAEDAR EAAEKASKKA REDREREAER EASDLARRLE AVRDKYGTTT TAARDYADAL ADINRLTDAG KLDGTEAGLL RLEAYRRKAA ADTEAMRRAI SATGATDLRD LVENGDVAAG IVGEKGKAAI DDFVDYYHDR HIAAIEDMAD IMTDLIGGKA GRLIGDLISV AGGGRANNSA LNLLLKWKGT PPIVGEPGDN GEVAGSVDKL GKHMEGIFGL SGEFTQSLGR MLAGAGLGAA AGSLVGSSQA SQFGAMAGGA LGEKAGDMLG KTFGKQLGKL AGFAGPLGSI AGGLVGSAIG GLLGGVKWGA STISFSGGEF SAGKATGNSG SARQNASASA NSVVNSLERI VEQLGGTVLS SPNITIGQRH GDYRVNTGGT SLKIKKGAVE FDDDQQGAVE YAIRQMLAGA VINGISQAAK NVLNNPDNDL EEAVAKAGYI EAIPKALKAR LDPVGAAIDA LNDKWEKTVE ALREGGAGAE QMAEAQRLYN LELEDVKINT KAASASLNEF LKGLKVGSSS PLSLRDQEAA AKAELQPFLD RIAAGSSIDQ DAYQSAAQTF LDIERQLYGS TTAFFTAFDQ IQAATAKAIA TIDNAVPISD PAADPFAEKT ATATQTSAQL LDQISGQLQG QSAQLAALLE AFKANGGGFG FIGGGRAF
|
| |
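The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be recomputed from the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one: translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons it permits, and this gene begins with ATG, so no special start handling is included. Although the gene lies on the minus strand, the sequence as listed already starts with ATG, so the sketch assumes it is given in coding orientation.

```python
from itertools import product

# Standard codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGATATCG TCGCTCGCCT GAAGATGAAT"
print(translate(first_blocks))  # prints "MDIVARLKMN"
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.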