Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0143 |
Symbol | |
ID | 5605951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 154409 |
End bp | 156061 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640935630 |
Product | hypothetical protein |
Protein accession | YP_001476381 |
Protein GI | 157368392 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCAA CAAATAACCC GCAATCCGAT AACTCCCTGT GGCGCTACTG GCGTGGGCTG GGGGGCTGGA ACTACTACTT CCTGGCCAAG TTTGCCCTGC TGTGGTTCGG TTACCTGAAT TTCCACGCAC TGCCAAACCT GGTGTTCATG GCATTTTTGT TGATGCCGAT ACCGGCGCTG CGCGTGCATC GCTGGCGCCA TTACCTGGCG ATCCCGATCG GTTTCGCGCT GTTCTATCAC GACACCTGGC TGCCCGGCAT CAACAGCATC ATGAGCCAGG GCTCGCAACT GACCGGCTTC AGTGCCCAGT ATCTGCTGGA GTTGACCAAC CGCTTTATCA ACTGGCAGAT GATTGGCGCC GCCTTTGTGA TGCTGATTGC CTATCTGTTT TTATCCCAGT GGATACGGGT GACGGTGTTC ACCGTTGCCG CGCTGGTGTG GCTGAACCTG GTCAATATTG CCGGCCCGGC AGTGTCACTG CTGCCCGCCA GCAGCACCGC TTCCACCAGC GGCACACCGG CAGCGACAGC CCCGGCGGCT GGCGGTGATA GCGCCCCGGC GGACAGTGCG CCACCCACCA GCGCCAACCT GACGGCCTAT CTGAACCAGT TCTACGATAA GGAAAAGGCC CGCGCCACCG CCTTCCCGGC CAGCCTGCCT GCCGATGCTC AGCCATTCGA CCTGCTGGTG ATCAATATCT GTTCGCTGGC TTGGGCCGAC ATGGATGCGG TGAAACTGGA AAACCACCCG CTGTGGTCGA AGATGGACAT TATGTTCGAC AACTTCAACT CGGCGACCGC CTACAGCGGC CCGGCGGCCA TCCGTTTGCT GCGCGCCAGC TGCGGTCAAC CCTCACACCA CGACCTGTAT CAACCGGTGA ATCAGCAGTG CTATCTGTTT GATAATCTGG CCAAGCTGGG CTTCAAAGAA CAGCTGATGC TCGATCACTC CGGCGTGTTC GGCAACTTCC TCAAAGAGCT GCGTGAACAA GGGGATATAC AGGCCCCGCT GATGTCTCAG GCCGGTATCG GCAATGAACT GGCGTCGTTT GACGGCGAAC CGATTTACAA CGATCTGGAA CTGCTCACCC GCTGGCTGAG CCAGCAGCAA AAGGCCGGCG ATACCCGCAG CGCCACCTTC TTCAACGTCA TTCCGCTGCA TGACGGCAAC CGTTTTGTCG GTTCGAACAA GAGCGCCGAC TATCAGCCGC GGGCGCAAAA ACTGTTCGAC CAGTTGAATA CCTTCCTTGA CCAACTGGAG AAATCCGGAC GCAAGGTGAT GGTGGTGATT GTGCCGGAAC ACGGTGCAGC GCTGGTGGGG GACAAAATGC AGATGTCCGG CCTGCGGGAT ATCCCCAGCC CGAACATTAC CCATACGCCG GTGGGCATCA AACTGGTGGG CATGAAAGCG CCGCATCAGG GCAGCCCGTT GCAGGTTAAA ACGCCGAGCA GTTACCTGGC GCTTTCCGAG CTGGTCTCAC GCCTGGTCGA CGGTAAAGCG TTCACCGCCC CAAGCGTCGA CTGGCAGGCG CTGACGCAGA ACCTGCCGCA GACGGCGGTC ATTTCGGAGA ATGACAACGC CATCGTGATG CAATACCAGG GTAAACCGTA CATCCGCTTG AACGGCGGTG ATTGGGTGCC TTACCCCCAG TAA
|
Protein sequence | MKPTNNPQSD NSLWRYWRGL GGWNYYFLAK FALLWFGYLN FHALPNLVFM AFLLMPIPAL RVHRWRHYLA IPIGFALFYH DTWLPGINSI MSQGSQLTGF SAQYLLELTN RFINWQMIGA AFVMLIAYLF LSQWIRVTVF TVAALVWLNL VNIAGPAVSL LPASSTASTS GTPAATAPAA GGDSAPADSA PPTSANLTAY LNQFYDKEKA RATAFPASLP ADAQPFDLLV INICSLAWAD MDAVKLENHP LWSKMDIMFD NFNSATAYSG PAAIRLLRAS CGQPSHHDLY QPVNQQCYLF DNLAKLGFKE QLMLDHSGVF GNFLKELREQ GDIQAPLMSQ AGIGNELASF DGEPIYNDLE LLTRWLSQQQ KAGDTRSATF FNVIPLHDGN RFVGSNKSAD YQPRAQKLFD QLNTFLDQLE KSGRKVMVVI VPEHGAALVG DKMQMSGLRD IPSPNITHTP VGIKLVGMKA PHQGSPLQVK TPSSYLALSE LVSRLVDGKA FTAPSVDWQA LTQNLPQTAV ISENDNAIVM QYQGKPYIRL NGGDWVPYPQ
|
| |