Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_2039 |
Symbol | |
ID | 5079465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | - |
Start bp | 2324315 |
End bp | 2325475 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640499201 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001183559 |
Protein GI | 146293135 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000051 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAGA TCCTCTTTCT GTTGCTTCCC ATTGCTGCCG GTTACGGTTG GTATATGGGT CGGCGGAGCA TAAGGCAGAA TCAAAGTAAT CAGCGCAAGC AATTAAGTCG TGATTATTTC ACGGGCCTTA ATTTCTTATT GTCCAATGAG TCGGATAAAG CGGTTGATTT ATTTATCAGT ATGTTAGATG TAGACGATGA GACTATCGAC ACTCATTTGT CCTTAGGTTC ACTTTTCCGT AAACGTGGCG AAGTTGATCG CTCCATTCGT ATTCACCAAA ATTTGATTGC TAGGCCTACA CTCACCAATG AGCAGCGCGA CATTGCTATG ATGGAGCTGG GTAAAGATTA TCTGGCGGCA GGATTTTATG ATAGGGCTGA AGAAATCTTT ATTAATCTTG TAAGCCAAGA TGATCACAGT GAAGAATCTG AAACACAATT AATTGATATT TATCAAGTGA TTAAAGAGTG GCAAAAAGCT ATTGATATCA CAAAACGTTT AAGCCGCAAG CGCCAGCAAG TGCTTAAACC TATCATTGCT CACTTCTATT GTCAGTTGGC CGATGAAACC AATGATGACG ACAAAAAAAT TAAGCTATTA CAACAGGCAC TTAAACAGGA TCCTAAGTGT GGCCGGGCCT TACTCACACT TGCAAAAAAA TTCCTCGATA TTCAAGATTA CCCACAATGT AAAGCCATGC TAGCGGCGCT CAAAAAAGCG GATATTGAAC TCTTTGCTGA TGCATTGCCG ACAGCGAAGC AAGTTTACCG TGATACCCAA GATAAAGAGG GCTATCAGGA ATTACTTGCA GGGGCTATGG CAGAAGGCGC TGGAGCCTCT GTGGTTGTCG CTCTCGCACA GCATTTGATC AGCCTTGATG AAATTAAAGC GGCTGAAAAT ATAGTGCTTG ATTCGCTATA TCGCCATCCC ACCATGAAAG GCTTTCAACA CCTAATGCAG ATGCACCTAC GTCAAGCTGA AGACGGACAA GCTAAGCAAA GTTTAGCTAT GTTAGAGCAA CTTGTTGAAC AACAAATAAA ATTTCGCCCT AGTTACCGTT GTAAGGAGTG TGGTTTTCCA TCACATGCGC TTTACTGGCA TTGCCCATCT TGTAAAAATT GGGGCAGTAT AAAACGGATC AGAGGGCTTG ACGGCGAATA A
|
Protein sequence | MLEILFLLLP IAAGYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTNEQRDIAM MELGKDYLAA GFYDRAEEIF INLVSQDDHS EESETQLIDI YQVIKEWQKA IDITKRLSRK RQQVLKPIIA HFYCQLADET NDDDKKIKLL QQALKQDPKC GRALLTLAKK FLDIQDYPQC KAMLAALKKA DIELFADALP TAKQVYRDTQ DKEGYQELLA GAMAEGAGAS VVVALAQHLI SLDEIKAAEN IVLDSLYRHP TMKGFQHLMQ MHLRQAEDGQ AKQSLAMLEQ LVEQQIKFRP SYRCKECGFP SHALYWHCPS CKNWGSIKRI RGLDGE
|
| |