Gene Information |
Locus tag | Sbal223_2069 |
Symbol | |
ID | 7088362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2450084 |
End bp | 2451244 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643460972 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_002357996 |
Protein GI | 217973245 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
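The start, end, and length fields in the table above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for locus tag Sbal223_2069.
length = gene_length_bp(2450084, 2451244)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results match the "Gene Length" and "Protein Length" rows of the record.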
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000265574 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000000285968 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCTTGAGA TACTCTTTCT GTTGCTTCCC ATTGCTGCCG CCTACGGTTG GTATATGGGG CGACGGAGCA TAAGGCAGAA TCAAAGCAAT CAACGTAAAC AATTGAGCCG GGACTATTTT ACTGGCCTCA ATTTCTTACT GTCGAACGAA TCGGATAAAG CGGTTGATTT GTTTATCAGT ATGCTAGATG TTGACGATGA AACCATAGAT ACCCATCTGT CACTTGGTTC GCTGTTCCGC AAGCGAGGCG AGGTCGACCG CTCCATTCGT ATCCATCAGA ATTTAATTGC CCGTCCAAAC CTAGCCAATG AGCAGCGCGA CATCGCTATG ATGGAGCTGG GTAAAGATTA TCTGGCCGCG GGTTTTTACG ATAGGGCCGA AGAAATCTTT ATCAACTTGG TTAGCCAAGA CGATCACAGT GAAGAGTCAG AAACTCAGCT CATTGCTATC TATCAAGTGA TTAAAGAGTG GCAAAAAGCC ATTGATATCA CTAAGCGTTT GAGTCGTAAA CGTCAGCAAG TCTTAAAGCC GTTAACGGCG CATTTCTATT GCCAGTTAGC CGATGAAGCC AGCGATGATG CGCAGAAAAT TAAACTGCTG CAACAAGCGC TAAAGCAAGA TCCGCAATGC GGTCGTGCGT TATTGACCTT AGCGAAAAAA TTCCTCGATA TTCAAGATTA TGCTCAGTGC AAGCAAATGC TGCTGCAACT GAAAAAAGCC GATATCGAGC TTTTTGCCGA TGCAATCCCC ACGGCCAAAC AAGTTTATCG CGACACACAA GACAAAGAAG GCTTCCAAGA GTTACTGGCG GGCGCTATGG CCGATGGCGC GGGTGCTTCA GTGGTTGTCG CCTTAGCGCA GCACATGATA AGTCTGGATG AGATTAAAGC GGCAGAGACT ATGGTGCTCG ATGCCCTATA TCGCCATCCA ACCATGAAAG GTTTTCAGCA CTTGATGCAA ATGCATTTGC GTCAAGCAGA GGAAGGGCAA GCCAAGCAAA GTTTAACTAT GCTAGAGCAG CTCGTTGAGC AACAAATTAA ATTCCGTCCA AGCTACCGTT GTAAAGAGTG TGGTTTCCCT TCACATGCAC TTTACTGGCA TTGTCCTTCC TGTAAAAACT GGGGCAGTAT CAAGCGGATC AGAGGCTTAG ACGGCGAGTA A
|
Protein sequence | MLEILFLLLP IAAAYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPN LANEQRDIAM MELGKDYLAA GFYDRAEEIF INLVSQDDHS EESETQLIAI YQVIKEWQKA IDITKRLSRK RQQVLKPLTA HFYCQLADEA SDDAQKIKLL QQALKQDPQC GRALLTLAKK FLDIQDYAQC KQMLLQLKKA DIELFADAIP TAKQVYRDTQ DKEGFQELLA GAMADGAGAS VVVALAQHMI SLDEIKAAET MVLDALYRHP TMKGFQHLMQ MHLRQAEEGQ AKQSLTMLEQ LVEQQIKFRP SYRCKECGFP SHALYWHCPS CKNWGSIKRI RGLDGE
|
| |
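The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming a plain string copy of the sequence field; `clean_sequence` and `gc_fraction` are hypothetical helper names. The example runs on only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCTTGAGA TACTCTTTCT")
print(len(fragment))                    # 20
print(round(gc_fraction(fragment), 2))  # 0.35
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the "GC content" row.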