Gene Information |
Locus tag | Shew_1957 |
Symbol | |
ID | 4920990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | + |
Start bp | 2262067 |
End bp | 2263227 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640163526 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001094082 |
Protein GI | 127512885 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
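The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions agree with the values listed, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2262067
end_bp = 2263227

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last
# codon is a stop codon, which is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1161
print(protein_length_aa)   # 386
```

Both results match the "Gene Length" and "Protein Length" fields.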
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000379744 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000558993 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAGAGA TTCTGTTTCT GTTGTTACCC ATTGCCGCCG GTTACGGTTG GTACATGGGT CGCCGTAGTA TCAGGCATAA GCAGAATAGT AAACGTAAGC AGTTAAGCCG CGATTACTTT ACCGGCCTTA ATTTCCTTCT GTCTAATGAA TCGGATAAGG CAGTCGATCT CTTTATCTCC ATGCTGGACG TGGATGACGA CACCATTGAT ACCCATCTCT CCCTCGGCTC CTTGTTTCGC AAGCGAGGCG AGGTAGACCG CTCGATTCGT ATTCACCAAA ACCTGATTGC CCGCCCAACT CTGACAACCG AACAGCGTGA CATCGCCATG ATGGAGCTGG GCAAAGACTA TCTGGCCGCC GGCTTCTACG ACAGAGCCGA GGAGATCTTC CTTAATCTGG TTCGCCAGGA AGATCACAGC GAAGAGGCCG AAGATCAGCT GATCGCCATC TACCAGGTGA CCAAAGACTG GCAAAAAGCA ATAGATATCA TCAAGAGCCT CAAGCGTAAG CGTCAGCAAT CGCTCAAACA CCTGCAGGCC CATCTCTATT GTGAGCTTGC CGATGAGGCC AGCGACAGCG AGCTCAAGCT TAAACACCTG GCACAGGCGA TAAAGCAAGA TCCCCAATGT GGCCGCGCCA TGTTAACCAG CGCCAAGCTG TTCCTCGCTC AGCAGGAATT TGGCCGCGCC AAGGAGATGC TCTGCCGGTT GAAAGATGCC GATATCGAAC TCTTTCCCGA GGCGCTCGCC ATCGCCAAAG AAGTTTATCA ATCGACCGAG GATCTCGGCG CCTATCGTGA ACTGCTACGC GAAGCGTTAG AGCAGGGGGC TGGCGCGAGT GTGGCCATCA CTCTGGCGCA GCAGATGATC ATTCAGGGAG AAACCCAAGA CGCCGAGAAG TTGATTCTCG ATGGCCTCTA TCGCCATCCG ACCATGAAGA GTTTCCAGCA TCTGATGAAG ATGCAGATCC AACACGCCGA AGATGGTCAG GCAAAACAGA GTTTGAACAT GCTCGCCGAA CTAGTCGAGC AGCAGATAAA ATTCCGTCCC AGTTACCGCT GTATTGAGTG TGGTTTCCCG TCCCACACCC TCTACTGGCA TTGCCCTTCC TGTAAGAGTT GGGGCACCAT CAAGCGGATC CGCGGACTCG ACGGGGAGTA A
|
Protein sequence | MLEILFLLLP IAAGYGWYMG RRSIRHKQNS KRKQLSRDYF TGLNFLLSNE SDKAVDLFIS MLDVDDDTID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTTEQRDIAM MELGKDYLAA GFYDRAEEIF LNLVRQEDHS EEAEDQLIAI YQVTKDWQKA IDIIKSLKRK RQQSLKHLQA HLYCELADEA SDSELKLKHL AQAIKQDPQC GRAMLTSAKL FLAQQEFGRA KEMLCRLKDA DIELFPEALA IAKEVYQSTE DLGAYRELLR EALEQGAGAS VAITLAQQMI IQGETQDAEK LILDGLYRHP TMKSFQHLMK MQIQHAEDGQ AKQSLNMLAE LVEQQIKFRP SYRCIECGFP SHTLYWHCPS CKSWGTIKRI RGLDGE
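The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, translate a nucleotide string codon by codon, and compute a GC fraction. The function names are hypothetical and not part of any database tooling. The codon table is built from the standard amino acid assignments, which translation table 11 shares for elongation (table 11 differs in its set of permitted start codons); unrecognized codons are rendered as `X` as a convenience of this sketch.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0

# First three blocks of the gene sequence in this record.
print(translate("ATGTTAGAGA TTCTGTTTCT GTTGTTACCC"))  # MLEILFLLLP
# Final 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CGCGGACTCG ACGGGGAGTA A"))           # RGLDGE
```

The two printed fragments agree with the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the "GC content" field.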