Gene Information |
Locus tag | Tery_3371 |
Symbol | |
ID | 4243466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5168831 |
End bp | 5170021 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108356 |
Product | glycosyl transferase family protein |
Protein accession | YP_722946 |
Protein GI | 113476885 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
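The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Tery_3371.
length = gene_length_bp(5168831, 5170021)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the record's values this reproduces the listed Gene Length of 1191 bp and Protein Length of 396 aa.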
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.547122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAA CTTTTTGCAT AATTGTTAAA AATGAGGAAA AAAATTTGCC ACGATGTTTG GCAAGCGTGA AAAATGTAGT AGATGAAATA GTTGTTTTGG ACACGGGTTC AACGGACCGA ACCCCAGAAA TAGCTCAAGA ATTTGGTGCA AAAGTACATT ATTTTGAATG GTGTAATGAT TTTGCAGCAG CTAGAAATGA ATCTTTGAAA TACGTTACAG GTGACTGGGT TTTAGTATTA GATGCTGATG AATATCTCTC ACCAAAAATA GCTCCACATA TTAGACAAGC TATTCAGAGC GATCGCTATA TTTTAATTAA TTTAATTAGA GAAGAAATAG GAGCACAACA ATCTCCATAT TCTTTAGTTT CTCGTCTCTT TAGAAATCAT CCTGGGATAA AATTTTCTCG ACCTTATCAT GCCATAGTTG ATGATAGTAT TTCTGAGATT TTAGCAAAAG AATCTAACTG GCAAATAGGT TCTTTGTCAG AAATAGCAAT TTTACATGAG GGTTATCAAA AGGGAGAAAT TACTTCTAAA AATAAACTCG AAAGAGCCAA GGCAGCGATG GAAAGTTTTA TTATAGACTA TCCTAATGAT GCTTATGTTT GTAGTAAATT GGGAGCTTTA TATTTGGAAG TTGATGAAAG AGAAAAAGCG ATGGAATTAT TGTTTCGAGG TTTGCAAGAT TCGGAGGTAG ATAAAACAGT TTTGTATGAG CTACATTACC ACTTAGGAAT TGCCTATAGA CAAAAGCAAG AAAAAGAAAT GGCAAAGCGT CATTATCAAA TAGCAACAGA GTTAAATATT TTGCCTAAGT TGAAACTAGG AGCATACAAT AATTTAGGTA GTTTGTTGAA AGAAGAAGGA AATTTAGAGC AGGCAAAAAC TAACTATGAA ATTGCTTTGC AGATAGACTC TAATTTTACA GTTGGTCATT ACAACAGGGG TATGGTTTTG AAAGAAATGG GTTGGTTTAC TGAAGCGATC GCCTCTTATC AAAAAGCAAT TCAACTTGAT GCTAACTATG CAGAAGCTTA TCAAAATTTA GGAGTTGTAT TGTTGAAAGT AGGTCAAGTA CCAGAAAGTT TAGAAGCATT TGGGAAAGCT ATTAGTTTGC ATGAAAAATA TAACCCTATT GAAGCTAAAA GAATTAGGCA AGGTTTAGAA TCAATGGGTT TTATGTTTTA G
|
Protein sequence | MNLTFCIIVK NEEKNLPRCL ASVKNVVDEI VVLDTGSTDR TPEIAQEFGA KVHYFEWCND FAAARNESLK YVTGDWVLVL DADEYLSPKI APHIRQAIQS DRYILINLIR EEIGAQQSPY SLVSRLFRNH PGIKFSRPYH AIVDDSISEI LAKESNWQIG SLSEIAILHE GYQKGEITSK NKLERAKAAM ESFIIDYPND AYVCSKLGAL YLEVDEREKA MELLFRGLQD SEVDKTVLYE LHYHLGIAYR QKQEKEMAKR HYQIATELNI LPKLKLGAYN NLGSLLKEEG NLEQAKTNYE IALQIDSNFT VGHYNRGMVL KEMGWFTEAI ASYQKAIQLD ANYAEAYQNL GVVLLKVGQV PESLEAFGKA ISLHEKYNPI EAKRIRQGLE SMGFMF
|
| |
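The relationship between the gene sequence and the protein sequence above can be checked with a short script. The sketch below is illustrative only and makes several assumptions: the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), the spaces are only display grouping, and the codon-to-amino-acid assignments of translation table 11 match those of the standard genetic code (table 11 differs in the set of permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (standard code,
# which table 11 shares for sense and stop codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character display groups and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display groups of the gene sequence from the record.
print(translate("ATGAATTTAA CTTTTTGCAT AATTGTTAAA"))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two fields agree. `gc_fraction` can be applied to the same input to compare against the listed GC content.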