Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4332 |
Symbol | |
ID | 4245984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6677849 |
End bp | 6678901 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109219 |
Product | hypothetical protein |
Protein accession | YP_723797 |
Protein GI | 113477736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC GCTTTCGAGA ATTACTCAAA ATCATTGGCA GTGGAACCCA CACAGGTAAA AATTTAACTC GCCAAGAAGC AGCAGCAGCA ATGCGCATGA TGTTATTAGG AGAAGCAACA CCTACACAAA TAGGTGCTTT TCTTATTGCT CACCGTATTA AACGCCCCAC TGGTGAGGAG TTAGCAGGAA TGTTAGATAC CTATAATGAA TTGGGGCCAA AACTTAAAAG TCAACCATCT ATGGGTACAG TAACCGTCTT AGGTTGTCCT TATGATGGGC GATCGCGGAC TTCACCTGTC ACCTTACTCA CAGCTTTAAT TTTGGCAACA GCAGGAGTAT TTGTTGTCAT CCACGGTGGA AGGCGGATGC CGACAAAAGA AGGCATACCT TTTATTGATA TCTGGCAAGG ACTAGGGGTT GAGTGGGGAA AATTATCGCT GGTGGAGGTT CAACGAGTAT TTGAGGAAAC TGGTCTAGGG TTCGTCTATT TACCAAGACA TTTTCCTCAA GCAGATGCTT TAGTAAAACA TCGTCGAGAT ATTGGTAAAC GACCTCCTAT CGCAATAATG GAATTAATTT GGGTACCCTT GGCGGGAGAA GTTCATTTAG CTGCAGGGTA TGTTCATCCT CCCACAGAAG GTATGTTTCG TGAAGTATTG GAATTACATG GTTTGAGGAA TTATACAACG GTGAAGGGGT TAGAGGGAAG TTGTGACTTG CCCCGCGATC GGACAGCTAT TATTGGGGTA TCGTTGTCAT CTGGGAATGA TGCCACATTT GAACGTCTAT TGTTACATCC GAGTGACTAT AGTTGTGGAG GGAAGGAAGT TGTATTGGGT TCAACTGCAG AGTTAGTAGA AGAGATACAA AAAATACTAC AGGGTAAAGC CAGTAAGTTA ATGTCAGCAG TTATTTGGAA TGGCGCTTTT TATTTGTGGC GTTGTGGAAT TTGCTCTGAT ATTAATGAAG GTTTGTTGAA AGCGGAAAGT TTATTAAATA GTGGTAAAGT TAGGGATAAG TTGAGAGAAA TTAAAGCAAA AATTGAGATA TAA
|
Protein sequence | MSDRFRELLK IIGSGTHTGK NLTRQEAAAA MRMMLLGEAT PTQIGAFLIA HRIKRPTGEE LAGMLDTYNE LGPKLKSQPS MGTVTVLGCP YDGRSRTSPV TLLTALILAT AGVFVVIHGG RRMPTKEGIP FIDIWQGLGV EWGKLSLVEV QRVFEETGLG FVYLPRHFPQ ADALVKHRRD IGKRPPIAIM ELIWVPLAGE VHLAAGYVHP PTEGMFREVL ELHGLRNYTT VKGLEGSCDL PRDRTAIIGV SLSSGNDATF ERLLLHPSDY SCGGKEVVLG STAELVEEIQ KILQGKASKL MSAVIWNGAF YLWRCGICSD INEGLLKAES LLNSGKVRDK LREIKAKIEI
|
| |