Gene Information |
Locus tag | Tery_3477 |
Symbol | |
ID | 4244477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5355116 |
End bp | 5356810 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638108451 |
Product | glycosyl transferase family protein |
Protein accession | YP_723040 |
Protein GI | 113476979 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAACA AAATATTAAC AAAGAAAAAC TTACATTCTC ATATAGATTT ACTGCTAACG CTAGGCTTAT TGCTAGCAGC AGTAGTACTT TTTTCTACTA ATTTAGGAAC ATTGCCTTTA CGAGACTGGG ATGAAGGAAT AGTTGCCCAA GTAGCCAGAG AAATCAGCCG AGACAAGTGG AATTGGCTTT ATCCTACTAT CAATAATACT CCTTATTTTA ATAAACCACC ATTGATACAT TGGTTGATCG CTTTTATCTA TAGTATTGCA GGAGTCAATG AATTCAATGC CCGTTTATTT CCCGCTTTGT TAACAGCATT TTCTGTACCT TTAATCTATG GTATTAGTCG AGAATTATTT CACCCACGCA CTCCGGCAAT TTTTACCGCT TTGGTTTACT TAACTTTACT CCCTGTAGTG CGACATGGGC GTTTAGCAAT GTTAGATGGT GCAGTAGTTT GTTTTTTTTT ATTAATGATT TGGTGCGTAT TGCGCTCACG GCAAAACCTC CGCTATTCTC TCCCTATTGG TATTAGTTTT GCTTTAGTTT CCCTCAGCAA AGGAATTATT TTAGGATTAC TTTTAGGAGC GATCGCTTTT TTGTTTCTCT GGTGGGATAC TCCCCGACTA TTAAGCAATA AATATTTTTG GAGTGGTATC CTATTAGGTA TGTTACCAGT TTTTCTATGG TACACAGCTC AATTTTTTCA TTACGGCGTT GAATTTTTCT ATGCCAACTT TTTCCATCAA TCTTTAAAAC GTATTTGGCA ACAAGTAGGT AATCATGATG GACCTATTTG GTATTATTTA TTAGAAATTA TTAAGTATAG TTTTCCATGG CAGCTATTCT GGTTACCAGG ATTATATTTG AGTTGGAAAA ATCGTAGTCT GAGTTGGGGT AAATTAGTTT TGATTTGGAC TGGAGTTTAT CTGTTTGCTA TTTCATTAAT GAATACAAAA CTTCCTTGGT ATGTGTTACC TATTTATCCA GCTTTTGCCT TAGCAGTAGG TAGTTATATA ACAGAAATCT GGGATCAATT TCCATTAGAC TTAGGATGCT TTTGGTTTTC AGGTGATAAA TATCTTCCAA CTCACGGAGA TCATAAAAAA CATCTCTGGA GTCTTCCCAC TATTTACCAT CGTTTAGTAG TAGCTTTATT TGCACTACTA GCAATAATAG CTTGGGCTGC TTCTGTTTAC TTTAGTGGGA TTTTTGATCT AGGGGAGCAA AATTTAGCAA AACCTAACTT ACAGTTACAG TTAGTTGCTG TGGCTCTTGC ATTGACAATG ACAATGGTAA CTCTACTGTT AAATAAACAA CAGCATCAAT TTTTATTAAT TTTGATTTGG GGAACATATC TTTCACTGCT AATGTTTGTT AGCTCTCCTT ATTGGATATG GGAATTGGAA GAAAATTATC CAGTCAAACC AGTCGCAGAA ATGATTCAGA AAGATACCCC CCCGGGTCAG GTTATTTATT CTTTTGACAC TAAAGACCGT CCCTCTTTAA ATTTTTATAG CGATCGCCTC ATTAAGCGTG TTGGCCCAAA AAAAATTCAA CAGCAATGGC AAAAAACAAC TCAACCCTAT TTATTAGTTG AAGCATTAAC TCTAAATAAT CTCCCCCTAG AAAATTTTCA GGTTTTGAAT ACTGTCAAAG GATGGTCCTT AGTTACAAGA GAAGGTAAAA GATAA
|
Protein sequence | MLNKILTKKN LHSHIDLLLT LGLLLAAVVL FSTNLGTLPL RDWDEGIVAQ VAREISRDKW NWLYPTINNT PYFNKPPLIH WLIAFIYSIA GVNEFNARLF PALLTAFSVP LIYGISRELF HPRTPAIFTA LVYLTLLPVV RHGRLAMLDG AVVCFFLLMI WCVLRSRQNL RYSLPIGISF ALVSLSKGII LGLLLGAIAF LFLWWDTPRL LSNKYFWSGI LLGMLPVFLW YTAQFFHYGV EFFYANFFHQ SLKRIWQQVG NHDGPIWYYL LEIIKYSFPW QLFWLPGLYL SWKNRSLSWG KLVLIWTGVY LFAISLMNTK LPWYVLPIYP AFALAVGSYI TEIWDQFPLD LGCFWFSGDK YLPTHGDHKK HLWSLPTIYH RLVVALFALL AIIAWAASVY FSGIFDLGEQ NLAKPNLQLQ LVAVALALTM TMVTLLLNKQ QHQFLLILIW GTYLSLLMFV SSPYWIWELE ENYPVKPVAE MIQKDTPPGQ VIYSFDTKDR PSLNFYSDRL IKRVGPKKIQ QQWQKTTQPY LLVEALTLNN LPLENFQVLN TVKGWSLVTR EGKR
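The length fields in the record follow from its coordinates, and the short Python sketch below reproduces that arithmetic. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5355116
END_BP = 5356810


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1695, matching "Gene Length"
print(protein_length(length_bp))  # 564, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1695 bp) and Protein Length (564 aa) fields, and the listed gene sequence ends in TAA, a stop codon.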