Gene Information |
Locus tag | Tery_3568 |
Symbol | |
ID | 4244317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5484593 |
End bp | 5487043 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638108536 |
Product | hypothetical protein |
Protein accession | YP_723125 |
Protein GI | 113477064 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.592893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAC AAAATCCTAT CCCTAACATT CCAGCCATAT CCTGGAATCG CCCCATTGGT CTTGATTGGG ACAAACCTTA CACAGTCCGT TATAGTAGTA ATATTGATGA TGGCCCATGG CACGGAATGC CTCTGGGAGG TTTCGGTGCA GGTGCCACTG GTCGTTCCCC CCGGGGGGAT TTTAATCTTT GGCATCTTGA CGGTGGGGAG CACATTTATC AAAATCTGCC CGCCTGTCAA TTTAGCGTTT TTGAAGAAAC AAAAGGACAG AAACAAGCTT ATGCTTTATC TACCGAATTA CCCACTGATG GTAGTCTATC TGCTTGGCAG TGGTATCCTC GGGAAAAAGC CGGGCTCAAG ACTGGCACTT ATCACGCTCT TTACCCCCGC AGCTGGTTTG TTTATGAGAA TGTATTTACT GCACAATTAA GTTGTGAACA ATTTTCCCCA ATTTTGGCAG GTAACTATCA AGAAACCAGT TATCCGATCG CTATATTTGA ATGGACTGCC CATAATCCCA CAGATGAAGC AATTACACTT AGCATAATGC TAAGTTGGCA AAATACTGTT GGTTGGTTTA CCAATTCTGT GAAAACACCA GAAGTGAGAG TGCGAGATGA TGGTAGCCCA GTTTATGAAT ACAAACCTCG GTGGGGAGAG AGTACAGACA ACTTTAACTT ATTAGTAGAA GATTTTCACC GTATTGGTTG CACTATGACC AAATTAAGTA TTGCCGACGA ACCTGCAGAA GGAGAGGGGC AAATGGCGAT CGCGACTTTT ATCAATGCCG GCATGGAAGT TTTCTATCAT ACAAGGTGGA ACCCTACTGG TACTGGGGAG GATATCTGGC ATTACTTTGC TCTAGATGGC TCTCTCATAG ATGAAGAAAA CGAACTACCA GCAACTGAGG GAGAACAAAT TGGTGTGGCT ATATCAGTTC GTTTTACTAT CAGACCAGGG AAAAACAGGA AAATTCCTTT TTTTTTAGTT TGGGATTTAC CTGTGACTGA CTTTGGTAAT GGGGTTTCAT ATTACCGCCG TTATACAGAT TTTTATGGTC GCAATGGCAA AAATGCCTGG TCGATGATTC GTACTGCCAT GAAGCATTAT CAAACTTGGC GGGAAAATAT TGAAGCTTGG CAAAATCCGA TTTTGCAACG GGAAGATTTG CCTAATTGGT TGAAAATGGC TTTGTTGAAT GAACTTTATG ACCTCACCAG TGGTGGAACT ATTTGGGCGG CTGCTAGCGA TAGCGCTCCT TACGGTCAGT TTGCTGTTCT GGAATGTCTT GATTATCGTT GGTATGAGAG TTTGGATGTT AGGTTGTATG GTTCTTTTGG GTTGTTAATA TTGTGGCCAG AGTTGGAAAA GTCTGTTTTA GTTGCGTTTG CGCGAGCAGT TTCGACAGCG GATGATACAC TGAGAATTAT TGGTTACAAT CAAGTGTCGG CAGTGAGAAA GGTTGCTGGA GCTACTCCCC ATGACTTGGG TGCACCGAAT GAGCATCCCT GGGAAATGAC AAATTATACC AGTTATCAGG ATTGTAATCA ATGGAAGGAT TTATCTAGTG ATTTTGTATT GCAGGTATAT CGAGATTTTT TGTTGACTGG TGCAGATGAT TATGAGTTTC TCTGGCAGTC TTGGTCTGCT ATTACGGAAA CTCTAGCATA TCTTAAGGGT TTTGACAAAG ATAATGATGG CATTCCTGAA AATGAGGGAG CGCCTGACCA AACTTTTGAT GATTGGCAGT TGCGGGGTGT GAGTGCATAT TGTGGTGGAC TATGGCTGGC TGCACTTGAG 
GCAGCGATCG CTATTGGGAA AGTTTTAATT GAGCATCCTA GGGAAATTCC TTATTACCCG CCTAAGGGTT TCTATTCTGA GGTTGATAAA AATTCTGTAG ATGCAATTAA TAATCAGGTT TATTTGTATC AAGGTTGGTT AAAAAAAGGA TTGCCTATTT ATCAAGAAAA GTTATGGAAT GGGGAATATT ATCGTTTGGA TAGTGAGAGT AATTCGGAAG TAGTAATGGC TGATCAATTA TCGGGTCAAT TTTATGCCAA GCTGTTAAAT TTGGAGGATA TTGTCCCTGC TGAATGTGCT TTGTCTGCTT TAAAAACTGT CTATAATTCT TGTTTTAAAA ACTTTCATAA TGGCAAGTTT GGTGCGGCAA ATGGGGTGTT ACCTGATGGT TCTCCAGAGA ATCCTAATGC TACTCATCCT TTAGAGGTTT GGACAGGAAT TAATTTTGGT TTAGCGGCTT TTATGGTGCA AATTGGTATG AAAAAAGAAG CTCTGGAAAT AACTGAGGTA GTTGTGGGAC AAATTTATGA AAATGGTTTA CAATTCCGCA CTCCAGAAGC TATTACAGTA ATGGGTACTT TTAGAGCTAG TCATTATTTA AGGGCAATGG CAATTTGGGC AATTTATTTA GTCATAGAAG CAAAAAGGTG A
|
Protein sequence | MNQQNPIPNI PAISWNRPIG LDWDKPYTVR YSSNIDDGPW HGMPLGGFGA GATGRSPRGD FNLWHLDGGE HIYQNLPACQ FSVFEETKGQ KQAYALSTEL PTDGSLSAWQ WYPREKAGLK TGTYHALYPR SWFVYENVFT AQLSCEQFSP ILAGNYQETS YPIAIFEWTA HNPTDEAITL SIMLSWQNTV GWFTNSVKTP EVRVRDDGSP VYEYKPRWGE STDNFNLLVE DFHRIGCTMT KLSIADEPAE GEGQMAIATF INAGMEVFYH TRWNPTGTGE DIWHYFALDG SLIDEENELP ATEGEQIGVA ISVRFTIRPG KNRKIPFFLV WDLPVTDFGN GVSYYRRYTD FYGRNGKNAW SMIRTAMKHY QTWRENIEAW QNPILQREDL PNWLKMALLN ELYDLTSGGT IWAAASDSAP YGQFAVLECL DYRWYESLDV RLYGSFGLLI LWPELEKSVL VAFARAVSTA DDTLRIIGYN QVSAVRKVAG ATPHDLGAPN EHPWEMTNYT SYQDCNQWKD LSSDFVLQVY RDFLLTGADD YEFLWQSWSA ITETLAYLKG FDKDNDGIPE NEGAPDQTFD DWQLRGVSAY CGGLWLAALE AAIAIGKVLI EHPREIPYYP PKGFYSEVDK NSVDAINNQV YLYQGWLKKG LPIYQEKLWN GEYYRLDSES NSEVVMADQL SGQFYAKLLN LEDIVPAECA LSALKTVYNS CFKNFHNGKF GAANGVLPDG SPENPNATHP LEVWTGINFG LAAFMVQIGM KKEALEITEV VVGQIYENGL QFRTPEAITV MGTFRASHYL RAMAIWAIYL VIEAKR
|