Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2769 |
Symbol | |
ID | 4244802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4291521 |
End bp | 4294466 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638107828 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_722425 |
Protein GI | 113476364 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.394148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGAA TTTCTCCTGA ACTCTATTCT AAACTGAGTG CATCTCTGTC AAGGTGTGAG CAATTTAACA GCAATGAAGA ACTACAGAGT TTTTTCAACG GTCACCGTCA GCTCTCCCTT TGGGCTAATA GTTTACCACA AGGTATCAAT ACGGACCAAC GAGTAACACA CGTGATTGGC TTTTTAGTGG GTAAATATCA TACCAATAAG AATGGACTCG TTATTTTGGT AAGGTTGCTT TGTGAGACGT TTGACCGAGA AGATAGTCGG CATAGCACTT TATCTAAATT AGAAAAAGAG CTTAATAAAC AGCTGAATAG TGTTGATAAC AAAACTAGCT TTAATTGGTT ACACTTAACC GACTTTCATC AAGTCATAAA AGAACAGAAT GGGGGATCAT CTCGCGTAAA AAAAAGATTT TTTGAAGATT TAAACAGGCT CCACGAGAAA AGTGAGCCGT GGGACTTAGT GCTATTTACT GGTGATTTGA CTCAGGAGGG TAGTGCTGAA GAATTTGAAA AGCTCGATCA GCTTTTGGAA CAACTATGGG CAAAATTCCA AGAATGGGGT TCGTCTCCTA AGCTACTAGC AATACCTGGA AATCACGATT TAGTTAGACC TAATCAAAAA GAACCAGCGG TAAGACTTCT AAAACGGTGG TCTGAGGAAC CAGAGGTACA ACAAGAATTT TGGGAGGATG CAAAATCAGA CTATCGTCTA GTCATGGCTG AAGCTTTTGC AAATTATATG GCTTGGTGGA AAAGACAACC CTGGAAGCCA GAACACCTAA AAGCTGGTAT TCTACCAGGA GATTTTTCAG CCACTATCGA AAAGAATAGT GCAAAACTCG GAATTTTGGG TTTGAATACA AGCTTTCTTC AGCTTATTGG TGAAAATAAT GAGGGCAAAC TGGCTATTCA TACCCACCAA TTTCACCAAG CTTGTGACGA CAATAACACC CAAAATTGGG CTAGAAAACA TCGTGCCTGT TTGTTACTGA CCGATCATCC TGTCGATGCT CAAATGCACT TGAATACAAA AATTACTGCC GGTTACCCCT TTGTCCTTTA TCTACGTGAC CATACTGATA CTAAAGCTAA ACAAATTTGG CATTACTATA CAGTTAATAA AATTGAACTT GAACTCATTA AAGGCAAGGA TCAGTTGAAA TCTTGGTTTC TTGAAGCATC CCGAGAAGGG GGAAAAATCC AAGAATGGGA AATTCCTATT TTCAATGATC GTCTTAATTC AATACCATCC AGACCCCCTG TGGGAAAAGA AGAGCTGATT GGAGATGAAC CTGGGCTAAC CCAAAATCTT ACCAGGGGGA TAACTCTGTT TTTGTCCCAA AATCCTACCG GTTGGATAAT TTCGTTTTTG TTGATTCTGA TTGATATTTT TTCAATTTAT CGCTGGTTAA TTTATCTTCC TAGACATGAG CCTCTTGGTG ATAACATTAG TATAGGTGAG GAAATTCTGG TAGAATCATC TAGACCTCTA GAAAAAGAAA ATGGTGTAAA AGAAGTTCAA TACTGCCAGA AATCCTGGAA TCATTTTCAA GCTATTTGGA AAAATAATAC TAAAATACAG AATTGTTTTG CTGGAGTGGC AAATACTCTT AAGGAAAGCT GGAAGAATGA AAGGCGGGAC CCAGAAACTT TAATTTATAT CAATAATGCT TTTTTAGAAC ATATAAAAGC TGATCCTTAT ACTATTGCAG TTGTAGTTCC AGTTATAGAT CAAAATGAGC AGGTGAATGG AGAACTTGCT GAAGAAATTT TGCGGGGAGT AGCTCAAGCA CAAACAGAAG TTAACTTAAG TTTATTCAAG AAAAAGCAAT TCTCTTTTTT GCCTTTTCCA AATGTAGATC TAAAAGCCAA AAGGTTTAAT GATAATGATA ATGATAAAGG CTTAAAAGTT ATTATTGCTA ATGATGCTAA CTCAGAGGAT GGCGCAGAGA ACGTTGCGAA GAAAATAGTC GGCCGCCCAG AAATTTTAGG TGTTGTTGGT CATTGGGCTA GTGAGATGAC TATGGCGACT AAAGATATAT ATGATGATGC AAAACTAGTA ATGGTTTCCC CAGGCACAAC CACTTCTAAA CTCACCGCCG AAGAAAGGGT AGACGTTTTT TTCCGGACTA CCACAACGAC TATTGAGCAA GCAGAGAACA TGGTTAACTC TTTGCTGAAT AAAAATCAAA CAAGAGTTGT AATTTTTTAT AATCCTAAGA GTTATTATTC CGCTGATTTA AAAAAACAAT TTGAAGAAAA ATTTGAAGGA AAAGGAGAAA TAATTAATTT ATCACTTAAT ATGGATTATT TTGCTGAGGA TAATTTTAAG GTTAAAGATG CCATCAATGA GGCCCGCAGA CAAGCAGGAG ATCAGGAATT CGCAATTGTT TTAATATCAG ATGGTCAAGT AAGTGATGCT TTCGATAATA GTCTCAAAAT TATTGAAGAA AATGGTGGTC AAAATTGGAT AGTAGCTAAT TGGTCAGTTT ATAGCCCGAG AACCTTGAAA ATTGCTCAAG ATCAATCCCA AGAAACACGA TATCAACTCC TAGAAAAACT GATTTTGATT GTTCCTGGGC ATCCTCTCAA CAATTCTGAT TTTTTTGATA CCGCTGTCAA ACTTTGGGAA GGTTATGTTA GCGCCCGTAC AGCTTTTAGC TATGACGCAA TGCAGGTAAT TCTTAAAGGT ATTCAGGAAC AAGGTACTCG CCCTACTAGC AAAGGAATTC AGAAGACATT GGCAGATGAA AATTTCATCG TTCAGGGGGC TACAGGAGAA ATTATATTTA AATCAGGGAC TGGCGATCGT CAAAAAGTAC CGTTGAACTC AATTCGGGTT TATCGTTGTC CGAGTCAGCC GTCCGGTTTC ATGTTTATTC CTGACAAATT TTCAACACCT GAAGAAGCAA AGGTGAAATG CTCGAATTCT GAGTAG
|
Protein sequence | MGGISPELYS KLSASLSRCE QFNSNEELQS FFNGHRQLSL WANSLPQGIN TDQRVTHVIG FLVGKYHTNK NGLVILVRLL CETFDREDSR HSTLSKLEKE LNKQLNSVDN KTSFNWLHLT DFHQVIKEQN GGSSRVKKRF FEDLNRLHEK SEPWDLVLFT GDLTQEGSAE EFEKLDQLLE QLWAKFQEWG SSPKLLAIPG NHDLVRPNQK EPAVRLLKRW SEEPEVQQEF WEDAKSDYRL VMAEAFANYM AWWKRQPWKP EHLKAGILPG DFSATIEKNS AKLGILGLNT SFLQLIGENN EGKLAIHTHQ FHQACDDNNT QNWARKHRAC LLLTDHPVDA QMHLNTKITA GYPFVLYLRD HTDTKAKQIW HYYTVNKIEL ELIKGKDQLK SWFLEASREG GKIQEWEIPI FNDRLNSIPS RPPVGKEELI GDEPGLTQNL TRGITLFLSQ NPTGWIISFL LILIDIFSIY RWLIYLPRHE PLGDNISIGE EILVESSRPL EKENGVKEVQ YCQKSWNHFQ AIWKNNTKIQ NCFAGVANTL KESWKNERRD PETLIYINNA FLEHIKADPY TIAVVVPVID QNEQVNGELA EEILRGVAQA QTEVNLSLFK KKQFSFLPFP NVDLKAKRFN DNDNDKGLKV IIANDANSED GAENVAKKIV GRPEILGVVG HWASEMTMAT KDIYDDAKLV MVSPGTTTSK LTAEERVDVF FRTTTTTIEQ AENMVNSLLN KNQTRVVIFY NPKSYYSADL KKQFEEKFEG KGEIINLSLN MDYFAEDNFK VKDAINEARR QAGDQEFAIV LISDGQVSDA FDNSLKIIEE NGGQNWIVAN WSVYSPRTLK IAQDQSQETR YQLLEKLILI VPGHPLNNSD FFDTAVKLWE GYVSARTAFS YDAMQVILKG IQEQGTRPTS KGIQKTLADE NFIVQGATGE IIFKSGTGDR QKVPLNSIRV YRCPSQPSGF MFIPDKFSTP EEAKVKCSNS E
|
| |