Gene Information |
Locus tag | Tery_2077 |
Symbol | |
ID | 4245725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3243857 |
End bp | 3245767 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107188 |
Product | hypothetical protein |
Protein accession | YP_721791 |
Protein GI | 113475730 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
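The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 1911 bp) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon: the terminal stop codon encodes no residue.
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3243857, 3245767)
print(length, protein_length_aa(length))  # 1911 636
```

The strand field does not affect this calculation, since the length of a feature is the same on either strand.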
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.189777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.779158 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCAATC AAAATATAGC AATTACTATA GGAGTTCAGG AATATGAGTT TTTGACTCCT CTAAAATATG CAGCTAATGA CGCTAAAAAA ATGAGGGATT TTTTGCTCGA CGAAGCAGAT TTTGATGACG TTTTTTACTT GTCAGATAAT TCTCCAAAAA TTAATGGTGC TTCAACTCGC CCAACTCGTT CCCGCTTAGA ATTGGTGTTA GAAGATGAAG TTAAAAAACT ATCTCTGAAA ACTGGTGATA ACCTTTGGTT TTTTTTTAGC GGCCATGGTC ATAGAGAGAA TAATAATATT GATTATTTAA TTCCTATTGA TGGTCATTCA AATGTTGAAA GAAGTGGAAT TTCTGTTGAC TATATTATTC AACAACTGCA AAAATCTGGA GCAGATAATA TAGTTCTAAT ATTGGATGCT TGTAGAAACA AAAGTGATGG GGGAAAAGGG GGAGAAGGAC TAGGAAGGCA AACAGAACAA GAAGCTCGTG AAAAGGGGAT AGTAACTATT TTTTCTTGTA GTCCAAATGA ACGTTCTTGG GAATTAGAAG AGTTGCAACA GGGAGTTTTT ACTTATGCGT TATTAGAGGG GTTAGGTAGT AAGGGTCAAA AAGCAACTGC AGAAAGACTG AATGAATATC TGAAGTATCG GGTGCAAGAG CTAGCTCAAA AGGAAGGAAA ACGACAGACC CCTCGTATTA TTGCTGACCC TATTGAAAAG TCTCATTTAA TTTTAATGCC GAAGTATGCA ACAAAAACTG ATGTTTCTAC TCTGAAAATT GACGCTTACA GAGCTCAAAC AAATGGAGAT TTTAATTGGG CAGAACAACT ATGGATAAGA GTTTTAGAGG TAGGTTTGGA TCTTGAAGCA GTTAAGGCTC TGCAGAAAAT TGCTGTTGAT CGGTTTAGAT CTCAGTTGGT TTCCACCCCA CAGTTAGATC TAGGTTATTT TCAGAATTTA CCGGTTTCAG AAACTCAACA ACCAAGATCA ACAGAAGTAT TAGAAGGTTC AAATATTCAA ACTTCACTAC AGTCACAACG GAAGTTAGAA CCAAACTCAC AAATAAAAAC AAAGTCACAG CCAAAAATTA CTATTCATAC ATTTGCGACT CCTAAAGTTA ACAGGAAAGG AGAAATAATA AGCCGCTCTG AAGGTGAAGC AGAAGTAATG ATAGAAAATT TAGGTAATGG AGTTACTCTG GAAATGGTAA AAATACCCGG TGGTAGTTTT CTGATGGGGT CTCCAGAGAC GGAAGCACAG AGAAGTGATA ATGAAGGTCC GCAACATCAT GTAGATGTGC CAGAATTTTG GATGGGAAAG TATGTCGTTA CTCAACAACA GTGGCAAGCA ATAATGGGAA ATGATCCTTC GAAATTTAAA GGAAAAAATC GCCCTGTGGA AAGAGTAAGT TGGAATAACG CTACAGAATT TTGTCAAAAG CTCTCTAAGA AAACAGGAAG AGACTATAGA CTACCGAGTG AAGCAGAATG GGAATATGCC TGTCGTGCTG GGACAACTAC ACCTTTTTAT TTTGGAGAAA CTATCACAGG AGAATTAGCT AATTATAGAG CTTCAGAGAC TTATGCTGAT GAACCAAAAG GAGAATATAG AGAACAAACA ACTCCTGTAG GTGAGTTTCC ACCTAATGCT TTTGGTCTAT ATGACATGCA TGGGAATGTC TGGGAGTGGT GTCAGGATGT TGTGCATAGT AATTATGATG GAGCACCTGT TGATGGAAGT GCTTGGGTAA ATGGAGGCGA TAGTAGCGGT AGAGTGCTTC GTGGCGGCTC CTGGCTCAAC 
TATCCTGGGT GGTGTCGCTC TGCGAGCCGC GGCTACTATG TCTCGGTCGT GGTGGTCAGC TCCAATTTTG GTTTTCGTCT TGTGAGTTTC CCCCCCAGGA CTCCTGAATA G
Protein sequence | MTNQNIAITI GVQEYEFLTP LKYAANDAKK MRDFLLDEAD FDDVFYLSDN SPKINGASTR PTRSRLELVL EDEVKKLSLK TGDNLWFFFS GHGHRENNNI DYLIPIDGHS NVERSGISVD YIIQQLQKSG ADNIVLILDA CRNKSDGGKG GEGLGRQTEQ EAREKGIVTI FSCSPNERSW ELEELQQGVF TYALLEGLGS KGQKATAERL NEYLKYRVQE LAQKEGKRQT PRIIADPIEK SHLILMPKYA TKTDVSTLKI DAYRAQTNGD FNWAEQLWIR VLEVGLDLEA VKALQKIAVD RFRSQLVSTP QLDLGYFQNL PVSETQQPRS TEVLEGSNIQ TSLQSQRKLE PNSQIKTKSQ PKITIHTFAT PKVNRKGEII SRSEGEAEVM IENLGNGVTL EMVKIPGGSF LMGSPETEAQ RSDNEGPQHH VDVPEFWMGK YVVTQQQWQA IMGNDPSKFK GKNRPVERVS WNNATEFCQK LSKKTGRDYR LPSEAEWEYA CRAGTTTPFY FGETITGELA NYRASETYAD EPKGEYREQT TPVGEFPPNA FGLYDMHGNV WEWCQDVVHS NYDGAPVDGS AWVNGGDSSG RVLRGGSWLN YPGWCRSASR GYYVSVVVVS SNFGFRLVSF PPRTPE