Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2700 |
Symbol | |
ID | 4244963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4176775 |
End bp | 4178328 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107763 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_722362 |
Protein GI | 113476301 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0664303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAATCG AAGAACTTCA AGAAAAATAT GCAGCCGGTG AAAGAAACTT TACTAATCTC AATCTTTTTG AGGCCAATCT CGCTGGAATG AATCTTAGTG GTGTAAATCT TAGTGGAGTT AACCTTAGTA TTGCCAACCT AAGTGGTACA AATTTAACTA ATTCTAATCT GAGTAAAGCT AAACTTAATA TTACTAAACT TAATGGAGCT CATCTGAGTG GTACAAATTT AAATGGTGCT GATTTGAATG TAGCCAACTT AGTTCTAGTT AATTTGAAAA GAGCTCAATT AATTGGTGCT AAATTGATTA GGGCAGAATT GATTAGAGCG CAACTGAGTG GGGCAAATTT TTCCTTAGCT GATTTAAGTG GGGCAAGTCT TGTTGAAGCC ACACTCCGAA AAACTAATCT TAGTGGTGCC AACTTAAAGG GAGCTAACTT ACGTTTTGCT TTTATTACAG AATCTAATTT AATAGAGGCC AATTTGGAAG GAGCAGACCT GAGTGGTGCA GACCTCAGCG GTTCAGACCT TAGTGGGGCT GAATTGAGAA AAAGTAATTT GACTGGTGCT AATTTGAATG GTGCTAATTT AAGCGGGGCT AATCTACGTT GGGCTGTTTT GACTGGCGCT CAAATGAACT GTGTTAATCT TAGTGATGCA AAACTGAGTG GTGCTGACTT AAGTAGAGCC AACTTGTTCC AAGCTAATCT TTTAAATGCT AGCCTAGTTC ATGTTAATTT ATCTAGTGCC TGTTTAATTG AAGTAGATTG GATTGGAGCA GATTTAACAG GAGCAAATTT AACCGGGGCA AAAATTTTTG GTACCTCTCG TCTAGGGATT AAAACAGAAG CTGTCACCTG TGATTGGCTT GATTTGAGCC CTAATGGAGA TCAACCTCAA GTTTATACTC TCAATACTCT GAAAATTAAA AGTTTCTTTC ACGAGACTCT TCCTAAAGTG AAGATAATTG TTGATACTCG CTTAGATATT TCTAGCCATT ATGCTCTAGC TGCTACATAT TATCAAATTA GCAAGTATTT ACCAGATATA AATCAACCTC CTTGTATAGA AGTTAACTCT TGTAGAACAA TTATTTCTTT TTCAATGAAC AGCGATGAAA AAATATTTGC TGTTGCTTAT ATCACTGTTC TACCTTTTCT TGATGGTTAT AAAACTCAAG AAAATATTAT CAAAATCATG AAGATACTTA ACTCTCATGA TTTCAATTCT TTAACTCCTA TAGTAGCTAA AATAATTCGG CAGTTAAGTA CAGTTTTAAA TAAATCAATT CAACAAGTAA ATAAAATTAA TAAAGACAAA GTTTTGTTGA AATTTTGTAG GGGAGTTCAG TTTTTTCAGA CTCCAACAAA AATAACTGTA ATTAATTCTA GCGGTAAAAA TTTTCATGTC TATCATCACC CTAGATTTGG TAAACGAATA GTAAATAAAG TTAGTAAAAG TACAGAAGGA GAAGTCAGAA TTCAAGTACC TCAAACAGTA CCATATTCTA TCAATACAAT TATTGATTTT ATTGATGGCT TTCATAAACC ATAA
|
Protein sequence | MIIEELQEKY AAGERNFTNL NLFEANLAGM NLSGVNLSGV NLSIANLSGT NLTNSNLSKA KLNITKLNGA HLSGTNLNGA DLNVANLVLV NLKRAQLIGA KLIRAELIRA QLSGANFSLA DLSGASLVEA TLRKTNLSGA NLKGANLRFA FITESNLIEA NLEGADLSGA DLSGSDLSGA ELRKSNLTGA NLNGANLSGA NLRWAVLTGA QMNCVNLSDA KLSGADLSRA NLFQANLLNA SLVHVNLSSA CLIEVDWIGA DLTGANLTGA KIFGTSRLGI KTEAVTCDWL DLSPNGDQPQ VYTLNTLKIK SFFHETLPKV KIIVDTRLDI SSHYALAATY YQISKYLPDI NQPPCIEVNS CRTIISFSMN SDEKIFAVAY ITVLPFLDGY KTQENIIKIM KILNSHDFNS LTPIVAKIIR QLSTVLNKSI QQVNKINKDK VLLKFCRGVQ FFQTPTKITV INSSGKNFHV YHHPRFGKRI VNKVSKSTEG EVRIQVPQTV PYSINTIIDF IDGFHKP
|
| |