Gene Information |
Locus tag | Tery_1117 |
Symbol | |
ID | 4242849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1758520 |
End bp | 1759998 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106340 |
Product | hypothetical protein |
Protein accession | YP_720952 |
Protein GI | 113474891 |
COG category | |
COG ID | |
TIGRFAM ID | |
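The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (both assumptions fit the numbers in this record, but the record itself does not state them). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1758520
END_BP = 1759998
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1759998 - 1758520 + 1 gives 1479 bp, and 1479 / 3 - 1 gives 492 aa, matching the reported gene and protein lengths.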
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00643446 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0767157 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGTC AATTTAAAAA AAACCTGTCA AGATTAAAAT CTCAATTAAA ACTTATTAGC TTAGTGGGAA CAGTAATAAT TAATTTCAGT TTTCCAGTGC CAGCTTTTGG TGCTTTTTTA ACAAACTGGT TTTTTGACCC AGATAATAAT CAACTTCAGT TTACTCTCCC AGCAGGTACA ACTCCTACTT ATTCTGTTGA AAGCAAACCT ACTCGTCTAG TTGTCTATAT TGAAGATACT AAAGTTAGTG TAAATGTGAC TGAATTATAT CCAGCAGGGT TAGTGCGTAG AGTTAGTCTC TCTCAAGAGA GAATGCAACA AGCAAAAGTT ATTATAGACT TTGCTCCGGA AGTAGCTATA TCTGCTAAAA AAGTCAAATT AGAACAGGTC GAAGCGCCAG AAAATAGTTG GAAGTTACGT TTACTTGTAG GTAAGGAAAA AACCTCAACT CCAAGAGAAA CAAAAGTTTT AAATGGAGAT GCTGTTACAC TAACAACAAA ATCTACAGAA ACCACAGAAC TAGAAGTAGA TTCTTTTACA CCAACAACAA CATCTATAGA AACCACAAAA ATAGAAAATT ATGAATCTAC CAAAGCAGAA GAGGATCAAG CTTTCACTCC ACCCAAGCAA AACAGAGTAA GCTTTCAACA GAGAGCTCCC AAAGTAGAAT TTCCTCTACC CAACCCCGCA CTTTTGGCAG AGCCTTCTAT TCCATTCTTA GTTATTCCCA CGGAAAAACG AGATGGTGTC ATATTTCCAT TAATAGACAC TTCGCGTGCT CAACAGTCTC CTACTGCATC CTCTGAAATT TCGGAATTAC CACCACTCCA AGGGATAGAA ATTGAAAATG CTTTTGTTCC ACCTCCCCTA GACCCTGAAA CTGTAGAAGA ACTAAACGAT AGAACTACTA AATCAGTTGA TGTAATTTCC TTTGGAGAAC CATTGCCTGG AACTTCAGGC AAAAATCAGC TTGAAGATGA TGCTTCTATC TTAATTGAAG CAGGAAAGAT AATATCTCTA ATTTACCCCA CTAGCGAGCT TGAACTACCA CAGGGAGTTG AAATACAGGA AGTTTTGCTT CTCCAAGAGG CCATAACTGA TGAATCTGGT AGAATTATAG TACCAGCTAG AACACCTGTG GTCGGTAGTT TTAAGACCTT CAGTCACGGT AGTCAATTTA TCGCCAGAGC TATATATTTC AATAATCGTT TAATCCCCTT TGATGCTCGA TCTCAATTGC TCGAAGGAGA CCTTGATTTT AATGAAAAAG TTTTAGCAGG TAGTAGCACT GGTAGTGGTC TAGCATTATT ACTGTTAACG GGTTCAGGTT TTGGTTTCTT GGCAGGAGCA GCTCTTGGAG CTGGTAGTGT TCTTTTTACT GCTCCTCATT CTGTCACCAT TGAGCCGGGG CTTATTGTAG AAGTTCGTGT TGTTAAAGAT TTACCCCGCT CTAGATTTTA TGATACTAGT AGTTTTTAG
|
Protein sequence | MVSQFKKNLS RLKSQLKLIS LVGTVIINFS FPVPAFGAFL TNWFFDPDNN QLQFTLPAGT TPTYSVESKP TRLVVYIEDT KVSVNVTELY PAGLVRRVSL SQERMQQAKV IIDFAPEVAI SAKKVKLEQV EAPENSWKLR LLVGKEKTST PRETKVLNGD AVTLTTKSTE TTELEVDSFT PTTTSIETTK IENYESTKAE EDQAFTPPKQ NRVSFQQRAP KVEFPLPNPA LLAEPSIPFL VIPTEKRDGV IFPLIDTSRA QQSPTASSEI SELPPLQGIE IENAFVPPPL DPETVEELND RTTKSVDVIS FGEPLPGTSG KNQLEDDASI LIEAGKIISL IYPTSELELP QGVEIQEVLL LQEAITDESG RIIVPARTPV VGSFKTFSHG SQFIARAIYF NNRLIPFDAR SQLLEGDLDF NEKVLAGSST GSGLALLLLT GSGFGFLAGA ALGAGSVLFT APHSVTIEPG LIVEVRVVKD LPRSRFYDTS SF
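The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own method: the helper names are hypothetical, the sequences are assumed to be pasted with the 10-character spacing shown above, and the codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    prefix = "ATGGTAAGTC AATTTAAAAA AAACCTGTCA"
    print(translate(prefix))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MVSQFKKNLS`, the first block of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` allows the GC content and protein sequence fields to be compared against the nucleotide data.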