Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0537 |
Symbol | |
ID | 4244505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 850299 |
End bp | 852104 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638105848 |
Product | hypothetical protein |
Protein accession | YP_720462 |
Protein GI | 113474401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.148715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00787083 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCAAATA CTTTAATATC CAAATCAAAA ATAACTAGAT TTTTATTATT CTTACATAAA AACTTAATAA ATAGAAGCAG GATATTTTGG TTAAGTTTAA CCATAGCAGT TGTATTAATT TATGGTATTG AATATCTAAA AGCAGCTTTT GAAACTAAAT ATATAATCCA AGATGATGCA CGACAACATA TATTTTGGAT GCGTCGTTTT TTCGATACAG AATTATTTCC CGAAGACTTA ATAGCTAACT ATTTTCAGTC AGTAGCACCT TGGGGTTATC AAACTTTTTA TTGGTTAATA ACATCTCTAG GTATCGACCC AATTTTTTTC GGTAAATTAT TACCAATATT TCTGGGATTA ATTTCTACAA TTTACTGCTT TGGAATCAGT TTACAAATTC TGCCAATTCC CGCCGTCGGA TTTTTGAGCT CATTTATTTT AAACCAAAGT TTATGGATGG AAGATGACTT AGTTTCTGCT ACTCCCCGAG CATTTTTCTA TCCACTTTTT TTAGCATTTT TATATTACTT ACTCCGAGGT TCCTTATTTC CTGTTTTAGT AGCGATCGCT CTCCAAGCAA TTTTTTATCC CCAGACTGTT TTACTATCAC TAAGCATCCT AACTATTAGA CTATTCTCCT ATCAGCAAAA GCGGTTAAAA TTCACCTCAA TTAAACTAAA TTATTTACTT TGGTTAGGAG CAATAATAAC TGCTGCCATA ATACTTTTAC CATACAAATT AACTGCTACT GAATTCGGAC CAATTATTAC TACTACCGCA GCCAAAATAC AACCTATTTT CAATTATGCT GATAGTAAAT ATGGCAGAGC ATTCTTTTTT CATCATAACC CCTTAGTTTT TTGGCTCACA GGACCAAGGA GTGGTATCTT ATTTGTAGGA TTATTCTCAC CACTGGCTAT AGCTTCATTA CTATTACCTT TTTTACTTAA AAAAGAAAAA TTTCCTCTAG GAAAACAAGT TTCGGAAAAA GTAGGAATAT TAGTTCAAAT TTTCATAGCA TCAGTAGGAT TATTTTTTCT AGCTCATATC TTTTTATTTC AGCTTCATTT TCCCAATAGA TATATTTACC ATAGTATACG AGTGACGATG GCAATAACTG CTGGTATTGC CTTAATAATC TGGTTAGATA GTTATTTAAA GGCAACTATT TATCAGATAA AAAATAGTTT TACTTGTCTA CAGGGAATAT ACCTCGGATG TACAACTTTG TTATTAGTAT TATTCAGTAT TATTCCTTTT TCCACAGACT TAACAATAGA TAATCAATCC TACATTAAAG GCAAAGAAAA AGAATTATAT GAATACTTAT TAGTACAACC TAAAGATACA TTAATTGCTT CCATATCTAA GGAATCAAAT AATATTCCTA CCTTTGCTCA ACGTTCGACC TTAGTAGCCC AAGAATATAG CTTACCCTAT CACGTAGGAT ATTATAATCA ATTTAGTCAA AGAGCCATTG ATTTAATTCA GGCTCAATAT ACTCCTAACC CAGAACAAGT TAATAATTTT ATTCAAAATT ATGGGGTTGA TTTTTGGTTG CTCGATCTAA CTGCTTATAA TCCTAGATAT GTAGCTGATA AACAGTTAAT TCGTCAGTAT AATTTAGCAG ATTCAATTAT TTATCAACTG GAGCAAAATA TGATCCCGGC ATTATCAACA ACCATGGAAA TTTGTAGTGT ATTAAGCAGT AAGCGAATAA CATTATTATC AACATCATGT ATTACAAATG AGTTGATAAA ATTGAGTATT AATAATCCTA AAAATTATCA CAAACATAAG ACTTGA
|
Protein sequence | MSNTLISKSK ITRFLLFLHK NLINRSRIFW LSLTIAVVLI YGIEYLKAAF ETKYIIQDDA RQHIFWMRRF FDTELFPEDL IANYFQSVAP WGYQTFYWLI TSLGIDPIFF GKLLPIFLGL ISTIYCFGIS LQILPIPAVG FLSSFILNQS LWMEDDLVSA TPRAFFYPLF LAFLYYLLRG SLFPVLVAIA LQAIFYPQTV LLSLSILTIR LFSYQQKRLK FTSIKLNYLL WLGAIITAAI ILLPYKLTAT EFGPIITTTA AKIQPIFNYA DSKYGRAFFF HHNPLVFWLT GPRSGILFVG LFSPLAIASL LLPFLLKKEK FPLGKQVSEK VGILVQIFIA SVGLFFLAHI FLFQLHFPNR YIYHSIRVTM AITAGIALII WLDSYLKATI YQIKNSFTCL QGIYLGCTTL LLVLFSIIPF STDLTIDNQS YIKGKEKELY EYLLVQPKDT LIASISKESN NIPTFAQRST LVAQEYSLPY HVGYYNQFSQ RAIDLIQAQY TPNPEQVNNF IQNYGVDFWL LDLTAYNPRY VADKQLIRQY NLADSIIYQL EQNMIPALST TMEICSVLSS KRITLLSTSC ITNELIKLSI NNPKNYHKHK T
|
| |