Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1560 |
Symbol | |
ID | 4242118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2381858 |
End bp | 2383471 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106703 |
Product | TonB family protein |
Protein accession | YP_721313 |
Protein GI | 113475252 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.139484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAACT TTTCTTCCCT AGAGCTAGAT CAGCCTATCG AAAAATCCTA TGTAATAAAC AGCCTCCGTC AACCAATATG GATAGCACTC TTGACTTCTG TAATTATCCA TACTGTTTTA GGGGTTAACT TACCAAAACT ATCCCTATTC TCCAAGAAAG CTAAATTACC ACCAACAGTA GGATTTGTTG AATTAACACC AGAACAACTC GAGCGTCTTC CCCAACCAGA AGAACCAGAA ATTACATTCT CTGAAATGCC CGTACCAGAG AACTTTTCCC CTGTTGCACC ACCAACTCCA AGTGAAGTAC CATTTGTTGC CGAACCTCCA TCTTCAGAAT TGCCAACAAT ACCATTAGAC CCCTCAGACT ACAACTTACC AGAATTACCA CCCCTAGACC CTACTAATTT ATCCTCTCCT CTTCCCTCTG TTTCGACTCC TTCTGTTTCT AGGCTGCCTA AAAACTCACT GCCATCAGAA TTTTCCTACA ATTCACCCAT ACCAAGTATA GCACCATTAC CAATAGCTCC AAGATATCAA CCCACACCAC CTAATACAAT TCCTCCACTT ATACCGACAT TTCAACCTAA ACCTAAACCA CCACAAATTC CAGAACAAAA TCAAATAAAT GAATATCAAA TTAGACAAAA TTTAAAGTTG ACAGACGACG TTTATGAATC AGGTTTATCT CGTCGTCCAA CTCTAGAAAT AGATCCCACA AATGGAACTT CTGTCAATGG AACTTCTGTA GGTGAAAACT CAACTAAGAA ATCATACTTA GACGGTTTAC AAAAGCCTCC ACAACGTTAT CAAGATAAAG TTAAAAATCA ACAAGATAAA GTTAAAAATC AAAATGCAAT GAATAATAAA GAAGAAGATG ATACCAATAA CGTTCAACCA CTAAAAACAC CTTTACCTCC AGACCCGACC AAAAAAAATA GATTGCGAGA AGCACCAATA GTTACACAAT TACGGGAAGG AAAGACTATA AAAGAAATAG TACAGGAAAC ACAAAAAGAA AAGGTTGTTG CTACCCCAAA ACTTAGTAAA GCCCCAAAGT CTGAACCTGT TGCTCTTACT CCTAAACCAT CTCCAACTCC AGAAAAAGCA GAAGAACCTA AAAAAGATTT ACCCTCAAAC AATTTTGAAC AACTACAGCA ACAAAAGGTT ACTGATGCTT CCCCACTTCT ACAACCGCAA CCAGAATTTA AGGGTTCATT GTTGCAAAAA CTACGGCAAA AAAAGCTAAC AAATGCTACC GTTAATGAGG AGTCGCAGTC AAGGGAAACA TCTGTAGATG CTGCACTTGC TTATGCGAAT TGGGCAATAG AATTAGGAGT AGGTAAGGAA ATTTCGACTC ACGCGAAAGC TATTCCTGAT ATTTATCCAG AAGCTGCTTG TGAGCAAAAA TTGGAAGGTA AAGCTCTGGT GGGGGTTTTG GTTGATCCAG ATGGTAGTAT TAGTAAAGGA CCGAAGTTGC TCATAGAAAG CGGTTTTCCA ATATTAGATA ACGCTGCTTT AGATGCTGTA AGCAAGGAGT CATTTGATAG TAGTAATAAG CCTAAATTAT ATCGGTATGA GTTTGATTTT GATAGTTCTA ATTGTACTAG TTAG
|
Protein sequence | MSNFSSLELD QPIEKSYVIN SLRQPIWIAL LTSVIIHTVL GVNLPKLSLF SKKAKLPPTV GFVELTPEQL ERLPQPEEPE ITFSEMPVPE NFSPVAPPTP SEVPFVAEPP SSELPTIPLD PSDYNLPELP PLDPTNLSSP LPSVSTPSVS RLPKNSLPSE FSYNSPIPSI APLPIAPRYQ PTPPNTIPPL IPTFQPKPKP PQIPEQNQIN EYQIRQNLKL TDDVYESGLS RRPTLEIDPT NGTSVNGTSV GENSTKKSYL DGLQKPPQRY QDKVKNQQDK VKNQNAMNNK EEDDTNNVQP LKTPLPPDPT KKNRLREAPI VTQLREGKTI KEIVQETQKE KVVATPKLSK APKSEPVALT PKPSPTPEKA EEPKKDLPSN NFEQLQQQKV TDASPLLQPQ PEFKGSLLQK LRQKKLTNAT VNEESQSRET SVDAALAYAN WAIELGVGKE ISTHAKAIPD IYPEAACEQK LEGKALVGVL VDPDGSISKG PKLLIESGFP ILDNAALDAV SKESFDSSNK PKLYRYEFDF DSSNCTS
|
| |