Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0565 |
Symbol | |
ID | 4242955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 884921 |
End bp | 886705 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638105873 |
Product | Na+/solute symporter |
Protein accession | YP_720486 |
Protein GI | 113474425 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.426275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTTAA TTGATTATAT TATCGTCGGG TTATACCTGT TCGCAATTGT TCTGTTTGGC ATATTTCTCC AGCGAAAAGC CTCCGCTGGT ATAGACTCTT ATTTTCTGGG TGATCGGAAT ATGCCTTGGT GGGTATTGGG AGCTTCCGGA ATGGCTTCCA ATACAGACAT TGCCGGAACA ATGTTGATAA CGGCTTTAGT CTACGCCTTG GGAACAAAGG GATTTTTTTT GGAACTCCGG GGTGGCATTG CCTTAATTCT AGCAATGTTC ATGATTTTTA TGGGCAAATG GAATCGTCGA GCCCAAGTTA TGACCTTAGC AGAGTGGATG CATTTACGTT TTGGAGTCGG ACGAGAAGGA AATATCGCCC GCATAGTTAG TGCCATTGCT GCTATTATCT TGACCATTGG TAGAATTAGT TATTTTGCCA TAGGTGGGGG CAAATTCTTA GGAGAATTTA TCGCGGTCGA TGCGCGCCTC GCCTCAATTA TCATTATCTT CCTGGCATTG ATCTACACTG TTATCAGTGG TTTTTATGGG GTAGTCTTAA CAGACCTATT TCAGGGAGTA TTGATTTTTT TTGCCATCAT CTATATCTGC GCGATCGCCA TCCAACTGCC CCCTCTCCCC GAAACATTTG CTATCTCTAT TCCAGGCACT AATCAATTGC AGGAGTGGAA TTTTAGGGAG TGGAGTAGTA TATTCCCCTC CATGAAAGTA GACTTGCCAG GAGACTATGG CCGTTTCAAC TTATTCGGCG CTATCCTTTG CTTCTATCTA CTGAAGGTAT TAATGGAAGG GTTTGGGGGT ATCGGTGGTT ATATGATGCA GCGATACTTT GCTGCCAAAA GCGATCGGGA AGTGGGGTTA ATGTCCCTGT GGTGGATCTT TTTGCTTTCC TTTCGCTGGC CTTTAGTAAC AGCTTTTGCA ATCCTGGGCA TTAATTACGG CATTACTAAC CAAGTAATTT CTGACCCAGA ACTGGTCTTA CCAACAGTCA TTGCTACTTA CCTACCAGTA GGAATTAAAG GATTGATATT AGCTTGTTTT ATTGCCGCCG GAATGTCTAC TTTTGACTCC CTAATTAATG GTGGTGCTGC TTACTGGGTG AAAGATATTT ATCAAGCTTA CCTTGATCCC CTAGCTGATA ACCGCAAACT GATGTTTCAA AGTCGTTGGG CATCGGTAAT CATAGCTATG GTAGGATTAC TATTTAGCTT CAGTATCTCC AATATCAATG AAATTTGGGG ATGGTTGACT TTGGGACTGG GCACAGGTCT GGCGGTTCCT CTGCTGTTAA GATGGTATTG GTGGCGGTTT AATGGCTATG GATTCGCTAC GGGTATTGCC GCTGGCATGA TAGCCGCAAT TATTACTAAG GCCATCGTCT TACCTTCTTT ACTGGATCTG CAAATTGCCG AATTTATCCA ATTTCTAATT CCTAGTAGTT GTTCTTTAGC AGGATGCATT GTTGGAACTC TGTTAACTCC AGCAACAGAA AGGTTAGTCC TGGAGAATTT CTATACTATC ACTCGCCCCT TTGGTTTCTG GAAGCAGATC GCCACTAATG TGCCCCATTA TCTCAGGGAA AAAATCACAC TAGAGAACCA ACAGGATTTG CTGGCAACTG CGATCGCAAT TCCTTGGCAA ATAGTTTTGT GCCTTACCGG AATTATGTTT GTGATGAAAC GTTGGGATAA TTTCAAAATT CTATGTTTTT TATTGATTAT ACTTTCTATC TGTTTATACT TTGCCTGGTA CAGATATCTA AAAGTTAAAA ATTGA
|
Protein sequence | MHLIDYIIVG LYLFAIVLFG IFLQRKASAG IDSYFLGDRN MPWWVLGASG MASNTDIAGT MLITALVYAL GTKGFFLELR GGIALILAMF MIFMGKWNRR AQVMTLAEWM HLRFGVGREG NIARIVSAIA AIILTIGRIS YFAIGGGKFL GEFIAVDARL ASIIIIFLAL IYTVISGFYG VVLTDLFQGV LIFFAIIYIC AIAIQLPPLP ETFAISIPGT NQLQEWNFRE WSSIFPSMKV DLPGDYGRFN LFGAILCFYL LKVLMEGFGG IGGYMMQRYF AAKSDREVGL MSLWWIFLLS FRWPLVTAFA ILGINYGITN QVISDPELVL PTVIATYLPV GIKGLILACF IAAGMSTFDS LINGGAAYWV KDIYQAYLDP LADNRKLMFQ SRWASVIIAM VGLLFSFSIS NINEIWGWLT LGLGTGLAVP LLLRWYWWRF NGYGFATGIA AGMIAAIITK AIVLPSLLDL QIAEFIQFLI PSSCSLAGCI VGTLLTPATE RLVLENFYTI TRPFGFWKQI ATNVPHYLRE KITLENQQDL LATAIAIPWQ IVLCLTGIMF VMKRWDNFKI LCFLLIILSI CLYFAWYRYL KVKN
|
| |