Gene Tery_0565 details

Gene Information

Locus tag: Tery_0565
Symbol:
ID: 4242955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 884921
End bp: 886705
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 42%
IMG OID: 638105873
Product: Na+/solute symporter
Protein accession: YP_720486
Protein GI: 113474425
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
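
The coordinate and length fields above are consistent with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(884921, 886705)
print(length)                  # 1785
print(protein_length(length))  # 594
```

Both results match the Gene Length and Protein Length fields of this record.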


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.426275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTAA TTGATTATAT TATCGTCGGG TTATACCTGT TCGCAATTGT TCTGTTTGGC 
ATATTTCTCC AGCGAAAAGC CTCCGCTGGT ATAGACTCTT ATTTTCTGGG TGATCGGAAT
ATGCCTTGGT GGGTATTGGG AGCTTCCGGA ATGGCTTCCA ATACAGACAT TGCCGGAACA
ATGTTGATAA CGGCTTTAGT CTACGCCTTG GGAACAAAGG GATTTTTTTT GGAACTCCGG
GGTGGCATTG CCTTAATTCT AGCAATGTTC ATGATTTTTA TGGGCAAATG GAATCGTCGA
GCCCAAGTTA TGACCTTAGC AGAGTGGATG CATTTACGTT TTGGAGTCGG ACGAGAAGGA
AATATCGCCC GCATAGTTAG TGCCATTGCT GCTATTATCT TGACCATTGG TAGAATTAGT
TATTTTGCCA TAGGTGGGGG CAAATTCTTA GGAGAATTTA TCGCGGTCGA TGCGCGCCTC
GCCTCAATTA TCATTATCTT CCTGGCATTG ATCTACACTG TTATCAGTGG TTTTTATGGG
GTAGTCTTAA CAGACCTATT TCAGGGAGTA TTGATTTTTT TTGCCATCAT CTATATCTGC
GCGATCGCCA TCCAACTGCC CCCTCTCCCC GAAACATTTG CTATCTCTAT TCCAGGCACT
AATCAATTGC AGGAGTGGAA TTTTAGGGAG TGGAGTAGTA TATTCCCCTC CATGAAAGTA
GACTTGCCAG GAGACTATGG CCGTTTCAAC TTATTCGGCG CTATCCTTTG CTTCTATCTA
CTGAAGGTAT TAATGGAAGG GTTTGGGGGT ATCGGTGGTT ATATGATGCA GCGATACTTT
GCTGCCAAAA GCGATCGGGA AGTGGGGTTA ATGTCCCTGT GGTGGATCTT TTTGCTTTCC
TTTCGCTGGC CTTTAGTAAC AGCTTTTGCA ATCCTGGGCA TTAATTACGG CATTACTAAC
CAAGTAATTT CTGACCCAGA ACTGGTCTTA CCAACAGTCA TTGCTACTTA CCTACCAGTA
GGAATTAAAG GATTGATATT AGCTTGTTTT ATTGCCGCCG GAATGTCTAC TTTTGACTCC
CTAATTAATG GTGGTGCTGC TTACTGGGTG AAAGATATTT ATCAAGCTTA CCTTGATCCC
CTAGCTGATA ACCGCAAACT GATGTTTCAA AGTCGTTGGG CATCGGTAAT CATAGCTATG
GTAGGATTAC TATTTAGCTT CAGTATCTCC AATATCAATG AAATTTGGGG ATGGTTGACT
TTGGGACTGG GCACAGGTCT GGCGGTTCCT CTGCTGTTAA GATGGTATTG GTGGCGGTTT
AATGGCTATG GATTCGCTAC GGGTATTGCC GCTGGCATGA TAGCCGCAAT TATTACTAAG
GCCATCGTCT TACCTTCTTT ACTGGATCTG CAAATTGCCG AATTTATCCA ATTTCTAATT
CCTAGTAGTT GTTCTTTAGC AGGATGCATT GTTGGAACTC TGTTAACTCC AGCAACAGAA
AGGTTAGTCC TGGAGAATTT CTATACTATC ACTCGCCCCT TTGGTTTCTG GAAGCAGATC
GCCACTAATG TGCCCCATTA TCTCAGGGAA AAAATCACAC TAGAGAACCA ACAGGATTTG
CTGGCAACTG CGATCGCAAT TCCTTGGCAA ATAGTTTTGT GCCTTACCGG AATTATGTTT
GTGATGAAAC GTTGGGATAA TTTCAAAATT CTATGTTTTT TATTGATTAT ACTTTCTATC
TGTTTATACT TTGCCTGGTA CAGATATCTA AAAGTTAAAA ATTGA
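
The gene sequence above is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the GC content field. The helper names are hypothetical, and the example runs only on the first displayed line, so its result is not expected to equal the 42% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used as a small example.
first_line = "ATGCATTTAA TTGATTATAT TATCGTCGGG TTATACCTGT TCGCAATTGT TCTGTTTGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

To check the record's GC content field, pass the complete gene sequence to `clean_sequence` instead of a single line.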
 
Protein sequence
MHLIDYIIVG LYLFAIVLFG IFLQRKASAG IDSYFLGDRN MPWWVLGASG MASNTDIAGT 
MLITALVYAL GTKGFFLELR GGIALILAMF MIFMGKWNRR AQVMTLAEWM HLRFGVGREG
NIARIVSAIA AIILTIGRIS YFAIGGGKFL GEFIAVDARL ASIIIIFLAL IYTVISGFYG
VVLTDLFQGV LIFFAIIYIC AIAIQLPPLP ETFAISIPGT NQLQEWNFRE WSSIFPSMKV
DLPGDYGRFN LFGAILCFYL LKVLMEGFGG IGGYMMQRYF AAKSDREVGL MSLWWIFLLS
FRWPLVTAFA ILGINYGITN QVISDPELVL PTVIATYLPV GIKGLILACF IAAGMSTFDS
LINGGAAYWV KDIYQAYLDP LADNRKLMFQ SRWASVIIAM VGLLFSFSIS NINEIWGWLT
LGLGTGLAVP LLLRWYWWRF NGYGFATGIA AGMIAAIITK AIVLPSLLDL QIAEFIQFLI
PSSCSLAGCI VGTLLTPATE RLVLENFYTI TRPFGFWKQI ATNVPHYLRE KITLENQQDL
LATAIAIPWQ IVLCLTGIMF VMKRWDNFKI LCFLLIILSI CLYFAWYRYL KVKN
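
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship, assuming the standard codon-to-amino-acid assignments (which table 11 shares with table 1) and a CDS that starts with ATG, as this one does. The alternative start codons that table 11 permits are not handled, and ambiguous bases such as N would raise a `KeyError`. The function name is hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (NCBI layout for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # remove display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCATTTAA TTGATTATAT TATCGTCGGG"))  # MHLIDYIIVG
print(translate("AAAGTTAAAA ATTGA"))                  # KVKN
```

The two outputs match the first ten and last four residues of the protein sequence listed above.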