Gene Tery_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2974 
Symbol 
ID4245090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4625247 
End bp4626683 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content38% 
IMG OID638108012 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_722605 
Protein GI113476544 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.154355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAC AAATTTGGAT TGGGATAACA TTTATAGCCT TTTTGCTATC ATTTACAGTA 
GTAGGCATTT ACTCCGCAAC ACAAAAGCAA AATACAACAA CTGATTACTT ACTTGCCAGT
AGAAATGTTA ATCCCTGGTT GACAGCACTA TCCGCAATGG CAACAGGTCA GAGTGGGTTT
CTATTTATTG GTTCGATAGG TTTTATCTAT AAAGTTGGAT TTGCTGCTAT TTGGATACCC
CTTGCTTGGA CAATAGGAGA CTATATTGCT TGGTTGTTAA TATTTAAAAG GTTGAGGTTA
GTTTCTCAGG AAACAGACTC AGATACAATC TCCTCATTCT TAGGTCAAGA AAATCTAAGT
CCAAAAAATC AAGGGCGCTC GATTACAATA ATTTCAGCAC TAATTACCAT AGGAATTCTG
GGTACTTATG CTGCAGCTCA ACTGGTAGCA GCAAGCAAAG GACTGAATGC TATATTTGGT
TGGAACTATG AACTGGGTAT TATTGCTGGG GCTGTAATTG TGGTTGTCTA CTGTTTTTCA
GGAGGTATCC GTGCTTCTAT ATGGACTGAC TCTGTGCAGG GAATTTTAAT GATATTATCT
CTGTTGATTT TGTGTATAGT AAGTTTACTG GCTTGTGGAG GGTTGACAGA ACTTTGGGTC
AAGCTTAATG CCATTGACCC CACTCTAACA AATTGGATGC CTACTAATTT ACCTTGGGGG
TTTTTTCCTT ACTTTTTGGG GTGGTTGGTG TCAGGCTTGG GTGTTGTCGG TCAACCTCAT
GTATTAGTAA GAGCAATGGC AATTGACTCT GCAGATAATA TAGCGTTAGC TCGTAACATA
AAATTAGTCT GCGGTCTAAT GAATTCGGCT ACAGCTTTTG GTATAGGATT AACTGCCAGA
GTTTTGTTAC CTGAATTAAT GACATCTGGT GACCCAGAGT TAGCATTACC GAATCTATCT
ATAGAATTAT TGCCAGCAGT TTTAGTAGGG TTGATGTTAG CAGGACTTTT TTCTGCAGCT
ATTTCTACAG CAGATTCTCA AATATTATCA TGTTCTGCTG CACTAAGTCA AGATTTAGTT
CCCAGTGGAT CTAACTCTTA TCGAAAAGCT AAAATTGCTA CCTTAGCTGT TACTGCTTTT
GTATTAGCGA TCGCTCTCAT AACAAACAAT AGTGTATTTG CTTTGGTCAT TTTCTCTTGG
TCAGTTTTAG CCTGCGCTTT AGGTCCGTTG TTAGTATTGC GAGTGTGGCA AAAACCTGTA
AGGGTTCCAG TCGCAATAAC AATGATGATT ACTGGTATAG TAGTTGCGAT TATATGGAAT
AAAGGCTTTA ACCTATCAAG CGCTATTTAT GAAGTCTTGC CTGGTATGGC AGCAGGCTTT
ATTGTTTATG GAATTGCTAA TTTGCAAATT TGGCCTAAAG ATTTGAGTAA ACAATAA
 
Protein sequence
MEKQIWIGIT FIAFLLSFTV VGIYSATQKQ NTTTDYLLAS RNVNPWLTAL SAMATGQSGF 
LFIGSIGFIY KVGFAAIWIP LAWTIGDYIA WLLIFKRLRL VSQETDSDTI SSFLGQENLS
PKNQGRSITI ISALITIGIL GTYAAAQLVA ASKGLNAIFG WNYELGIIAG AVIVVVYCFS
GGIRASIWTD SVQGILMILS LLILCIVSLL ACGGLTELWV KLNAIDPTLT NWMPTNLPWG
FFPYFLGWLV SGLGVVGQPH VLVRAMAIDS ADNIALARNI KLVCGLMNSA TAFGIGLTAR
VLLPELMTSG DPELALPNLS IELLPAVLVG LMLAGLFSAA ISTADSQILS CSAALSQDLV
PSGSNSYRKA KIATLAVTAF VLAIALITNN SVFALVIFSW SVLACALGPL LVLRVWQKPV
RVPVAITMMI TGIVVAIIWN KGFNLSSAIY EVLPGMAAGF IVYGIANLQI WPKDLSKQ