Gene Tery_1057 details

Gene Information

Locus tag: Tery_1057
Symbol:
ID: 4241942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1652249
End bp: 1653973
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 39%
IMG OID: 638106289
Product: sulphate transporter
Protein accession: YP_720901
Protein GI: 113474840
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
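
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1652249, 1653973)
print(length)                           # 1725
print(expected_protein_length(length))  # 574
```

Both results agree with the Gene Length and Protein Length fields of the record.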


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0213258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0387274
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACGC AAGTTTTCAA TAAAATACAT TTCAGAAACC TTCAGGGCGA CATTTTCGGT 
GGCTTAACTG CTGCCGTGAT TGCTCTTCCA ATGGCACTTG CCTTCGGTGT TGCTTCAGGT
GCAGGTCCTG CTGCTGGCTT ATATGGTTCT GTATTAGTGG GTTTGTTTGC AGCACTGTTT
GGTGGTACTC CTACTCTAAT TTCTGAACCT ACTGGACCAA TGACGGTGGT AATGACTGCA
GTGATTGCGA ACTTAACTGC TACTAACCCA GAAAACGGCA TGACAATGGC ATTTACAGTG
GTAATGCTAG CTGGTTTATT CCAAATCAGT TTTGGTTTCT TAAAGCTGGG TAAATATATT
ACCATGATGC CTTATAATGT TATATCTGGT TTCATGTCAG GCATTGGTCT TATCCTAATT
ATCCTGCAAA TAGGTCCTTT TCTCGGACAA GCTAGTCCCA AGGGCGGTGT AATTGCTACT
ATTGAGAATC TTCCTCAACT TCTAAATAAT ATTAATCCCA TAGAAACAGG TTTAGCAGTT
CTTACAGTAG TTATCCTGTT TTCTATGCCA ACTAAACTCA AGAAAATTTT TCCAGCACCA
TTGGTAGCAT TAGTAATAGG AACAATAATT TCTATTGTAT TTTTCTCAGA TATTGATATT
CGTCGTATTG GTGAAATTCC TAGTGGTCTT CCTAGCTTAC AACTACCTTA CTTTACTGCC
GGTCAGTTAC AGTTAATGGT AGTTGATGCT ATAGTATTAG CAATGCTGGG TTGTATTGAT
GCTCTTCTTA CTTCTGTGGT AGCTGACAGT TTAACTCGTA CTCAACACGA CTCTGATAAA
GAATTAATTG GTCAAGGTTT AGGGAACCTA GCTTCTGGTT TATTTGGCGG TATTCCAGGT
GCTGGTGCGA CTATGGGTAC TGTAGTCAAT ATTAATACAG GAGCTCGCAC TGCTCTATCT
AGTATCACCC GTGCTGTCAT TTTAATGGTT GTAGTTTTGG GAGCTGCCAG TTTAACAGCA
CAAATCCCAA TGGCTGTTTT GGCAGGTATT GCCTTCCAGG TAGGCATTAA GATTATTGAC
TGGGGATTCC TCAAGCGTGC TCATCGCATT TCCTGGAAGT CGGCGATCAT TATGTACGCT
GTTATTGGTT TAACTGTATT TGTTGACTTG ATTACTGCTG TAGGTATTGG GGTATTTATC
GCTAATGTTT TGACTATTGA TCGCCTGACT CAGCTAAAAT CTGAAGATGT TAAAGCTATT
ACTGATGCTG ATGATGCGAT CATTTTAGAC AATGATGAAA AAGAATTACT AGATCGTGCT
GAAGGTCGAA TTTTACTGTT TCATTTAAGT GGTCCCATGA TATTTGGTAT TTCTAAAGCC
ATCTCTCGAC AGCACACACA TTTAAATAAT TATGAAGTTT TGATTGTAGA CTTGAGCGAA
GTACCTCACA TGGGTGTAAC TTCAGCTCTA GCAATAGAAA ATGTAATTCA GGAAACTATT
GATACAGGTC GTAATGTATT CCTAGTTGGT GCTGCAGGAA GTGTCAAACT CCGATTAGAA
AAATTAGGAG TTTTAAATAT TGTACCTTCA GAAAATATGT TGATGGATCG CAAGCAAGCA
TTGGTTAAAG CTGTGGCCTT AGTTACTTCT GATGTTAATA TTAATGATCC TATTCAGAAC
GGAGCTAAGG GTATTCAATC TGGGATTGAT AATATTATCA AATAA
 
Protein sequence
MATQVFNKIH FRNLQGDIFG GLTAAVIALP MALAFGVASG AGPAAGLYGS VLVGLFAALF 
GGTPTLISEP TGPMTVVMTA VIANLTATNP ENGMTMAFTV VMLAGLFQIS FGFLKLGKYI
TMMPYNVISG FMSGIGLILI ILQIGPFLGQ ASPKGGVIAT IENLPQLLNN INPIETGLAV
LTVVILFSMP TKLKKIFPAP LVALVIGTII SIVFFSDIDI RRIGEIPSGL PSLQLPYFTA
GQLQLMVVDA IVLAMLGCID ALLTSVVADS LTRTQHDSDK ELIGQGLGNL ASGLFGGIPG
AGATMGTVVN INTGARTALS SITRAVILMV VVLGAASLTA QIPMAVLAGI AFQVGIKIID
WGFLKRAHRI SWKSAIIMYA VIGLTVFVDL ITAVGIGVFI ANVLTIDRLT QLKSEDVKAI
TDADDAIILD NDEKELLDRA EGRILLFHLS GPMIFGISKA ISRQHTHLNN YEVLIVDLSE
VPHMGVTSAL AIENVIQETI DTGRNVFLVG AAGSVKLRLE KLGVLNIVPS ENMLMDRKQA
LVKAVALVTS DVNINDPIQN GAKGIQSGID NIIK
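
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked for GC content using only the Python standard library. It assumes that the codon assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted) and that translation stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built from the conventional compact layout: bases in the
# order T, C, A, G, with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGCTACGC AAGTTTTCAA TAAAATACAT"))  # MATQVFNKIH
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check between the two blocks. Rounding the `gc_percent` result can be compared against the GC content field.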