Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1057 |
Symbol | |
ID | 4241942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1652249 |
End bp | 1653973 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106289 |
Product | sulphate transporter |
Protein accession | YP_720901 |
Protein GI | 113474840 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0213258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0387274 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACGC AAGTTTTCAA TAAAATACAT TTCAGAAACC TTCAGGGCGA CATTTTCGGT GGCTTAACTG CTGCCGTGAT TGCTCTTCCA ATGGCACTTG CCTTCGGTGT TGCTTCAGGT GCAGGTCCTG CTGCTGGCTT ATATGGTTCT GTATTAGTGG GTTTGTTTGC AGCACTGTTT GGTGGTACTC CTACTCTAAT TTCTGAACCT ACTGGACCAA TGACGGTGGT AATGACTGCA GTGATTGCGA ACTTAACTGC TACTAACCCA GAAAACGGCA TGACAATGGC ATTTACAGTG GTAATGCTAG CTGGTTTATT CCAAATCAGT TTTGGTTTCT TAAAGCTGGG TAAATATATT ACCATGATGC CTTATAATGT TATATCTGGT TTCATGTCAG GCATTGGTCT TATCCTAATT ATCCTGCAAA TAGGTCCTTT TCTCGGACAA GCTAGTCCCA AGGGCGGTGT AATTGCTACT ATTGAGAATC TTCCTCAACT TCTAAATAAT ATTAATCCCA TAGAAACAGG TTTAGCAGTT CTTACAGTAG TTATCCTGTT TTCTATGCCA ACTAAACTCA AGAAAATTTT TCCAGCACCA TTGGTAGCAT TAGTAATAGG AACAATAATT TCTATTGTAT TTTTCTCAGA TATTGATATT CGTCGTATTG GTGAAATTCC TAGTGGTCTT CCTAGCTTAC AACTACCTTA CTTTACTGCC GGTCAGTTAC AGTTAATGGT AGTTGATGCT ATAGTATTAG CAATGCTGGG TTGTATTGAT GCTCTTCTTA CTTCTGTGGT AGCTGACAGT TTAACTCGTA CTCAACACGA CTCTGATAAA GAATTAATTG GTCAAGGTTT AGGGAACCTA GCTTCTGGTT TATTTGGCGG TATTCCAGGT GCTGGTGCGA CTATGGGTAC TGTAGTCAAT ATTAATACAG GAGCTCGCAC TGCTCTATCT AGTATCACCC GTGCTGTCAT TTTAATGGTT GTAGTTTTGG GAGCTGCCAG TTTAACAGCA CAAATCCCAA TGGCTGTTTT GGCAGGTATT GCCTTCCAGG TAGGCATTAA GATTATTGAC TGGGGATTCC TCAAGCGTGC TCATCGCATT TCCTGGAAGT CGGCGATCAT TATGTACGCT GTTATTGGTT TAACTGTATT TGTTGACTTG ATTACTGCTG TAGGTATTGG GGTATTTATC GCTAATGTTT TGACTATTGA TCGCCTGACT CAGCTAAAAT CTGAAGATGT TAAAGCTATT ACTGATGCTG ATGATGCGAT CATTTTAGAC AATGATGAAA AAGAATTACT AGATCGTGCT GAAGGTCGAA TTTTACTGTT TCATTTAAGT GGTCCCATGA TATTTGGTAT TTCTAAAGCC ATCTCTCGAC AGCACACACA TTTAAATAAT TATGAAGTTT TGATTGTAGA CTTGAGCGAA GTACCTCACA TGGGTGTAAC TTCAGCTCTA GCAATAGAAA ATGTAATTCA GGAAACTATT GATACAGGTC GTAATGTATT CCTAGTTGGT GCTGCAGGAA GTGTCAAACT CCGATTAGAA AAATTAGGAG TTTTAAATAT TGTACCTTCA GAAAATATGT TGATGGATCG CAAGCAAGCA TTGGTTAAAG CTGTGGCCTT AGTTACTTCT GATGTTAATA TTAATGATCC TATTCAGAAC GGAGCTAAGG GTATTCAATC TGGGATTGAT AATATTATCA AATAA
|
Protein sequence | MATQVFNKIH FRNLQGDIFG GLTAAVIALP MALAFGVASG AGPAAGLYGS VLVGLFAALF GGTPTLISEP TGPMTVVMTA VIANLTATNP ENGMTMAFTV VMLAGLFQIS FGFLKLGKYI TMMPYNVISG FMSGIGLILI ILQIGPFLGQ ASPKGGVIAT IENLPQLLNN INPIETGLAV LTVVILFSMP TKLKKIFPAP LVALVIGTII SIVFFSDIDI RRIGEIPSGL PSLQLPYFTA GQLQLMVVDA IVLAMLGCID ALLTSVVADS LTRTQHDSDK ELIGQGLGNL ASGLFGGIPG AGATMGTVVN INTGARTALS SITRAVILMV VVLGAASLTA QIPMAVLAGI AFQVGIKIID WGFLKRAHRI SWKSAIIMYA VIGLTVFVDL ITAVGIGVFI ANVLTIDRLT QLKSEDVKAI TDADDAIILD NDEKELLDRA EGRILLFHLS GPMIFGISKA ISRQHTHLNN YEVLIVDLSE VPHMGVTSAL AIENVIQETI DTGRNVFLVG AAGSVKLRLE KLGVLNIVPS ENMLMDRKQA LVKAVALVTS DVNINDPIQN GAKGIQSGID NIIK
|
| |