Gene A9601_00961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00961 
SymbolcitT 
ID4716779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp99210 
End bp101018 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content34% 
IMG OID640077794 
ProductDASS family sodium/sulfate transporter 
Protein accessionYP_001008491 
Protein GI123967633 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.531259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAA TTGCAGTAGT TAGTAATAAT TTTGATGCGT TTATAACGGT AGTTGTTTTA 
ATAATGTCAA TAATTTTATT TATTAGAAAT ACTATTGCAC CAGAATTGAC TGGTTTGTTA
TGTGTCGGAA TATTTATATC TACTGGGGTT CTCTCTCCTG AAAAAGCTTT AGCTGGATTT
GGTAGCCCTT CTTTAATTAC CCTTATGGGT TTATTTGCAG TTTCCTCAGC ATTATTTAAA
AGTGGTGCCT TAGACAGAGT AAGAGAATTG ATTTCTTCTG AAAGTATTCG AACTCCAAGG
AAATTAATTT CTTTAATAGC TTTTTTGATT GCTCCAATAT CTGGAATTGT ACCTAATACT
CCAGTAGTAG CATCTTTGTT ACCTTTAATT GAAAGTTGGT GCGAGCGAAG AAATATATCA
CCATCAAAAG TTTTATTACC TCTTTCTTTT GCTACTTTGC TTGGAGGAAC TCTGACATTA
TTAGGTAGCT CAGTAAATCT TCTTGTAAGT GATATTAGTC AACAATTAGG TTACGGAGGT
TTGGAATTAT TTAGTTTGAC TTCAATAGGA ATTCCTGTAT GGCTGATAGG TACAACCTAT
ATGATTCTAG TTTCTGACAT GCTTTTACCA GATAGAGGGA GAGATAAGGA GTTTATTAAA
AATGGTGATA TGAATATATA TTTTACCGAA GTTACCATTC CTTCTACTTC AGAATTAGTT
GGACAATCTG TCAGAAATAG TAGATTGCAA AGACGATTTG ACGTTGATGT TCTGGAATTG
CAACGAAATG GAAAAGTTAT TCTTCCTCCT TTGGCTGATA GAAAGATCGA ACCGAATGAT
AGATTAATAA TCCGCGTTAC AAGGGCAGAC TTATTTAGGC TGCAACAGGA ACATACTATT
CTGTTAGGAG AAAACAAAAC ATCGTTCGAT GGGGCTAATG TTTTCTCAGA TGATGAAGGT
ACTAAGACCT TTGAAGCCTT GTTACCAGCT GGTTCAACTT TAGCCGGTGC AAGTTTGAGA
GAATTAAGAT TTAGGCAGCG TCATAATGCA ACAGTTTTAG CATTAAGAAG AGGTCAGCAA
ACTGTTCAGG AGAGATTAGG CCAAGCTGTT TTAAGGGCTG GAGATGTTTT ATTATTGCAA
GCACCTCTAG ATTCAATAAG AGGTTTGCAA GCTAGTAATG ATTTGCTTAT TTTAGATCAA
TTCGAAGATG ACTTACCTTT TTTGATAAAA AAACCTATAT CGATTGCAAT TGCAATAGGA
ATGGTCATTT TACCTTCGGT TTCTAATATT CCATTAGTAG GTTCAGTTCT TTTGGCAGTG
ATTGCAATGG TGGCTTGTGG ATGTTTAAGA CCTGCAGAGA TACAAAAATC AATTAGGTTA
GACGTTATTT TATTGCTGGG ATCTTTATCG TGTTTTAGTG TAGCTATGCA GATAACTGGA
TTAGCAGATG TAATTGCAGT TAATCTAAAT TTTGCCCTTA ACGGAATGCC TCTTTATTTT
GCATTAGTCG TAATTTTTGT TTCTACAGTT ATTCTTACGC AATTTATAAG TAATGCTGCT
TCGGTTGCTT TGATTTTGCC TGTTGCTATT GAATTCTCAA ATGTTTTAGA AATTTCACCA
AGCGCTTTAA TAATGCTTGT TTTGTTTGGT GCAAGTCAAT CTTTCTTGAC TCCAATGGGT
TATCAAACAA ATTTAATGGT TTATGGTCCT GGAAGATATA GATTTTTTGA TATCGCAAAA
TACGGCGCAG GATTAACACT TATAATGTCT TTTACTGTGC CAGCATTGAT AATTTTAAAT
TACGGATAA
 
Protein sequence
MNLIAVVSNN FDAFITVVVL IMSIILFIRN TIAPELTGLL CVGIFISTGV LSPEKALAGF 
GSPSLITLMG LFAVSSALFK SGALDRVREL ISSESIRTPR KLISLIAFLI APISGIVPNT
PVVASLLPLI ESWCERRNIS PSKVLLPLSF ATLLGGTLTL LGSSVNLLVS DISQQLGYGG
LELFSLTSIG IPVWLIGTTY MILVSDMLLP DRGRDKEFIK NGDMNIYFTE VTIPSTSELV
GQSVRNSRLQ RRFDVDVLEL QRNGKVILPP LADRKIEPND RLIIRVTRAD LFRLQQEHTI
LLGENKTSFD GANVFSDDEG TKTFEALLPA GSTLAGASLR ELRFRQRHNA TVLALRRGQQ
TVQERLGQAV LRAGDVLLLQ APLDSIRGLQ ASNDLLILDQ FEDDLPFLIK KPISIAIAIG
MVILPSVSNI PLVGSVLLAV IAMVACGCLR PAEIQKSIRL DVILLLGSLS CFSVAMQITG
LADVIAVNLN FALNGMPLYF ALVVIFVSTV ILTQFISNAA SVALILPVAI EFSNVLEISP
SALIMLVLFG ASQSFLTPMG YQTNLMVYGP GRYRFFDIAK YGAGLTLIMS FTVPALIILN
YG