Gene P9211_00921 details

Gene Information

Locus tag: P9211_00921
Symbol: citT
ID: 5730889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 97251
End bp: 99080
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 41%
IMG OID: 641284435
Product: DASS family sodium/sulfate transporter
Protein accession: YP_001549977
Protein GI: 159902633
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID:
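
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 97251
end_bp = 99080
gene_length_bp = 1830
protein_length_aa = 609

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the listed gene length and protein length both agree with the coordinates.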


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAGT TAATAGTGGT TTTAGAAAAC CCTCAGGCGC TAATAACTTT GGGAGTTTTG 
GTATTAGCAG TAGTGCTATT TATTAGTGGT TTGCTTGCAC CTGAATTAAC CGGTCTTTTA
AGTGTGGCGT TGTTAATGGC TACAGGAGTC CTGTCACCTC ACAAGGCTTT ATCTGGTTTT
GGTAGTCCAG CCTTGATAAC TTTAATGGGA TTGTTCGCTG TATCTGCAGC ACTATTTAAA
AGTGGTGCGT TAGATAGATT GCGAGAATTT ATAGCTTCAG AAAGTATTAG AACTCCTCGT
CGTTTAATTG CATTTTTAGG CTTTATTGTA GCTCCCATAT CAGGGATAGT TCCTAATACA
CCTGTAGTTG CATCACTTTT ACCTGTGATA GAAGCTTGGT GTTTTAAACG CAAACTTTCC
CCTTCCAGGG TATTACTACC TCTTTCCTTT GCAACAGTTT TAGGAGGCAC TCTTACTTTA
TTAGGTAGCT CGGTGAATCT ATTGGTTAGT GATATAAGTC AGCAACTTGG GTATGGGTCT
TTAGAATTAT TTAGTTTCAC TGCGATAGGT GTTCCAATTT GGCTGGTGGG GACAGCTTAT
TTATTGTTAG CGCCTCAAAG ACTGTTGCCT GACCGAGGAA GAGATAATGG GGAATTTGGA
GGTAGTGCAG ACCAGACGGG ATACTTCACT GAAGTGACTA TTCCCATAGA TTCAGATTTA
GTTGGACAGT CTCTGCATAA CAGTAGATTG CAACGTCGAT TTGATGTTGA TGTTTTGGAG
CTTCAAAGGG GAAAGGAAAG ATTGCTTCCC CCGCTCGCAG ATAGAACTAT TGAGCCTGGA
GATAGATTAT TGCTTCGTGT GACTCGTGCA GATTTATTGC GTCTTCAGCA AGAACATACT
GTTCAGTTGG CTAAACAGAA TTTTGTCAAT GCTTCAGAGG AGCAAGTGGA GCCGTTTTTA
CGAGAAGGTC AAAAAACAGT TGAGGTTCTT CTTCCAGCAG GATCAACCTT GGCTGGTGCG
AGTTTGAGAG AATTAAGATT TCGACAACGC CATAATGCAA CTGTCTTAGC TCTTAGGCGA
GGACAGCAGA CTGTCCAAGA ACGTTTAGGT CAAGCAATTT TGCGAGAAGG AGATGTATTG
CTTTTACAAG CTCCGATAGA TTCAATTCGT GGACTGCAGG CTAGTAATGA TTTGCTTGTC
TTAGATCAAT TTGAAAATGA CTTGCCTACT ATCAGACGCA AGCCAATTAC CATTGGCATT
GCTATTGCGA TGGTTCTCTT ACCTGCCCTT ACTTCCCTGC CATTAGTTGC ATCCGTTTTG
ATAGCAATGA TTCTAATGGT TGTTAGTGGT TGTTTGCGCC CTGCAGAGGT GCAGAGTTCT
ATACGTCTAG ATGTAATTCT CTTGCTGGGG TCTCTATCTA GTTTTAGTGT TGCGATGCAG
GCGACAGGGT TAGCTGATGC TTTTGCTGCA ACTCTTGAGT ATTGGTTAAA GGGATTACCT
ACATATTTTT CTTTACTAGT TGTTTTCTTT GCTACGACTA TAGTTACTCA GTTTATTAGT
AATGCTGCTT CAGTAGCTTT ACTGGCACCA GTTGCAGTTC AGCTTGCTTC GGGAATGAAC
TTACCTCCCA TGGCTCTTTT GATGACAGTT TTATTTGGCG CGAGTCAATC TTTTCTGACA
CCTATGGGGT ACCAAACAAA TTTAATGGTT TTTGGCCCTG GTAGGTATCG TTTCCTTGAT
GTGACTAGAT ATGGAGCTGG ATTGACTGCA TTAATGACTC TCATTGTTCC TTTATTAATT
ATTTGGCAAT ACGGAGGAAC TTTTAGGTAA
 
Protein sequence
MDELIVVLEN PQALITLGVL VLAVVLFISG LLAPELTGLL SVALLMATGV LSPHKALSGF 
GSPALITLMG LFAVSAALFK SGALDRLREF IASESIRTPR RLIAFLGFIV APISGIVPNT
PVVASLLPVI EAWCFKRKLS PSRVLLPLSF ATVLGGTLTL LGSSVNLLVS DISQQLGYGS
LELFSFTAIG VPIWLVGTAY LLLAPQRLLP DRGRDNGEFG GSADQTGYFT EVTIPIDSDL
VGQSLHNSRL QRRFDVDVLE LQRGKERLLP PLADRTIEPG DRLLLRVTRA DLLRLQQEHT
VQLAKQNFVN ASEEQVEPFL REGQKTVEVL LPAGSTLAGA SLRELRFRQR HNATVLALRR
GQQTVQERLG QAILREGDVL LLQAPIDSIR GLQASNDLLV LDQFENDLPT IRRKPITIGI
AIAMVLLPAL TSLPLVASVL IAMILMVVSG CLRPAEVQSS IRLDVILLLG SLSSFSVAMQ
ATGLADAFAA TLEYWLKGLP TYFSLLVVFF ATTIVTQFIS NAASVALLAP VAVQLASGMN
LPPMALLMTV LFGASQSFLT PMGYQTNLMV FGPGRYRFLD VTRYGAGLTA LMTLIVPLLI
IWQYGGTFR
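
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons in frame 1. This is a minimal Python sketch, assuming the sense-codon assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# codon assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGATGAGT TAATAGTGGT TTTAGAAAAC CCTCAGGCGC TAATAACTTT GGGAGTTTTG"
last_row = "ATTTGGCAAT ACGGAGGAAC TTTTAGGTAA"

print(translate(first_row))  # MDELIVVLENPQALITLGVL
print(translate(last_row))   # IWQYGGTFR
```

The two outputs match the first 20 and the last 9 residues of the protein sequence listed here. The last row is in frame because the preceding 30 rows hold 60 bases each.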