Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_02561 |
Symbol | citT |
ID | 4778419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 267209 |
End bp | 269029 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640085760 |
Product | putative sodium/sulfate transporter, DASS family |
Protein accession | YP_001016276 |
Protein GI | 124021969 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTGC TGAGCACCGC TATTCAAAAC CCCCAAGCCC TCATCACCCT GGCGGTGTTG TTGCTGGCGG TGGTGCTCTT CATCAGCGGT GCCCTTGCAC CTGAACTCAC TGGCCTGCTG AGCATGGCAC TGCTGATGGC CACAGGCGTC CTAACTCCCC AACAAGCCCT GGCTGGATTT GGCAGGCCTG CCCTTATTAC GCTAATGGGC CTATTCGCCG TCTCAGCAGC CTTATTCCGC AGTGGTGCCC TGGATCGCGT GCGCGAACTG ATCGCATCCG AGCGCATTCG CAGCCCACGC CGGTTGATTG CCTTGTTGGG TCTAGTGGTC GCTCCCATTT CGAGTGTCCT CCCAAACACA CCCGTCGTCG CCTCGTTGCT ACCTGTCATC GAAGCCTGGT GTCATCGACG CCGGATCTCT CCGTCTAAGG TCCTGCTACC ACTATCTTTC GCAGCGCTTT TCGGCAGCAC ACTCACACTG CTGGGGAGTT CAGTGAATCT GCTGGTCAGT GACATCAGCG AGCAACTGGG CAATGGATCT CTAGAACTGT TCAGCTTCAC AGCCATCGGC GTGCCGATCT GGCTCGCTGG CACCACCTAT CTCATGCTTG CCCCTCAGGC CCTGTTGCCA GATCGTGGCA GCAACAGCGA TGAACTAGGA GACAACAAAG ACCAGACCGG TTACTTCACA GAGGTCACCA TCCCCCAGAA CTCGCAACTG GTAGGACAAT CCCTGCACAA CAGTCGTTTA CAACGTCGTT TTGATGTAGA CGTGCTGGAA CTGCAAAGAG GTCGAGAACG ACTCCTACCG CCCCTGGCTG ATCGCAGACT TGAACCTGGC GACCGACTTC TTCTCAGGGT CACCCGCGCA GACCTACTAC GTCTCCAACA AGAACACAAC GTACAGCTAG CAACAAGAGA GTCGATCGCT CCGTCATCAC TTTCCCCATC TGGCCTGGGC GAAGGGCAAA GAACTGTGGA AGTCCTCCTA CCGGCAGGCT CAACTTTGGC TGGTGCAAGT CTGCGTGAAC TGCGATTTCG GCAACGCCAC AACGCCACAG TTCTTGCCCT CAGGCGTGGT CAGCAAACAG TGCAAGAGCG TCTTGGACAA GCTGTACTAC GCGAAGGCGA TGTCTTACTG CTGCAAGCCC CACTGGACTC CATTCGCGGC CTGCAAGCCA GCAATGATTT GCTGGTCCTG GATCAGCTGG AAAACGACCT GCCCACAGTG CGCCGCAAAC CACTGACCAT CGCCATTGCT TTGGCCATGC TGATCATGCC GACAGTGACT GCCCTGCCAC TGGTGGCAGC GGTCCTACTT GCTGCAGTCG CGATGGTGGC AGGAGGATGC CTTCGTCCAG GGGAACTTCA GCGATCAATT CGTCTCGATG TGATTCTGCT GCTGGGGTCG CTCTCCAGCT TCAGCGTGGC CATGCAGAAA ACCGGCCTAG CAGATGCTCT GGCAAGCAGC TTTGAAACCT TGCTGAATGA CTGGTCCAAC TACTTAGCCT TACTAGTCAT CTTTCTGGTC ACCACGCTCC TCACCCAGGT GATGAGCCCA GCCGCATCCG TTGCACTACT TGTCCCAGTA GCCATCCAAC TCGCACCTGG ACTAGACCTT GTACCCAATG CACTGGTCTT CACCGTGCTA TTTGGTGCAA GCCAGTCTTT TCTGACACCA ATGGGGCATC AGACAAACCT GATGGTGTTC GGTCCTGGGC GTTATCGATT TCTGGATGTA ACCCGTTACG GAGCTGGCCT AACGGCCCTG ATGACCGTGA TGATTCCTGG GCTGATCCTT TGGCATTTCG GCGGATCTTG A
|
Protein sequence | MAVLSTAIQN PQALITLAVL LLAVVLFISG ALAPELTGLL SMALLMATGV LTPQQALAGF GRPALITLMG LFAVSAALFR SGALDRVREL IASERIRSPR RLIALLGLVV APISSVLPNT PVVASLLPVI EAWCHRRRIS PSKVLLPLSF AALFGSTLTL LGSSVNLLVS DISEQLGNGS LELFSFTAIG VPIWLAGTTY LMLAPQALLP DRGSNSDELG DNKDQTGYFT EVTIPQNSQL VGQSLHNSRL QRRFDVDVLE LQRGRERLLP PLADRRLEPG DRLLLRVTRA DLLRLQQEHN VQLATRESIA PSSLSPSGLG EGQRTVEVLL PAGSTLAGAS LRELRFRQRH NATVLALRRG QQTVQERLGQ AVLREGDVLL LQAPLDSIRG LQASNDLLVL DQLENDLPTV RRKPLTIAIA LAMLIMPTVT ALPLVAAVLL AAVAMVAGGC LRPGELQRSI RLDVILLLGS LSSFSVAMQK TGLADALASS FETLLNDWSN YLALLVIFLV TTLLTQVMSP AASVALLVPV AIQLAPGLDL VPNALVFTVL FGASQSFLTP MGHQTNLMVF GPGRYRFLDV TRYGAGLTAL MTVMIPGLIL WHFGGS
|
| |