Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24401 |
Symbol | |
ID | 4777920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2145924 |
End bp | 2147234 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087960 |
Product | bicarbonate transporter, ICT family protein |
Protein accession | YP_001018436 |
Protein GI | 124024129 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | [TIGR00947] probable bicarbonate transporter, IctB family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAAGA CTGCGGTCCC AAAGCCCCTT CTACTGCGCT GGCAGGGACG CATTCCCTCC TCTGAGGCGA TGCAAATGCG CCTGCAGTGG ATTGCGGGGT TGCTGTTGAT GATGCTCCTA GCAACCCTGC CCATGCTGAC TCGAACAGGG CTGGGACTAA CAATCCTCGC CGCCGGAGCG TTATGGATCA TCTGGGGCTG CGTGACACCA GCTGGCCGAA TTGGAAGCAT CAGTAGCTGG TTACTTGTGT TTCTCGCTAT TGCATTGCTC GCCACAGGAT TCTCACCCGT TCCATTGGCA GCTGCCAAAG GATTGATCAA ACTCATCAGC TACCTGGGGG TGTACGCACT GATGCGGCAG CTACTAGCCA CAAGGAGCGA CTGGTGGGAT CGCCTGGTGG CTGCCCTACT AACCGGCGAA CTGATCTCTT CTGTGATCGC AATCAGGCAG CTCTATGCCC CCGCTGAGGA AATGGCCCAC TGGGCAGATC CCAATTCAGT GGCTGCAGGG ACAGTGCGAA TTTATGGTCC GCTTGGTAAT CCCAACCTGC TAGCCGGCTA TCTCATACCC ATCCTGCCGC TGGCCTTAGT AGCCCTACTG AGATGGCAAG GCTTGGGGGC AAAGCTTTAC GCGATGGTCG CTCTAGGGCT TGGCATCACA GCAACCCTAT TCAGCTTCAG CCGCGGTGGA TGGCTAGGCA TGCTTTCCGC TCTAGCTGTG ATTTTGGTGC TGCTGCTGTT GCGCAGTACC AGCCACTGGC CTCTCGTCTG GCGTCGTCTG CTGCCCCTAA TCGTGATTGT TTTGGGCACA GCCATGCTGG TGATAGCAGC AACCCAGATT GAGCCCATCC GCACCCGAAT CACAAGCTTG ATCGCAGGGC GAAGTGACAG CTCTAACAAC TTCCGCATCA ACGTTTGGCT ATCGAGCCTT GAAATGATTC AGGCACGCCC ATGGCTGGGT ATTGGCCCTG GCAACGCTGC CTTCAACAGG ATCTATCCGC TCTTTCAACA GCCCAAATTC AACGCCCTAA GTGCCTACTC TGTTCCCCTG GAAATCCTTG TCGAAACCGG ACTGCCTGGC CTCATTGCAA GTCTCGCTCT AGTAATCACC AGCATCCGCA AGGGCCTCGC TGGCCTCAAC TCAAACAATC CGCTGGCCCT CCCCGCTCTG GCAAGCCTGG CCGCCATGGC TGGGCTTGCG GTTCATGGCA TCACAGATAC CATTTTTTTT CGACCTGAGG TTCAACTCGT GGGCTGGTTC TGCCTCGCCA CACTGGCCCA AACACAGCCA GAACAAAAGC AACTCCAATA G
|
Protein sequence | MPKTAVPKPL LLRWQGRIPS SEAMQMRLQW IAGLLLMMLL ATLPMLTRTG LGLTILAAGA LWIIWGCVTP AGRIGSISSW LLVFLAIALL ATGFSPVPLA AAKGLIKLIS YLGVYALMRQ LLATRSDWWD RLVAALLTGE LISSVIAIRQ LYAPAEEMAH WADPNSVAAG TVRIYGPLGN PNLLAGYLIP ILPLALVALL RWQGLGAKLY AMVALGLGIT ATLFSFSRGG WLGMLSALAV ILVLLLLRST SHWPLVWRRL LPLIVIVLGT AMLVIAATQI EPIRTRITSL IAGRSDSSNN FRINVWLSSL EMIQARPWLG IGPGNAAFNR IYPLFQQPKF NALSAYSVPL EILVETGLPG LIASLALVIT SIRKGLAGLN SNNPLALPAL ASLAAMAGLA VHGITDTIFF RPEVQLVGWF CLATLAQTQP EQKQLQ
|
| |