Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_07081 |
Symbol | thiP |
ID | 4776714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 658660 |
End bp | 660261 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086217 |
Product | putative iron ABC transporter |
Protein accession | YP_001016724 |
Protein GI | 124022417 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAG ACGCCGTCCC GTCAAGCTAC GGCTCGAACC GCCTGCGCAA ACGCCATTGG CAACCAGACC GCCTGCTTCT CAACAGCATG GCAATTCTGC TTGCTGTACT TGCTCTCTGG CCTCTGATTG GTCTGATCCG TGAAGCACTG CAGGGATTCA TCAATGGATC GGCCAGCCTG GGAGTCGATG GCCCACAACA GATTCAAGGG ACCTTGACTC TGCTGATCGG GACATCCTTA TTGGGTGGCC TACTGGGAAC TGCAAACGGC TGGTTATTAG CAAACTGCCG CTTCCCAGGA CGTCGAGCCC TACGAGTGGC TCAGCTATTG CCCCTCGCCA CACCCGCCTA CCTACTCTCG GCCATCCTGA TCGATCTTGG CAGTCGCAAT GCCATACGAA TTCATGGCAT GGGCTGGGGG ATTTTGATCA TGGCGCTGAC AACCTATCCC TACGTTTTTC TCCTCAGCAC TGAAAGCTTT TCAATCTGCG GACGCCGCCA ACTGGAAGCC TGCCGCAGTC TTGGGGTTGG ACCGTGGAAC AGTTTTCGAC GCATTGCTCT GCCCATGGCT TTGCCAGCGA TTGGTGCCGG GATTGCCCTG ATGGGCATGG AAGTCGTCAA TGAACTCGGG GCCGTAGCAC TACTCGCCAT CCCAAGCCTG TCAGCTGGCA TCGTTGAGAC CTGGCAAATG GAAGGCAATC CTGCCGGTGC CATTGGACTA GCAATGATCG CGCTGATCAT CGTGATGTCA CTGGTGGGGT ATGAACGCAG ACTGAGACGA CGAAGCCGCC GCTGGACAGA GGGTGTCGCT GGTGGCGATT CTCCAGCCTG GCAGCTGCAC GGTGTACGAG CCCTGTGCGC TCAGTGCCTA GCCCTCATTC CTCCCACGAT CACTTTGGGT GTGCCCCTGC TGTGGGCCAT CCTCAATCTC GATCAACTCG AACAAGGGAT AGATCCTGAT CTAATCCCAC TGACAGAACG CAGTCTTGGA CTTGGGCTGG CAGCAGCAAG CCTGGCCGTG GTAGCAGGTC TGATCCTTGC AATCGCCAAG CGTTGGTCAT CAACACGCTG GATGGGCAAC CTCTCATTTA TGGCAGGTAT TGGTTATGCC ATTCCAGGAG CTGTGATGGC CATTGCGCTG ATGCCATTCA ACGGTGCTCC ATGGAACCTC GCATGGATTC TGCTGCTGCT GTGGGGCTAT AGCGATCGCT TCTTGGCAGT TGCAAAAGGA GGACTCGACG CTGCCTTCGA GCGCCTTTCC CCAAGCCTGG ATGAAGCCGC TACTGGACTT GGCTGTCAAT GGCAGGAGGT GCTTCGACGC GTTCATCTGC CTTTACTGAA AGGTCCACTA GCGGTAGGAG CACTTTTGGT TTTTGTCGAC ACGGTTAAAG AACTACCTCT CACATTTGTC CTGAGACCAT TTGATTTCGA CACCCTTTCT GTACGGCTCT ACCAATACGC CGCAGATGAA CGCATGGCGG AATCCATCTT GCCAGCACTG ATTATCATCG CTCTGGGATT AATCGCTTCG CTGGCATTGG TCCCAGGGCT CGATCAAGGG GAACGAAAAA AGCCTCCTTT AAGTAAAGAA CCACTCACTT AG
|
Protein sequence | MTQDAVPSSY GSNRLRKRHW QPDRLLLNSM AILLAVLALW PLIGLIREAL QGFINGSASL GVDGPQQIQG TLTLLIGTSL LGGLLGTANG WLLANCRFPG RRALRVAQLL PLATPAYLLS AILIDLGSRN AIRIHGMGWG ILIMALTTYP YVFLLSTESF SICGRRQLEA CRSLGVGPWN SFRRIALPMA LPAIGAGIAL MGMEVVNELG AVALLAIPSL SAGIVETWQM EGNPAGAIGL AMIALIIVMS LVGYERRLRR RSRRWTEGVA GGDSPAWQLH GVRALCAQCL ALIPPTITLG VPLLWAILNL DQLEQGIDPD LIPLTERSLG LGLAAASLAV VAGLILAIAK RWSSTRWMGN LSFMAGIGYA IPGAVMAIAL MPFNGAPWNL AWILLLLWGY SDRFLAVAKG GLDAAFERLS PSLDEAATGL GCQWQEVLRR VHLPLLKGPL AVGALLVFVD TVKELPLTFV LRPFDFDTLS VRLYQYAADE RMAESILPAL IIIALGLIAS LALVPGLDQG ERKKPPLSKE PLT
|
| |