Gene Information |
Locus tag | P9303_29911 |
Symbol | |
ID | 4776950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2640896 |
End bp | 2642002 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640088515 |
Product | hypothetical protein |
Protein accession | YP_001018986 |
Protein GI | 124024679 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.853518 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACAG AAAGATCCAA TGAGTTCCGG CAATCAATCG TGACCACATG CCCCGAATTA TCCGGCCGTG AACGCTTCAA AGCCCACCTC AGGAAGGTGG GAAGTGGAGA GCAAACCAGT CGTGGCATGA GCCGTGAAGA ATCAGCTGAT GCCTTGCATC TCATCCTTAC AGCCCAAGCC AGCCCTGCTC AAATTGGCGC CTTCCTCATC GCCCATCGCA TTCGTCGACC CGAGCCCCAG GAACTCGCCG GCATGCTCGA TACATACAGA GTCCTTGGGC CCAAACTGAA ATCAGCCAAT GGCCAGAAAC GACCCATTTG CTTTGGGATG CCATTCGACG GCCGCAAGCG AACAGCCCCG ATTTATCCGC TCACAGCACT GGTCCTACTC AACGCTGGTC AACCCGTCGT CTTGCAAGGG GGGCAGCGTA TGCCAATCAA ATATGGCGTC ACCACAGAAG AGTTATTCAA AGCCTTAGGG CTACAACTCC AAGGCCTATC AATAGCAAAC CTAGAAGCTG GCTTTCAACA ACATGGTCTG GCACTGATCT ACCAGCCAGA TCACTTCCCG CTGGCCGAAA GCTTGATCAG TTATCGCGAC GACATCGGCA AGCGACCGCC TGTGGCCAGT TTGGAGCTGC TTTGGACAGC ACATCAAGGA CAGCATTTAC TCGTCAGCGG CTTCGTGCAT CCACCCACCG AAGACCGGGC CTGGAAAGCC CTTGAGCTAG CAGGTGAAAC AAATCTCGTG ACTGTGAAAG GGCTCGAAGG AAGCACAGAC CTTCCCATCA GTCGAACTTG CATCACATCC CGAGTTCAAA ACGGCAAGCC AGAACGACTC ATCCTCCACC CCCGTAACCA CGGCTGCTTT AGCCAAGACG TTGAGTGGAG CAACCTTACG GAGTGGCGCG AGCAGGCAAT GGAAGCTCTG CACAATCGCG GGCCATTAAG CCAGCCCCTC CTCTGGAATG CTGGTACCTA CCTATGGTTA GCGGGCCTAG CCGACAACAT CGATGAAGGT ATCGCTCATG CTGAAAAGTG TCTGCAATCA GGCTTAGCCC AAACCACGCT TGAGCAGCTC ATTGCTTGGA GAGAGACCAT CATTTGA
|
Protein sequence | MTTERSNEFR QSIVTTCPEL SGRERFKAHL RKVGSGEQTS RGMSREESAD ALHLILTAQA SPAQIGAFLI AHRIRRPEPQ ELAGMLDTYR VLGPKLKSAN GQKRPICFGM PFDGRKRTAP IYPLTALVLL NAGQPVVLQG GQRMPIKYGV TTEELFKALG LQLQGLSIAN LEAGFQQHGL ALIYQPDHFP LAESLISYRD DIGKRPPVAS LELLWTAHQG QHLLVSGFVH PPTEDRAWKA LELAGETNLV TVKGLEGSTD LPISRTCITS RVQNGKPERL ILHPRNHGCF SQDVEWSNLT EWREQAMEAL HNRGPLSQPL LWNAGTYLWL AGLADNIDEG IAHAEKCLQS GLAQTTLEQL IAWRETII
|
| |
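The coordinate, length, and GC fields above are related by simple arithmetic, which the following minimal sketch cross-checks. It assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function names `gene_length`, `expected_protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage; spaces between 10-base display blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the Start bp and End bp fields of the record.
length = gene_length(2640896, 2642002)
print(length)                           # compare with "Gene Length"
print(expected_protein_length(length))  # compare with "Protein Length"
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare with the reported GC content, which the record appears to round to a whole percent.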