Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_25661 |
Symbol | |
ID | 4777914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2260466 |
End bp | 2262691 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640088087 |
Product | hypothetical protein |
Protein accession | YP_001018562 |
Protein GI | 124024255 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACT TTGTGGTTCT TTCCACTGCA GATTGGGATC ATCCGCTATG GACTAATAAG CAGCATGTTG CTGTCTCCTT GGCTGCTGCC GGTCACCGAG TTTTGTATGT TGATTCTCTT GGTTTGCGTG CACCTCGTGT TGGGGCTGTC GATCGAGGCC GCATCCTGCG GCGTCTTGGT CGAGTGTTGC GTCCGCCTCG GCGTGTAGGC GAGGCCCTTT GGGTGTGGTC ACCTCTTGTG TTGCCCGGCG GGACGGCTGG ATTTGCTTTG ATTTTGAATC GGCAATTATT AACTCTGGGG CTGCGTCTTG CCTTGCTCTG GCTGCGCTTC CAGCAACCAA TCCTTTGGAC TTACAACCCG CTGACATGCC GCTATTTGGC GCTGGGTAGT TTTGGTGGAA GCATTTATCA CTGCGTCGAT CGCATCCAGG CTCAACCAGG TATGCCGGCG GAAAGGATCA GCGCAAGCGA GCGGCAACTT TGCCGAGCCG TGGATGTTGT CTTTACCACT TCCCCGGATC TACAGGCTGA CCTGGAAAAG ATTCATCCTC ATACTCATTT TTTCGGCAAC GTCGCTGATC AGCAACATTT TGGACAGGCA TTGAGTGGAA CATTGCCGTG TCCTCCTGCT CTCAATGATC TCCCTAGGCC TCGCTTGCTA TTCATCGGCG CCATTGATGC CTACAAGCTG AATTTGCCGA TGCTTGAGAT CTTGGCGGAG CGCCACCCTG AGTGGACCTA TGTCTTTGTT GGTCCTGTGG GGGAGGCAGA TCCCGCAACG GATGTTTCCA ATTTGCTCAC GTTCTCCAAT GTTCACTTCG TTGGAGCTCA GCCTTATAGC GATTTGCCTT CTTGGCTTGC TCACTGTGAT GTCGCCTTGC TGCCATTGCG GCACAACAGC TACACCCGTC ATATGTTCCC GATGAAGTTC TTCGAGTATT TGGCGTCTGG GAAACCAGTC GTTGCCACTG CGATTCCTGC TCTTCGTCCC CACGCTGTAG CTGCACATCT TTGTGAGTCT GAAGCGGACT CCTTTGAAGT GGCAATTGCC AAGGCGCTTG CTTCTGAAGG ACCGGCTTTG ACGGAGCGAC TCGCTGTAGC GGCAGAACAC ACCTATGAGG TGCGAACAGC AGCCATGTTG TCGGTGTTGA ACGAGCTGGG AATTTTGCCT GACGCTCGAG CCAGTGGTGT TTCCTCTGGA AGGGCCAGAG TCAGAGTGCG TCGTCTTCGT CACTATTGGC ATGAATGGCT CCTGTCGCAG CTCGCGACTT CCTTGGCTGC TGGGCTGGAT CGAATTGGTG CCCATCACAA TGCCTTGGAG ATGTTGCAAG CTTTTAGACA CCGTTGGCCA CTCAATCTGC CTGTACTACG TGCCTTGATT CCACGTTCAG TGCAAGCAGG TGATTTCAAT TATGCGCTTG AAGTCATGGA AGATCTTTGG ATTAATTACG GCCAGATTTC TTACTTGCGC AAATTACTTT TTCGTCGGGG TTCGCGTCCT GAAGATCTAC AACAGCAGAT TGCTTTGTTC GAAACTTTAG CCAGAAGTGT TCGGCTGCCA TTGACCTACC GATGTTATTC CCGAGTCGTG CTGGCCTATC GGATTGTTGA GAGTGGGGAT CAAGTCAGAA TGCGTGAATC AGCGGTTGCT TTGCAGTCGT TTGTTGTTCA ACTTGAAAGT GATCCAGGCA CAAGACTATG TCGGCGAGGA AATCGCTCGA ATCGAGCAAA GTTATTGATT TCTTGCTATT CAACACTCAC GCGCTTGTAT TTAGCGCTTG GCGATCGAAA GTCACTGGCG GCTATTGGAC AAAAAGCAGC GGAGTTTATG GATGGGTTCG ATCTGAATGC GATTGATAGA GATACTTCCT TCCGTTTGAC TCGCAATCTG ATGCGATGCC TCACAATCGA TGTCCTTGAA GCTTGGCGTT TGGGGGATCA ATCCCTTTAT CAGAGGGCGA GGCAAAGGCT TGTTCTGGTT GTGGATCATT GTCATCAATC CATCCATGAT GAAAGCAATG CGCAAGAGGA TCATCGAGGT TTTGCCAAGG CTCTTCTTGA AGAGGTTGAT AGCCTGGAGC CAATGATTAC TGGTCCCAGT CATGATCCAC AAAGGATTCA CGAATTATTG CGATTAATGG TTAAGAATAA GGGATTATCA CTTGATGGAG TGTTGCCCTT ATTCCCTGAG TATCTAGATA CTAAAGTGGC TGAGGTCTGT CAATGA
|
Protein sequence | MADFVVLSTA DWDHPLWTNK QHVAVSLAAA GHRVLYVDSL GLRAPRVGAV DRGRILRRLG RVLRPPRRVG EALWVWSPLV LPGGTAGFAL ILNRQLLTLG LRLALLWLRF QQPILWTYNP LTCRYLALGS FGGSIYHCVD RIQAQPGMPA ERISASERQL CRAVDVVFTT SPDLQADLEK IHPHTHFFGN VADQQHFGQA LSGTLPCPPA LNDLPRPRLL FIGAIDAYKL NLPMLEILAE RHPEWTYVFV GPVGEADPAT DVSNLLTFSN VHFVGAQPYS DLPSWLAHCD VALLPLRHNS YTRHMFPMKF FEYLASGKPV VATAIPALRP HAVAAHLCES EADSFEVAIA KALASEGPAL TERLAVAAEH TYEVRTAAML SVLNELGILP DARASGVSSG RARVRVRRLR HYWHEWLLSQ LATSLAAGLD RIGAHHNALE MLQAFRHRWP LNLPVLRALI PRSVQAGDFN YALEVMEDLW INYGQISYLR KLLFRRGSRP EDLQQQIALF ETLARSVRLP LTYRCYSRVV LAYRIVESGD QVRMRESAVA LQSFVVQLES DPGTRLCRRG NRSNRAKLLI SCYSTLTRLY LALGDRKSLA AIGQKAAEFM DGFDLNAIDR DTSFRLTRNL MRCLTIDVLE AWRLGDQSLY QRARQRLVLV VDHCHQSIHD ESNAQEDHRG FAKALLEEVD SLEPMITGPS HDPQRIHELL RLMVKNKGLS LDGVLPLFPE YLDTKVAEVC Q
|
| |