Gene Information |
Locus tag | P9303_20011 |
Symbol | |
ID | 4777078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1758523 |
End bp | 1759692 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087515 |
Product | integral membrane protein |
Protein accession | YP_001018008 |
Protein GI | 124023701 |
COG category | [R] General function prediction only |
COG ID | [COG4956] Integral membrane protein (PIN domain superfamily) |
TIGRFAM ID | |
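
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1758523, 1759692)
print(length)                  # 1170
print(protein_length(length))  # 389
```

Both values match the Gene Length and Protein Length fields of this record.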
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.192672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGCATG CTGCCTGCAT CCGGAGCGAG CAGACACGAT CTGACCCCAT GGTTGACCTG CTCATCCTGG TGCTGTTTCT GATCTCTGGC GCGGCCACTG GCTGGCTTGG AGTTGAGCTG TTGCCCGAAC GGCTGCTGGA GCAAACGATC AACATCGATC GACTGAGGTT GGTCCTGAGT GGGTTGAGCG CCAGCTTTGG ACTTTTGGCT GGATTCTTCT TCCAGCGACT GCGGCAAAGG CTGATGCAGC AGGTTCGCAC CATGCCCACC GACCTCTTGG TCAGTCGAGC CGTAGGCCTC ATTCTTGGCC TACTGGTTGC CAATTTGCTG CTGGCACCAA TCCTGCTTTT GCCCCTCCCA TTTGAGGTGG TGTTGGTGAA GCCTCTGGCC GCAGTACTGA GCAATGTCTT CTTTGGGGTC CTTGGATACA ACCTGGCGGA GGTCCACGGC CGCACCCTGC TACGCCTGTT CAATCCCAAC AGCACCGAGG CGTTACTGGT CGCCGATGGG GTTCTAACAC CTGCCACTGC AAAGATTCTC GACACCAGCG TGATCATCGA TGGCCGCATT CATGGACTAC TGGCCTGCGG ACTGCTGGAA GGTCAGGCAA TCGTGGCCCA AACCGTGATT GATGAACTGC AACAGCTGGC CGATTCAAGC AATGCCGAAA AACGAGCCAA AGGTCGACGC GGTTTGAAGT TGCTCACTGA ATTACGCGAA ACCTATGGCA GACGGCTGGT TTTAAACAGC ACCCGATACG AGGGATCGGG CACTGACGAG CGACTGCTCA AACTCACCGC CGATACAGGC GGGATGTTGG TCACTGCCGA TTACAACCTC GCCCAGGTCG CCAAGGTCAA AGACCTCAAA GCCATCAACC TCAGCGAAAT CGTGATCGCC CTGCGGCCTG AAGTGCAACC CGGCGACGAG CTGAAACTCA AAATCGTCAA ACAGGGCAAG GAAGACAACC AAGGGGTGGG CTATCTAGAA GACGGCACGA TGGTGGTGGT TGAGGGCGCC AGAGAGGCCA TTGGACAGCG GCTACCCGTT GTTGTCACTG GAGCGCTGCA AACCCCTACC GGCAGGATGG TCTTTGGTCG GTTCGAGAAA AATCAGCCAA CCCGCAAATC AAGTAAGACC AGCGAGCGCC CACCGGCTAA CCCTCGCTAG
Protein sequence | MVHAACIRSE QTRSDPMVDL LILVLFLISG AATGWLGVEL LPERLLEQTI NIDRLRLVLS GLSASFGLLA GFFFQRLRQR LMQQVRTMPT DLLVSRAVGL ILGLLVANLL LAPILLLPLP FEVVLVKPLA AVLSNVFFGV LGYNLAEVHG RTLLRLFNPN STEALLVADG VLTPATAKIL DTSVIIDGRI HGLLACGLLE GQAIVAQTVI DELQQLADSS NAEKRAKGRR GLKLLTELRE TYGRRLVLNS TRYEGSGTDE RLLKLTADTG GMLVTADYNL AQVAKVKDLK AINLSEIVIA LRPEVQPGDE LKLKIVKQGK EDNQGVGYLE DGTMVVVEGA REAIGQRLPV VVTGALQTPT GRMVFGRFEK NQPTRKSSKT SERPPANPR
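
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates a CDS up to the first stop codon. The codon table is built from the standard ordering of the genetic code; translation table 11 uses the same codon assignments during elongation and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in standard TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGGTGCATG CTGCCTGCAT CCGGAGCGAG"
print(translate(first_blocks))  # MVHAACIRSE
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the Protein sequence and GC content fields of this record.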