Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18531 |
Symbol | |
ID | 4775994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1612888 |
End bp | 1614489 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087362 |
Product | permease |
Protein accession | YP_001017860 |
Protein GI | 124023553 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.632317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTCAG ATCTCAAGCA GAAGCACTGG AGGCCGCAAT GGTTCGTCCG TGGAGATGTC GACGGATTCC TAGGACTTGG CCTCGATAAC TTGATTCAAG TCCTCTTGAT AATTGCCTTA TGCCGCAACG TGCTCGGCTA TCCGAATGAA TTAGTTTTCG GCACAATCCT TCCTGCCACA GGAATAAGCC TGCTGATAGG CAACCTTGCC TACGCGCATC AGGCTCATCA ACTTGCCAGC ATCGAAAAAC GTAGTGATCG GACAGCTCTG CCCTACGGAA TTAACACCGT GAGCCTGTTT GCTTATGTGT TTTTGGTAAT GCTCCCTGTG AAACTCACCG CACTGGGCCA GGGAATGGAT GAGGCAAGCG CTGTTCGCCT CTCCTGGCAA GCAGGGATGG TGGCTTGCCT GGGCTCTGGG CTGATCGAAA CAGTAGGGGC TTTTACAGCA GGGGTACTGC GTCGCTGGCT GCCTAGGGCT GCATTACTTG CCACTCTCGC CGGAATTGCT CTTGGCTACA TCGCCCTTGG TTTTCTGCTG CGTACCTATG CCCAACCTGT AGTGGGGCTG ACAGTACTAG CAATTGTTCT GGTCACCTAT TACGGACGCT TGCGATTGCC CATTCCTGGT GGCTTGCTTG CTGTTGTTAT TGGCGTGGGG CTTGCTTGGA GTACTGGATT AATCGATAGC GACGCTAGCC GTTGGAGTCA GGAAGCAAGT CAGATTGGCT TGCGCCTTCC TCACCTAGAA ATCGCCAATC TTTGGCATGC TCGTGGCCAG CTGCTGCCCT GGCTAGGGGT GATTGTGCCA ATGGGCCTAT TCAACGTGCT GGGTTCTCTT CAAAACATCG AAAGCGCAGA GGCGGCAGGT GATCGTTATT GCGTTCGAAG CTCTTTATTG ATCGATGGCA TCGGCACAAT GGCTGCCGCA GGTCTGGGGT CTTGTTTTCC TACAACCATT TATATCGGCC ATCCAGCCTG GAAGGAAATG GGGGCTCGCA TTGGCTATTC CTGGCTAAAT GGCTTGGTAA TGGGTAGCGC TTGTCTTCTA GGTGTTTTTG GTTTGGTCGC AGAGTTGGTA CCTATTGAGG CAGGCATGGC GATCGTGCTC TATATCGGCA TCGTGATTGC TGCTCAAGCC TTTCAAGCCA CGCCCAGCAC CCATGCACCG GCAGTAGCTC TTGGCCTGCT TCCTGGATTA GCAGGATGGG GTGCATCACT GATCAAGGCT GGTTTGCGCG CAGGAGGAGC CGGAAGCAAT TCAGAACCTT TTAATTCAGC TTTATTAGGA AGGCTTCAAG AAGCTGATGT ATGGGCAAGT GGCGCTTTTG CCCTTGAGCA AGGACAAATT ATTACCGCCA TGCTGCTAGC GGCAATGTTG GTTTATGTGA TCGAACAACG CTTTTTAGCA GCAAGCATTA CTTCTTTCCT CGCATCTGCA GCCTCATGGA TTGGGATCAT CCACGCCTGG CGATTCACCC AGACCGATAC TGTTCTGGAG CTTGGCTGGG GCGTTGGTAG CTCCTGGGCA ATGGGCTATC TACTAATGGC AGTTGTGTTC ATGCTCGCCG GTTTGCATCA GCGCAGATTG ATTTCTGGTT GA
|
Protein sequence | MHSDLKQKHW RPQWFVRGDV DGFLGLGLDN LIQVLLIIAL CRNVLGYPNE LVFGTILPAT GISLLIGNLA YAHQAHQLAS IEKRSDRTAL PYGINTVSLF AYVFLVMLPV KLTALGQGMD EASAVRLSWQ AGMVACLGSG LIETVGAFTA GVLRRWLPRA ALLATLAGIA LGYIALGFLL RTYAQPVVGL TVLAIVLVTY YGRLRLPIPG GLLAVVIGVG LAWSTGLIDS DASRWSQEAS QIGLRLPHLE IANLWHARGQ LLPWLGVIVP MGLFNVLGSL QNIESAEAAG DRYCVRSSLL IDGIGTMAAA GLGSCFPTTI YIGHPAWKEM GARIGYSWLN GLVMGSACLL GVFGLVAELV PIEAGMAIVL YIGIVIAAQA FQATPSTHAP AVALGLLPGL AGWGASLIKA GLRAGGAGSN SEPFNSALLG RLQEADVWAS GAFALEQGQI ITAMLLAAML VYVIEQRFLA ASITSFLASA ASWIGIIHAW RFTQTDTVLE LGWGVGSSWA MGYLLMAVVF MLAGLHQRRL ISG
|
| |