Gene Information |
Locus tag | A9601_12951 |
Symbol | |
ID | 4718014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1080254 |
End bp | 1081597 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640079014 |
Product | hypothetical protein |
Protein accession | YP_001009686 |
Protein GI | 123968828 |
COG category | [R] General function prediction only |
COG ID | [COG0733] Na+-dependent transporters of the SNF family |
TIGRFAM ID | |
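The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1080254, 1081597)
print(length)                           # 1344
print(expected_protein_length(length))  # 447
```

Under these assumptions, the values agree with the Gene Length (1344 bp) and Protein Length (447 aa) fields of this record.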
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0640548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGGACTCAA AAATTTCTCA GAGAGAACAA TGGACTAGTA AGCTAGGATT CATTCTCGCA GCTGCTGGTA GTGCAGTAGG TCTAGGCAAC CTTTGGGGTT TTGCTTACAG AGCATCTCAG GGTGGAGGTG CGGCGTTTGT ACTTTTATAT ATATTGATCG TTTTAATTGT ATGTCTTCCA GTATTTGTTG CTGAAATGGC TCTAGGGAGA AACGCAATGG CAAGTACATT GCTTGCTCCT GTAAAGCTGG CAGGGAAGAA TTGGTATCCA TTAGGAATTC TTTTCTTTAT AGCTCCTTTA GGAATAGCAT CATATTATTC AGTGATAATG GGATGGACTG CAGATACCTT GTTCCATTCT TTATTTTTTG GATTACCAAA GAATTTAACT GAAGCAGAAA CCTTCTTTGG CTCGATTAGT AGTGGCAGCA GTGTTTTGTT GGGCCACCTA TTAAGTCTTG TACTTACAGC AATAATAGTT TCATCAGGTA TAAAAAAAGG TATAGAAAAG GTTACTAGAT ATTTCATGCC AATCCTTTTC ATAATTATTG TGATTCTTGC TATTTGGGCT ACTTCACTTT CAGGTGCATG GGAAGGATAT AAAACATTTC TACTTAAGTT TGACTTCAAT GAATTGAGAA ATCCTCAAAC AATAAGAAAC GCTTTTACAC AAGCATTTTT TTCATTAAGT TTAGGGATTG GAATTATGGT TACCTACGCA TCCTATTTAA ATAAAAAAAG TAATCTTCCA AAATTAAGTG TAGGAGTTGC ATCATTAGAT ACTTTGGTTG GACTAATGGC TGGATTTATA ACTTTCCCAA TAGTTTTAAC ATTCGGTTTA AGTGACGCTA TTTCTGAATC CACTGTTGGT GCTTTATTTA TCTCAATTCC AACAGGTTTA GGTTCATATG GTGCGGCAGG AAGAATTGTA GCTGTTGCAT TTTTCGCATT AGCTTATATT GCAGCAATAA CTTCCTCTGT TTCATTATTG GAAGTTCCAG TTTCCTCTTT AATGGATAAA TTTGGTTTTA AAAGAGAAAA ATCTGTTTGG CTGATAACTC TTTTCTTATT CTTAGCAGGC ATTCCTTCTG CATTAAACTT AAACATTCTT GGAACTATTG ATTCGATTTT TGGAGGTGTA TTACTTATCT TTGGTGGATT CTTGGTTACT TTCTTTATGG GATGGGTAGT ACCTGGAAAG TTTAATGAAG AACTTAGTGA TTCAAAAGTT GGAATCAAAA CGACACGTTA TTTGAAATTC ATGACAAGAT GGGTTGCGCC ACCAATTATT GGTTTTGGAC TATTTATTAG TGTGTTTGAT TTGCTTAAAG GCTGGGTAAG TTAA
|
Protein sequence | MDSKISQREQ WTSKLGFILA AAGSAVGLGN LWGFAYRASQ GGGAAFVLLY ILIVLIVCLP VFVAEMALGR NAMASTLLAP VKLAGKNWYP LGILFFIAPL GIASYYSVIM GWTADTLFHS LFFGLPKNLT EAETFFGSIS SGSSVLLGHL LSLVLTAIIV SSGIKKGIEK VTRYFMPILF IIIVILAIWA TSLSGAWEGY KTFLLKFDFN ELRNPQTIRN AFTQAFFSLS LGIGIMVTYA SYLNKKSNLP KLSVGVASLD TLVGLMAGFI TFPIVLTFGL SDAISESTVG ALFISIPTGL GSYGAAGRIV AVAFFALAYI AAITSSVSLL EVPVSSLMDK FGFKREKSVW LITLFLFLAG IPSALNLNIL GTIDSIFGGV LLIFGGFLVT FFMGWVVPGK FNEELSDSKV GIKTTRYLKF MTRWVAPPII GFGLFISVFD LLKGWVS
|
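The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates that relationship in plain Python. The codon table is built from the standard base ordering (TCAG), the set `ASSUMED_START_CODONS` is only a common subset of the table 11 initiators, and the function name `translate_cds` is hypothetical. It is an illustration, not the database's own translation procedure.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# sense-codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a common subset of the table 11 initiation codons.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, reading a recognized first codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ASSUMED_START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate_cds("TTGGACTCAA AAATTTCTCA G"))  # MDSKISQ
```

The printed residues match the first seven residues of the protein sequence in this record (MDSKISQ).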