Gene Information |
Locus tag | P9211_02331 |
Symbol | |
ID | 5730456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 223641 |
End bp | 224636 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284577 |
Product | putative sodium-dependent bicarbonate transporter |
Protein accession | YP_001550118 |
Protein GI | 159902774 |
COG category | [R] General function prediction only |
COG ID | [COG3329] Predicted permease |
TIGRFAM ID | |
|
|
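The coordinate and length fields above can be cross-checked against one another. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon while Protein Length does not. The record's own numbers are consistent with both assumptions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(223641, 224636)
print(length)                  # 996
print(protein_length(length))  # 331
```

The strand field does not affect this arithmetic, since the length of a feature is the same on either strand.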
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCA ATCTGATTCT TCAAAATGTA TTAACACCTC CAGTACTATT TTTCTTTTTA GGAATTGTAG CTGTAGTAGT ACGTTCTGAT CTGGAAATAC CTGCACCTTT ACCCAAACTT TTTTCTCTAT ATCTTCTACT AGCAATTGGT TTTAAAGGGG GACTAGCGCT TGAAAAGAGT GGTTTTGGAG GTCAAGTTTT TCCAACACTT ATTGCAGCGA TATTAATGTC TTTACTTATA CCACTAGTAT GTTTTGTAAT TTTGCGATTA AAGCTTGATG TTTTTAATTC AGCAGCAATT GCAGCTGCCT ATGGATCGAT AAGTGCAGTA ACTTTCATAA CAGCAGAAAG TTTTTTAGAG AGTCAAAACA TACCATTTGA TGGCTTTATG GTTGCGGCTT TAGCATTAAT GGAATCGCCC GCAATAATTG TTGGTCTTTT ACTTGTAAAG CTAGGCGCTC CAAAAAATCG GCCAAATATA CAAGAAATGA AATGGAGGTC TATTTTGCAT GAATCAATGC TAAATGGATC AGTCTATTTA CTTTTGGGTA GTCTAGTTAT TGGATTCCTC ACTGCTGCTC ATAATCCAGT TGGAGTCGAT AAAATGCAAC CTTTTACTGG CAAGTTGTTT TATGGAGCGG AATGTTTCTT TCTATTGGAT ATGGGAATAG TTGCTGCTCA GAGACTGCCT AGTCTTAAGA AAGCAGGGTC ATTCCTAATT ATTTTTGCGG TACTGATTCC TCTTTTGAAT GCATGTTTAG GAATTTTTGT TGCAAAAGCA TTATTACTAG GTCCTGGTAA TTCACTCTTA TTTGCAATTC TATGTGCCAG TGCTTCATAT CTTGCAGTTC CAGCTGCAAT GAGAATGACG GTGCCTGAAG CTAAATCAAG CTATTACATT TCAACCACCC TAGGGTTAAC TTTCCCTTTT AATATTGTAA TTGGAATACC TCTATATATG GGTCTGGTAA ACAAACTCAT TCCTTCTATT GGATAA
|
Protein sequence | METNLILQNV LTPPVLFFFL GIVAVVVRSD LEIPAPLPKL FSLYLLLAIG FKGGLALEKS GFGGQVFPTL IAAILMSLLI PLVCFVILRL KLDVFNSAAI AAAYGSISAV TFITAESFLE SQNIPFDGFM VAALALMESP AIIVGLLLVK LGAPKNRPNI QEMKWRSILH ESMLNGSVYL LLGSLVIGFL TAAHNPVGVD KMQPFTGKLF YGAECFFLLD MGIVAAQRLP SLKKAGSFLI IFAVLIPLLN ACLGIFVAKA LLLGPGNSLL FAILCASASY LAVPAAMRMT VPEAKSSYYI STTLGLTFPF NIVIGIPLYM GLVNKLIPSI G
|
| |
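The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, whitespace in the sequence is assumed to be display grouping only, and the codon assignments are those of the standard code, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here begins with ATG, so that does not matter for this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
prefix = "ATGGAAACCA ATCTGATTCT TCAAAATGTA"
print(translate(prefix))  # METNLILQNV
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields of the record.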