Gene Information |
Locus tag | P9301_08861 |
Symbol | |
ID | 4911233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 762087 |
End bp | 763364 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640160468 |
Product | putative urea ABC transporter, substrate binding protein |
Protein accession | YP_001091110 |
Protein GI | 126696224 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03407] urea ABC transporter, urea binding protein |
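The coordinate, length, and protein-length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 762087
end_bp = 763364
gene_length_bp = 1278
protein_length_aa = 425

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop,
# which contributes no amino acid to the protein.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # 1278 0 425
```

Under these assumptions the three fields agree: the span equals the reported gene length, it divides evenly into codons, and the codon count minus the stop equals the reported protein length.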
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAATTT CAAGGCGTAT TTTGGCAGGT TTAGCTACTG CCTCACTTGC AGTCACCGCA ACTTCGTGTG GTGGAGGCGG AACCTCCGGA AGTTTCGATG ACACAGTAAC CGTTGGTATT TTGCATTCGT TATCCGGAAC AATGGCTATC TCTGAATCAA CTCTTGTTGA TACAGAAAAA ATGGCTATTG AAGAGATAAA TGCTGCTGGT GGCGTTAAAG TTGGCGGCAA AAGCTACAAA ATAGAATACA TAGTTGAAGA TGGTGCATCT GACTGGCCTA CCTTTGCTGA AAAATCAAAG AAACTTATAG ACCAAGACGG AGTTCCTGTC GTATTTGGTG GATGGACATC TGCAAGTAGA AAGGCAATGT TACCTGTCTA CGAATCAAAA GATGCGTTCC TCTACTACCC GATTCAATAT GAAGCCCAAG AATGTTCTAA TAACATTTTT TATACAGGAG CCACACCAAA CCAACAATCG GAACCTGCTA CAGATTTCAT GTATAAGCGT TCTCCCGCAG CTGGTGGAGA TTTCTTCCTT GTAGGTTCTG ATTATGTTTT CCCAAGAACT TCAAATACAA TCACAAAAGC CCAGGTAAAA CAGTTAGGCG GCAAAGTTGT TGGAGAAGAT TACCTTCCAT TAGGAAATAC TGAAGTTGCT CCAATTATCT CAAAAATCAA AAAGGCGCTT CCTGAAGGTG GAATAATCAT TAATACACTT AATGGTGACC AAAACGTTGC ATTCTTCAAA CAGATTCAAG ACGCTGGTAT CACACCTTCG AGTGGCTACT ACGTAATGAA CTATTCAATT GCTGAAGAAG AGATTAGTAC AATTGGACCT GAGTTCCTTG AAGGTCACTA TGGTGCTTGG AACTACATGA TGTCAATTGA TACTCCTGCA TCAAAGAAAT TTGCTAAAAG TTTCAAGAAA AGATGGGGAG CAGATCGTGT TGTAGCTGAT CCTCAAGAAT CTGCTTACAA CATGGTTTAC TTATGGAAAC AAGCAGTTGA AGATGCTGGA ACTTTCGACG ACAACGCAGT AAGGGAAGCC CTAGTTGGAC AAAAGTTTGA TGCCCCACAA GGTCCAGTTG AAGTTATGCC AAACCATCAC TTATCTCAAA CTGTGAGAAT CGGAGAGATT AATGCAGAAG GTGGCTTTAC AATTCTTGAA GAGACAGGAG TTGTTCTACC TCAAGCATGG AACCAAAAGC ATCCAAGCTC AAAAGGATTT GCATGCGATT GGACAGATCC TTCAAAAGGA GAAAAGTATA AGCTTTAA
Protein sequence | MRISRRILAG LATASLAVTA TSCGGGGTSG SFDDTVTVGI LHSLSGTMAI SESTLVDTEK MAIEEINAAG GVKVGGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKKAL PEGGIIINTL NGDQNVAFFK QIQDAGITPS SGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA SKKFAKSFKK RWGADRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQKFDAPQ GPVEVMPNHH LSQTVRIGEI NAEGGFTILE ETGVVLPQAW NQKHPSSKGF ACDWTDPSKG EKYKL
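The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is listed as "-") and uses the amino-acid assignments of translation table 11, which match the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the special handling of alternative start codons in table 11 is not implemented because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAGAATTT CAAGGCGTAT TTTGGCAGGT"
print(translate(prefix))  # MRISRRILAG
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared against the Protein sequence and GC content fields of the record.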