Gene Information |
Locus tag | A9601_08881 |
Symbol | |
ID | 4717594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 763633 |
End bp | 764910 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640078600 |
Product | putative urea ABC transporter, substrate binding protein |
Protein accession | YP_001009279 |
Protein GI | 123968421 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03407] urea ABC transporter, urea binding protein |
|
|
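The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 763633
end_bp = 764910

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1278 bp) and Protein Length (425 aa) fields.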
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTT CAAGGCGTAT TTTGGCAGGT TTAGCTACTG CCTCACTTGC GGTCACCGCA ACTTCGTGTG GTGGAGGCGG AACCTCAGGA AGTTTCGATG ACACAGTCAC CGTTGGTATT TTGCATTCGT TATCCGGAAC AATGGCTATC TCTGAATCAA CTCTTGTTGA TACAGAAAAA ATGGCTATTG AAGAGATAAA TGCTGCTGGT GGTGTTAAAG TTGGCGGCAA AAGCTACAAA ATAGAATACA TAGTCGAAGA TGGTGCATCT GACTGGCCAA CTTTTGCTGA AAAATCAAAA AAACTTATAG ACCAAGACGG AGTTCCTGTC GTATTTGGTG GATGGACCTC TGCAAGTAGA AAGGCAATGT TACCTGTCTA CGAATCAAAA GATGCGTTCC TCTATTACCC AATTCAATAT GAAGCTCAGG AATGTTCTAA CAACATTTTC TATACAGGAG CCACACCAAA CCAACAATCG GAACCTGCTA CAGATTTCAT GTATAAGCGT TCCCCCGCAG CTGGTGGAGA TTTTTTCCTT GTAGGTTCTG ATTATGTTTT CCCAAGAACT TCAAACACAA TCACGAAAGC TCAGGTAAAA CAGTTAGGCG GTAAAGTTGT TGGAGAAGAT TACCTTCCAT TAGGAAATAC TGAAGTTGCT CCGATTATCT CAAAAATCAA AAAAGCGTTG CCTGACGGTG GAATAATCAT TAATACTCTT AATGGTGACC AAAACGTTGC ATTCTTCAAA CAGATTCAAG ACGCCGGTAT CACACCTTCA AGTGGATATT ACGTAATGAA CTATTCAATT GCTGAAGAAG AGATTAGCAC AATTGGTCCT GAGTTCCTTG AAGGTCACTA CGGTGCTTGG AACTACATGA TGTCAATTGA TACTCCTGCA TCTAAGAAAT TTGCTGCAAG TTTCAAGAAA AGATGGGGAG CAGATCGTGT TGTAGCTGAT CCTCAAGAAT CTGCTTATAA CATGGTCTAC TTATGGAAAC AAGCAGTAGA AGATGCTGGA ACATTCGACG ACAACGCAGT TAGGGAAGCC CTTGTTGGAC AAAAGTTTGA TGCTCCACAA GGTCCTGTTG AAGTTATGCC TAATCACCAC TTATCTCAAA CAGTGAGAAT TGGTGAGATT AATGCTGAGG GTGGATTTAC AATCCTTGAA GAGACAGGAG TTGTACTTCC TCAAGCATGG AACCAAAAGC ATCCAAGTTC AAAAGGATTT GCATGCGATT GGACAGATCC TTCAAAAGGA GAAAAGTATA AGCTTTAA
|
Protein sequence | MRISRRILAG LATASLAVTA TSCGGGGTSG SFDDTVTVGI LHSLSGTMAI SESTLVDTEK MAIEEINAAG GVKVGGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKKAL PDGGIIINTL NGDQNVAFFK QIQDAGITPS SGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA SKKFAASFKK RWGADRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQKFDAPQ GPVEVMPNHH LSQTVRIGEI NAEGGFTILE ETGVVLPQAW NQKHPSSKGF ACDWTDPSKG EKYKL
|
| |
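The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand), builds the codon map from the standard NCBI amino-acid ordering that translation table 11 shares with table 1, and uses a hypothetical helper name `translate`. Only the first and last blocks of the sequence are used here.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string; spaces are ignored."""
    seq = dna.replace(" ", "").upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)

# First three 10-base blocks and the final 18 bases of the gene sequence.
first_blocks = "ATGAGAATTT CAAGGCGTAT TTTGGCAGGT"
last_bases = "GAAAAGTATA AGCTTTAA"

print(translate(first_blocks))  # MRISRRILAG
print(translate(last_bases))    # EKYKL*
```

The first ten residues and the final five residues agree with the protein sequence in the record, and the trailing `*` marks the TAA stop codon.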