Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_19191 |
Symbol | |
ID | 4779713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1579224 |
End bp | 1580501 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640085209 |
Product | putative urea ABC transporter, substrate binding protein |
Protein accession | YP_001015739 |
Protein GI | 124026624 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03407] urea ABC transporter, urea binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTTT CAAAGCGCAT TTTTGCAGGT TTAGCTACTG CCTCTTTAGC CGTAACTGTT ACTGCTTGTG GTGGATCAGA TTCCTCTGGC AACTTTGATG ACACCGTAAC TGTTGGAATT CTCCATTCTC TTTCAGGGAC AATGGCAATC TCCGAATCAA CTCTTGTTGA TACAGAGAAA ATGGCTATTG AGGAAATCAA TGCAGCTGGC GGTGTAACAG TCGACGGTAA AAGCTATAAA ATTGAATACA TCGTTGAAGA TGGTGCCTCA GATTGGCCTA CCTTTGCAGA GAAATCCAAG AAGTTAATCG ACCAAGATGG AGTACCAGTA GTCTTTGGCG GCTGGACTTC TGCAAGTCGA AAGGCAATGC TTCCAGTTTA TGAATCAAAG GATGCATTCC TTTATTACCC AATTCAATAT GAAGCACAAG AGTGCTCCAA TAACATTTTC TATACAGGAG CGACTCCAAA TCAGCAGTCT GAGCCTGCTA CTGATTTCAT GTATAAGCGC TCACCAGCTG CTGGAGGAGA TTTCTTCTTA GTTGGTTCTG ACTATGTTTT CCCAAGAACT TCTAACACAA TTACTAAAGC TCAAGTGAAA CAACTTGGAG GTAAAGTTGT TGGAGAAGAT TATCTTCCTT TAGGTAATAC AGAGGTGGCA CCTATTATCT CGAAGATAAA AGTTGCTCTT CCTGATGGTG GAATCATCGT TAACACTTTG AATGGCGACC AAAACGTTGC TTTCTTCAAA CAAATCCAGG ACGCAGGAAT TACTCCTTCT AATGGTTATT ACGTAATGAA CTACTCCATT GCGGAAGAAG AGATTAGTAC GATTGGACCT GAGTTCCTTG AGGGCCACTA TGGTGCTTGG AACTACATGA TGTCTATTGA TACTCCAGCT TCTAAGAAAT TTGCTAAGAG CTTTAAGAAG AGATGGGGTA GTGATCGTGT TGTAGCTGAT CCTCAAGAAT CTGCTTATAA CATGGTTTAT CTTTGGAAGC AGGCAGTTGA AGATGCAGGT ACATTTGATG ACAATGCGGT TAGAGAAGCA TTGGTTGGTC AGACATTCGA TGCTCCTCAG GGTCCAGTAG AAGTGATGGC AAATCATCAC TTATCTCAAA CAGTGAGAAT CGGTGAAATC AATGCAGAGG GTGGATTTAC AATCCTTGAA GAAACTGGAG TAGTTGAGCC ACAAGCATGG AACCAAAAAC ATCCAAGTTC AAAAGGTTAC GCTTGTGATT GGACTGATCC TAAGAAAGGT GAAAAATATA AGATGTGA
|
Protein sequence | MKLSKRIFAG LATASLAVTV TACGGSDSSG NFDDTVTVGI LHSLSGTMAI SESTLVDTEK MAIEEINAAG GVTVDGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKVAL PDGGIIVNTL NGDQNVAFFK QIQDAGITPS NGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA SKKFAKSFKK RWGSDRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQTFDAPQ GPVEVMANHH LSQTVRIGEI NAEGGFTILE ETGVVEPQAW NQKHPSSKGY ACDWTDPKKG EKYKM
|
| |