Gene NATL1_19191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_19191 
Symbol 
ID4779713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1579224 
End bp1580501 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content42% 
IMG OID640085209 
Productputative urea ABC transporter, substrate binding protein 
Protein accessionYP_001015739 
Protein GI124026624 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTTT CAAAGCGCAT TTTTGCAGGT TTAGCTACTG CCTCTTTAGC CGTAACTGTT 
ACTGCTTGTG GTGGATCAGA TTCCTCTGGC AACTTTGATG ACACCGTAAC TGTTGGAATT
CTCCATTCTC TTTCAGGGAC AATGGCAATC TCCGAATCAA CTCTTGTTGA TACAGAGAAA
ATGGCTATTG AGGAAATCAA TGCAGCTGGC GGTGTAACAG TCGACGGTAA AAGCTATAAA
ATTGAATACA TCGTTGAAGA TGGTGCCTCA GATTGGCCTA CCTTTGCAGA GAAATCCAAG
AAGTTAATCG ACCAAGATGG AGTACCAGTA GTCTTTGGCG GCTGGACTTC TGCAAGTCGA
AAGGCAATGC TTCCAGTTTA TGAATCAAAG GATGCATTCC TTTATTACCC AATTCAATAT
GAAGCACAAG AGTGCTCCAA TAACATTTTC TATACAGGAG CGACTCCAAA TCAGCAGTCT
GAGCCTGCTA CTGATTTCAT GTATAAGCGC TCACCAGCTG CTGGAGGAGA TTTCTTCTTA
GTTGGTTCTG ACTATGTTTT CCCAAGAACT TCTAACACAA TTACTAAAGC TCAAGTGAAA
CAACTTGGAG GTAAAGTTGT TGGAGAAGAT TATCTTCCTT TAGGTAATAC AGAGGTGGCA
CCTATTATCT CGAAGATAAA AGTTGCTCTT CCTGATGGTG GAATCATCGT TAACACTTTG
AATGGCGACC AAAACGTTGC TTTCTTCAAA CAAATCCAGG ACGCAGGAAT TACTCCTTCT
AATGGTTATT ACGTAATGAA CTACTCCATT GCGGAAGAAG AGATTAGTAC GATTGGACCT
GAGTTCCTTG AGGGCCACTA TGGTGCTTGG AACTACATGA TGTCTATTGA TACTCCAGCT
TCTAAGAAAT TTGCTAAGAG CTTTAAGAAG AGATGGGGTA GTGATCGTGT TGTAGCTGAT
CCTCAAGAAT CTGCTTATAA CATGGTTTAT CTTTGGAAGC AGGCAGTTGA AGATGCAGGT
ACATTTGATG ACAATGCGGT TAGAGAAGCA TTGGTTGGTC AGACATTCGA TGCTCCTCAG
GGTCCAGTAG AAGTGATGGC AAATCATCAC TTATCTCAAA CAGTGAGAAT CGGTGAAATC
AATGCAGAGG GTGGATTTAC AATCCTTGAA GAAACTGGAG TAGTTGAGCC ACAAGCATGG
AACCAAAAAC ATCCAAGTTC AAAAGGTTAC GCTTGTGATT GGACTGATCC TAAGAAAGGT
GAAAAATATA AGATGTGA
 
Protein sequence
MKLSKRIFAG LATASLAVTV TACGGSDSSG NFDDTVTVGI LHSLSGTMAI SESTLVDTEK 
MAIEEINAAG GVTVDGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR
KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL
VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKVAL PDGGIIVNTL
NGDQNVAFFK QIQDAGITPS NGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA
SKKFAKSFKK RWGSDRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQTFDAPQ
GPVEVMANHH LSQTVRIGEI NAEGGFTILE ETGVVEPQAW NQKHPSSKGY ACDWTDPKKG
EKYKM