Gene PMN2A_1048 details


Gene Information

Locus tag: PMN2A_1048
Symbol:
ID: 3606435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1542633
End bp: 1543910
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 42%
IMG OID: 637687918
Product: putative urea ABC transporter, substrate binding protein
Protein accession: YP_292241
Protein GI: 72382886
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03407] urea ABC transporter, urea binding protein
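
The coordinate and length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1542633, 1543910))  # (1278, 425)
```

Under these assumptions the result agrees with the Gene Length (1278 bp) and Protein Length (425 aa) fields.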


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.293483
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTTT CAAAGCGCAT TTTTGCAGGT TTAGCTACTG CCTCTTTAGC CGTAACTGTT 
ACTGCTTGTG GTGGATCAGA TTCCTCTGGC AACTTTGACG ACACCGTAAC TGTTGGAATT
CTCCATTCTC TTTCAGGGAC AATGGCAATC TCGGAATCAA CTCTTGTTGA TACAGAGAAA
ATGGCTATTG AGGAAATCAA TGCAGCTGGC GGTGTAACAG TCGACGGTAA AAGCTATAAA
ATTGAATACA TCGTTGAAGA TGGTGCCTCA GATTGGCCTA CCTTTGCAGA GAAATCTAAG
AAGTTAATCG ACCAGGATGG AGTACCAGTA GTCTTTGGCG GCTGGACTTC TGCAAGTCGA
AAGGCAATGC TTCCAGTTTA TGAATCAAAA GATGCATTCC TTTATTACCC AATTCAATAT
GAAGCACAAG AGTGCTCCAA TAACATTTTC TATACAGGAG CGACTCCAAA TCAGCAGTCT
GAGCCTGCCA CTGATTTCAT GTATAAGCGC TCTCCAGCTG CTGGAGGAGA TTTCTTCTTA
GTTGGTTCTG ATTATGTTTT TCCAAGAACT TCTAACACAA TTACTAAAGC TCAAGTGAAA
CAACTTGGCG GAAAAGTTGT TGGAGAAGAT TATCTTCCTT TAGGTAATAC TGAGGTAGCA
CCTATTATCT CGAAGATAAA AGTTGCTCTT CCTGATGGTG GAATCATAGT TAACACTTTG
AATGGTGACC AAAACGTTGC TTTCTTCAAA CAAATCCAGG ACGCAGGAAT CACTCCTTCT
AATGGTTATT ACGTAATGAA CTACTCCATT GCGGAAGAAG AGATTAGTAC GATTGGACCT
GAGTTCCTTG AGGGCCACTA TGGTGCTTGG AACTACATGA TGTCTATTGA TACGCCAGCT
TCTAAGAAAT TTGCTAAGAG CTTTAAGAAG AGATGGGGTA GTGATCGTGT TGTGGCTGAT
CCTCAAGAAT CTGCCTATAA CATGGTTTAT CTTTGGAAGC AGGCAGTTGA AGATGCAGGT
ACATTTGATG ACAATGCGGT TAGAGAAGCA TTGGTTGGTC AGACATTCGA TGCTCCTCAG
GGTCCAGTAG AAGTTATGGC AAATCATCAC CTATCTCAAA CAGTGAGAAT CGGTGAAATC
AATGCAGAGG GTGGATTTAC AATCCTTGAA GAAACTGGAG TAGTTGAGCC ACAAGCATGG
AACCAAAAAC ATCCAAGTTC AAAAGGTTAC GCTTGTGATT GGACTGATCC TAAGAAAGGT
GAAAAATATA GGATGTGA
 
Protein sequence
MKLSKRIFAG LATASLAVTV TACGGSDSSG NFDDTVTVGI LHSLSGTMAI SESTLVDTEK 
MAIEEINAAG GVTVDGKSYK IEYIVEDGAS DWPTFAEKSK KLIDQDGVPV VFGGWTSASR
KAMLPVYESK DAFLYYPIQY EAQECSNNIF YTGATPNQQS EPATDFMYKR SPAAGGDFFL
VGSDYVFPRT SNTITKAQVK QLGGKVVGED YLPLGNTEVA PIISKIKVAL PDGGIIVNTL
NGDQNVAFFK QIQDAGITPS NGYYVMNYSI AEEEISTIGP EFLEGHYGAW NYMMSIDTPA
SKKFAKSFKK RWGSDRVVAD PQESAYNMVY LWKQAVEDAG TFDDNAVREA LVGQTFDAPQ
GPVEVMANHH LSQTVRIGEI NAEGGFTILE ETGVVEPQAW NQKHPSSKGY ACDWTDPKKG
EKYRM
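
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it assumes the sense-codon assignments of translation table 11, which match the standard genetic code. It does not handle alternative start codons, which is acceptable here because the gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering (standard code; the
# sense-codon assignments are the same in translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above.
first_row = "ATGAAGCTTT CAAAGCGCAT TTTTGCAGGT TTAGCTACTG CCTCTTTAGC CGTAACTGTT"
print(translate(first_row))  # MKLSKRIFAGLATASLAVTV
```

Passing the full gene sequence to `translate` should give the complete protein, and `gc_percent` can be used to check the GC content field in the same way.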