Gene PMN2A_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0066 
Symbol 
ID3605473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp620635 
End bp621684 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID637686921 
Productchlorophyll a/b binding light harvesting protein PcbA 
Protein accessionYP_291261 
Protein GI72381906 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACCT ATGGAAATTC AGCCGTCACC TACGGGTGGT GGGCTGGTAA CTCAGGGGTC 
ACCAACCGTT CAGGCAAATT TATTGCTGCT CATGCCGCTC ATACCGGTTT GATTGCTTTC
TGGGCTGGTG CCTTCACGTT ATTTGAATTG GCTCGATTTG ACCCTTCTGT ACCAATGGGT
CATCAGCCTT TAATTGCGCT TCCTCATTTA GCAACTTTGG GTATAGGTTT CGATGAAACT
GGAACCTTTG TTGGTGGAAG TGCGGTTGTT GCAGTTGCTG TGTGTCATCT AGTTGGATCC
ATGGCTTATG GGGCAGGTGG ATTGATGCAC TCTCTTCTTT TCTCTAGTGA CATGCAGGAA
TCTTCTGTGC CACAGGCCAG AAAATTCAAG CTTGAATGGG ACAACCCAGA TAACCAGACT
TTCATCCTTG GACATCATTT GATTTTCTTT GGTGTTGCAT GTATTTGGTT TGTTGAATGG
GCGCGAATTC ATGGGGTCTA CGATCCTTCT ATTGGTGCTA TTCGTCAGGT TGAATACGAC
CTTAATTTGA GTCATATCTG GGATCATCAG TTTGATTTTC TAACTATTGA CAGCTTGGAG
GATGTTATGG GAGGTCATGC TTTCTTGGCT TTCTTAGAAA TTACTGGTGG TGCTTTCCAT
ATCGCTACTA AGCAAGTTGG AGAATATACG AAGTTCAAAG GAGCTGGGCT TCTTTCTGCA
GAAGCTATTC TTTCTTGGTC ACTAGCTGGT ATTGGCTGGA TGGCAGTTGT CGCAGCATTC
TGGAGTGCAA CAAATACCAC TGTTTACCCT GTTGAATGGT TTGGAGAACC ACTAGCACTT
AAATTTGGAA TATCTCCTTA TTGGGTAGAT ACTGTTGATT TAGGTTCAGC CCACACTTCT
AGGGCTTGGC TAGCTAATGT TCATTACTAC TTTGGATTTT TCTTTATTCA GGGTCACCTA
TGGCATGCTC TTAGGGCAAT GGGATTCGAT TTCAAACGAG TAACTAGTGC CTTAAGTAAT
CTTGATACTG CTTCAGTATC TTTAAAATAG
 
Protein sequence
MQTYGNSAVT YGWWAGNSGV TNRSGKFIAA HAAHTGLIAF WAGAFTLFEL ARFDPSVPMG 
HQPLIALPHL ATLGIGFDET GTFVGGSAVV AVAVCHLVGS MAYGAGGLMH SLLFSSDMQE
SSVPQARKFK LEWDNPDNQT FILGHHLIFF GVACIWFVEW ARIHGVYDPS IGAIRQVEYD
LNLSHIWDHQ FDFLTIDSLE DVMGGHAFLA FLEITGGAFH IATKQVGEYT KFKGAGLLSA
EAILSWSLAG IGWMAVVAAF WSATNTTVYP VEWFGEPLAL KFGISPYWVD TVDLGSAHTS
RAWLANVHYY FGFFFIQGHL WHALRAMGFD FKRVTSALSN LDTASVSLK