Gene NATL1_06881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06881 
Symbol 
ID4781008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp632978 
End bp634027 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID640083964 
Productlight-harvesting complex protein 
Protein accessionYP_001014513 
Protein GI124025397 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCT ATGGAAATTC AGCTGTCACC TACGGGTGGT GGGCTGGTAA CTCAGGGGTC 
ACCAACCGTT CAGGCAAATT TATTGCTGCT CATGCCGCTC ATACCGGTTT GATTGCTTTC
TGGGCTGGTG CCTTCACGTT ATTTGAATTG GCTCGATTTG ACCCTTCTGT ACCAATGGGT
CATCAGCCTT TAATTGCGCT TCCTCATTTA GCAACTTTGG GTATAGGTTT CGATGAAACT
GGAACCTTTG TTGGTGGAAG TGCGGTTGTT GCAGTTGCTG TATGTCATCT AGTTGGATCC
ATGGCTTATG GGGCAGGTGG ATTGATGCAC TCTCTTCTTT TCTCTAGTGA CATGCAGGAA
TCTTCTGTGC CACAGGCCAG AAAATTCAAG CTTGAATGGG ACAATCCAGA TAACCAGACT
TTCATCCTTG GACATCATTT GATTTTCTTT GGTGTTGCAT GTATTTGGTT TGTTGAATGG
GCGCGAATTC ATGGGGTCTA CGATCCTGCT ATTGGTGCTA TTCGTCAGGT TGAATACGAC
CTTAATTTGA GTCATATCTG GGACCATCAG TTTGATTTCC TAACTATTGA CAGCTTGGAA
GATGTTATGG GAGGTCATGC TTTCTTGGCT TTCTTAGAAA TTACTGGTGG TGCTTTCCAC
ATCGCTACTA AGCAAGTTGG AGAATACACG AAGTTCAAAG GAGCTGGGCT TCTTTCTGCA
GAAGCTATTC TTTCTTGGTC ACTAGCTGGT ATTGGCTGGA TGGCAGTTGT CGCAGCATTC
TGGAGTGCAA CAAATACCAC TGTTTACCCT GTTGAATGGT TTGGAGAACC ACTAGCACTT
AAATTTGGAA TATCTCCTTA TTGGGTAGAT ACTGTTGATT TAGGTTCAGC CCACACTTCT
AGGGCTTGGC TAGCTAATGT TCACTACTAC TTTGGATTCT TCTTTATTCA GGGTCACCTA
TGGCATGCTC TTAGGGCAAT GGGATTCGAT TTCAAACGAG TAACTAGTGC CTTAAGTAAT
CTTGATACTG CTTCAGTATC TTTAAAATAG
 
Protein sequence
MQTYGNSAVT YGWWAGNSGV TNRSGKFIAA HAAHTGLIAF WAGAFTLFEL ARFDPSVPMG 
HQPLIALPHL ATLGIGFDET GTFVGGSAVV AVAVCHLVGS MAYGAGGLMH SLLFSSDMQE
SSVPQARKFK LEWDNPDNQT FILGHHLIFF GVACIWFVEW ARIHGVYDPA IGAIRQVEYD
LNLSHIWDHQ FDFLTIDSLE DVMGGHAFLA FLEITGGAFH IATKQVGEYT KFKGAGLLSA
EAILSWSLAG IGWMAVVAAF WSATNTTVYP VEWFGEPLAL KFGISPYWVD TVDLGSAHTS
RAWLANVHYY FGFFFIQGHL WHALRAMGFD FKRVTSALSN LDTASVSLK