Gene NATL1_03601 details

Gene Information

Locus tag: NATL1_03601
Symbol:
ID: 4779660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 332724
End bp: 334217
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 35%
IMG OID: 640083628
Product: retinal pigment epithelial membrane protein
Protein accession: YP_001014189
Protein GI: 124025073
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
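
The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein (the variable names are illustrative and not part of the record):

```python
# Values copied from the record above.
start_bp = 332724
end_bp = 334217

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1494 497"
```

Both results match the reported Gene Length (1494 bp) and Protein Length (497 aa).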


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTTA GTTATTTAAA AAGAGAAACA TCACCAAAGC AAACAATCTT CAACAAAGAA 
GATTGGTCCA GTGCATATTG CAATGTTGAA AAAGAATTAG ATCACGTTCA ACTCAAGCTT
GTAAAAGGAT CTATTCCTGA ACAAATTTCT GGTACCTTTT ATCGAAACGG GCCAGGTAGA
TTAGAAAGAG GGGGAAGATG GGTCCATCAT CCATTTGATG GAGATGGCAT GATTGCTGCC
TTCAAATTTG ACAATGGAAA AATAAACCTG ACGAATCGTT TTGTTCGCAC AAAGGAATGG
ACAGAAGAAG AAAAATCCCA AAAATTTCTA TATAGAGGTG TATTTGGAAC TCAAAAAGAA
GGAGGAGTGT TAGCTAATGC TTTTGATGTA AGGCTAAAAA ATATTGCCAA TACACACGTA
ATAAAACTTG GAGATGATCT ACTAGCTTTA TGGGAAGCAT CTAGTCCATA TTCACTTAAT
CCAAATACCC TTGAGACCAA AGGTTTATCA AATTTAAAAG GAGTTTTAAA AAAAGGTGAA
GCATTTAGTG CTCATCCACG ATTTGACCCT GGCCATCATC AAAGTCAAAG AATGGTCACT
TTTGGGGTAT CTACAGGTCC TAAAAGCACA ATAAGATTGA TGGAATTTTC CACAAAGGGA
GAAAATATTG GTTCTCTTTT AAGTGATAGA AAAGATTCTT TTAATGGATT TGCGTTCTTG
CATGATTTTG CCATAACTCC AAACTGGGCA ATATTTCTGC AAAATGCTAT TAGTTTTAAT
CCTCTTCCTT TTCTTCTTGG ACAAAAAGGA GCCGCACAAT GTTTAGCCTC TAAAAGTGAT
GGAACTCCAA AATTTTTATT AATTCCAAGG GACTCTGGCA AGTTTGCTGG TCAACCTCCA
AAATCAGTTG ATGCTCCAAA GGGTTTTGTT TTTCATCATC TAAATGCATG GGAAGATAAT
GAAAAAATCA ATATTGAAAG TATTTTTTAT GATGATTTTC CGAGCATTGG ACCCGAAGAT
AATTTTAGAG AAATTGATTT TGATCTTTTA CCAGAAGGAA TTTTGAAAAG AAGTGAAATC
AATCCCATAG AAAATACATT TACCTGCTCA ACAATAAGCA ATCAATGTTG TGAATTTGCA
ATGGTTAATC CTCATTTTGA AGGATTAAAG GCCCGCTTTA GTTGGATGGC AACTGCAGAA
GAAAAAGAGG GGAATGGGCC ACTTCAAGCC ATAAAAAAAA TCGATTTATC TAATAATAAA
GAGATAAGTT GGAGTGCGGC TCCAAGAGGT TTTGTAAGTG AACCTATATT TATTCCATCT
CAAGAATCAA AGTCTGAAGA AGACAATGGA TGGGTTGTTG CATTGGTTTG GAATAGTATT
AGATCGGGAA CTGATTTAAT AATTCTTGAT TCTAAAGATC TGACTGAAAA AGCTATTCTT
GAAGTTCCAA TATCAATTCC ACATGGATTA CACGGAAGCT GGGTTGAAAA TTAA
 
Protein sequence
MAVSYLKRET SPKQTIFNKE DWSSAYCNVE KELDHVQLKL VKGSIPEQIS GTFYRNGPGR 
LERGGRWVHH PFDGDGMIAA FKFDNGKINL TNRFVRTKEW TEEEKSQKFL YRGVFGTQKE
GGVLANAFDV RLKNIANTHV IKLGDDLLAL WEASSPYSLN PNTLETKGLS NLKGVLKKGE
AFSAHPRFDP GHHQSQRMVT FGVSTGPKST IRLMEFSTKG ENIGSLLSDR KDSFNGFAFL
HDFAITPNWA IFLQNAISFN PLPFLLGQKG AAQCLASKSD GTPKFLLIPR DSGKFAGQPP
KSVDAPKGFV FHHLNAWEDN EKINIESIFY DDFPSIGPED NFREIDFDLL PEGILKRSEI
NPIENTFTCS TISNQCCEFA MVNPHFEGLK ARFSWMATAE EKEGNGPLQA IKKIDLSNNK
EISWSAAPRG FVSEPIFIPS QESKSEEDNG WVVALVWNSI RSGTDLIILD SKDLTEKAIL
EVPISIPHGL HGSWVEN
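
The gene and protein sequences above are related by translation table 11, whose amino-acid assignments are the same as those of the standard genetic code. The following is a minimal sketch of that relationship using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool, and the example uses only the first 60 bases of the gene sequence rather than the full record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position
# (first base varies slowest). '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGGCAGTTA GTTATTTAAA AAGAGAAACA TCACCAAAGC AAACAATCTT CAACAAAGAA"
print(translate(first_line))  # prints "MAVSYLKRETSPKQTIFNKE"
```

The printed residues match the first 20 residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would give a value to compare against the reported GC content.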