Gene NATL1_04541 details


Gene Information

Locus tag: NATL1_04541
Symbol: fadD
ID: 4779165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 414541
End bp: 416514
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 37%
IMG OID: 640083731
Product: putative long-chain-fatty-acid--CoA ligase
Protein accession: YP_001014283
Protein GI: 124025167
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
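
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 414541
END_BP = 416514
GENE_LENGTH_BP = 1974
PROTEIN_LENGTH_AA = 657


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 416514 - 414541 + 1 gives 1974 bp, and 1974 / 3 gives 658 codons, which is 657 residues plus one stop codon.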


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAA CTAACTCACA AAAAAATATA TTTTCTTCTG AGGATGCTTT GGCTTTTTGG 
ATCCCTAATA GGAAAGAGGA ACAAGCAATT AATAGAAGAT CTCATCTTAA TAAGATCACT
CAAGTTGACG AGATTTGGGA ACTATTAAAA GTTCAGTTAG GGGAGGTATT GGCGGTTGAT
TCACCACATA CTTTCCACCC AGAAAGCTTT ACATATGAAG AACTTGCAGA AAATATTTCT
ATGGCTGCAT CTGCTTTTTC TCAGGTCGGA GTAGAACCTG ATGAGGTGGT CGCTCTTTTT
GCTGAAAATA GTCCTAGATG GCTCATCGCC GATCAAGGTC TTATGCGAAT AGGTGCAACA
GACTCTGTTA GAGGTGCGAC TGCTCCACCA AGTGAACTTA GATATATTCT TGAGGATTCA
AAAGCGGTTG GATTGATTGT TCAAAACTCC GATGTATGGG AAAGACTTTC GCTTAATGAT
GATCAAATCA ATAGTTTGAA ATTTGTTCTT CAACTTGAGG GTAAAGCTTG TGAAGGCGTT
TTTGAATGGG AGACTTTTTT GAAGAAAGGA TTGAATATAG AAAATGTAAG TAAACAGGAG
AAAATAATTG ATAGGCAACA AAAAAGAATA GCAACCATTT TATATACTTC TGGAACGACT
GGTAAACCTA AAGGAGTTCC ACTAACACAT TCTAATTTAT TACATCAAAT CAGGTCTCTC
GCTTGTGTGG CGAACCCTTC TCCAGGTGCG CCTGTATTAA GTGTCTTGCC AATTTGGCAT
TCATATGAAC GTAGTGCAGA ATATTATTTT TTTTCTTGTG GTTGTACTCA AACATATACA
TCAATTAGGC ATCTTAAGGA AGATTTGCCA AGGGTTAAGC CAATAGTAAT GGCAACAGTT
CCAAGGCTTT GGGAGTCAAT AAAGTTAGGG TTTGAGGATG CTGTTGATAA AATGCCGAGA
CTGAGAAAGA CTTTGATAAA AAGTGCGATT TCTAATAGTA AGGCATATAA ATTGGCACGA
AGAAAACTTT ATTTTTTAAC TATTGAGAGT GTTTCTAGTT TTGAACAACT TATCTCTTGT
ATAGAGATTC TTTTACGGTA CCCCATTCAT AGGATTTCCT CTATTTATTT ATGGCCAAAA
ATCCTTACCA AGATTTGTGG AGGAAAATTG AGATTCCCAA TTAGTGGTGG AGGTGCAATT
GCTCCGCATA TTGATTCTTT TTTTGAAGCT TTAGGTGTTG AGTTATTAGT TGGTTATGGC
TTAACAGAAA CTAGTCCAGT CCTCACATGT AGAAGGCCTT GGAGAAATAT ACGTGGGGGT
GCAGGTCAAC CATTGCCAGA GACTGAGATA AAAATAGTAG ATCCTGAAAC ATTCCAAATA
AAAAAACTGC GTCAAAAAGG TTTGGTTCTT GCGCGTGGTC CTCAAATAAT GTCTGGTTAT
TTAGGGAAAC GATCTGAATC GAAGAAGGTT TTAGATGCTA CTGGCTGGTT TAATACTGGA
GATTTAGGGA TGTTGCTTTC TGATGGATCC TTAATCCTGA CTGGAAGAGC AAAAGATACG
ATTGTGCTAA GTAGTGGAGA AAATATTGAG CCTGGACCTT TGGAGGAATG TTTGATTGCT
AGTCCATTGA TTGAGCAGGC TTTGCTATTG GGTCAAGATC AAAAATATCT TGCTGCTTTG
ATTGTTCCAA GAATTGATCA CGTAAAAGAA TGGCTGGCAG GAAGGGGTGT GAATTCAAAA
GTTGTTCTTG GAATATCTCC TGCAAATTAT GAATTAAGAC AATCTCTAAA ATTGGAAATG
AATCAAGCAC TGGCAAATCG ACTCGGATCT AGAAGAGAAG AGAGATTATT TTCAATTGCC
TTGGTAGAGC CATTTACAAT AGAAAATGGC TTGTTAACGC AAACTCTTAA GCAAAAACGT
GAAAACATTA TCCAACGAGA TTTGAAACTT ATAAATGAAA TATATGGTTT GTAA
 
Protein sequence
MTKTNSQKNI FSSEDALAFW IPNRKEEQAI NRRSHLNKIT QVDEIWELLK VQLGEVLAVD 
SPHTFHPESF TYEELAENIS MAASAFSQVG VEPDEVVALF AENSPRWLIA DQGLMRIGAT
DSVRGATAPP SELRYILEDS KAVGLIVQNS DVWERLSLND DQINSLKFVL QLEGKACEGV
FEWETFLKKG LNIENVSKQE KIIDRQQKRI ATILYTSGTT GKPKGVPLTH SNLLHQIRSL
ACVANPSPGA PVLSVLPIWH SYERSAEYYF FSCGCTQTYT SIRHLKEDLP RVKPIVMATV
PRLWESIKLG FEDAVDKMPR LRKTLIKSAI SNSKAYKLAR RKLYFLTIES VSSFEQLISC
IEILLRYPIH RISSIYLWPK ILTKICGGKL RFPISGGGAI APHIDSFFEA LGVELLVGYG
LTETSPVLTC RRPWRNIRGG AGQPLPETEI KIVDPETFQI KKLRQKGLVL ARGPQIMSGY
LGKRSESKKV LDATGWFNTG DLGMLLSDGS LILTGRAKDT IVLSSGENIE PGPLEECLIA
SPLIEQALLL GQDQKYLAAL IVPRIDHVKE WLAGRGVNSK VVLGISPANY ELRQSLKLEM
NQALANRLGS RREERLFSIA LVEPFTIENG LLTQTLKQKR ENIIQRDLKL INEIYGL
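
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as plain text with the 10-base grouping spaces still present. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard assignment string is used here.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the matching amino acids
# (the assignment shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence in this record (60 bases).
    first_line = (
        "ATGACAAAAA CTAACTCACA AAAAAATATA "
        "TTTTCTTCTG AGGATGCTTT GGCTTTTTGG"
    )
    print(translate(first_line))
    print(round(gc_content(first_line), 1))
```

Translating the first 60 bases this way gives the 20 residues MTKTNSQKNIFSSEDALAFW, which match the start of the protein sequence listed above. Running the same functions on the full gene sequence would allow the Protein Length and GC content fields to be checked as well.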