Gene NATL1_07091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07091 
Symbol 
ID4780303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp651651 
End bp652661 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content35% 
IMG OID640083983 
ProductFAD linked oxidase, N-terminal 
Protein accessionYP_001014532 
Protein GI124025416 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.105041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.609294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAT TATGCTCTTT TCTACAAAAG CATAAAAGAA GTTTTCCAAT TGGTTTGTCT 
GGAAATACAG GAATGGGATA CATCTTGACT GGTGGAATAA GTCCACTCAG TAGGAGCAAA
GGATTAGCAA TTGATCAAAT CATAGAGATT AAGGGTTTTT GGGGAAACGG AGAAGAGTTT
CATTTACTAC GACCAAATAC GAAAAATGAG TTGACTAATG AATGGAAAGC CCTTTGCGGA
GCAGCTATAT TTCTTGGCAT AATCACACAG GTAAAGTTAA AGACTCAGCC ATTAAGACCA
CTATTAAGCT GGACAGCAAA CCTTTCATTT TCTCAGCTAT CTGAATGTAT TAATCAAGCC
GAAAGTTGGC CTAATTCTCT TAGCCTTCAA TGGATATATG GAGACGATAT TTTTGCTCAT
GCAATTGGTG AAATTGAAAA TAATGATGAT GAATCCGTCT TGATTAAATT ATTAGAAAAA
TTACCATTCT CTCGAAATAG AATCATTAAT AAATTCAATA ATATGAAGTC TTTACCTAAT
TTAAGCCTTG GAGATAATAA TAATTATAAT CATTCAAATC ATTCTGAAGT GCTTGGGTTA
TTAGGCCCTG CTTGGCAAGA AAAAAATCAA CAAGTATTAA AAATCCTTAA AGAATTAATA
AATAAAAGGC CGAACAAAAG TTGTTATATA GCTTCTCAAC AATTAGGAGG TTTAACACAT
TTAAATGATC TTGACACTTC TTTTATTCAT CGTGATGCAA TCTGGAAACC TTGGATTAAC
GGTGCTTGGG AAGCCCACAA TCAAGCCGAG AGAAAAAGAA CTCTGGAATG GATGACAGAG
TGTTGGAATA ATCTAGAATT CATATGCCCT GGGGTTCATC TTGCGCAAAT ACACCCACAT
TTAGAATGGC ATAAAAAAGA ATTATCATCT GCATTCAAAG ATTGGCTCCC AAACTTAGAA
GAGCTCAAAG CCATTCATGA TCCAGAAAAT ATAATGCCAC CATTAAAATA G
 
Protein sequence
MGKLCSFLQK HKRSFPIGLS GNTGMGYILT GGISPLSRSK GLAIDQIIEI KGFWGNGEEF 
HLLRPNTKNE LTNEWKALCG AAIFLGIITQ VKLKTQPLRP LLSWTANLSF SQLSECINQA
ESWPNSLSLQ WIYGDDIFAH AIGEIENNDD ESVLIKLLEK LPFSRNRIIN KFNNMKSLPN
LSLGDNNNYN HSNHSEVLGL LGPAWQEKNQ QVLKILKELI NKRPNKSCYI ASQQLGGLTH
LNDLDTSFIH RDAIWKPWIN GAWEAHNQAE RKRTLEWMTE CWNNLEFICP GVHLAQIHPH
LEWHKKELSS AFKDWLPNLE ELKAIHDPEN IMPPLK