Gene NATL1_00571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00571 
Symbol 
ID4779895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp59689 
End bp61467 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content36% 
IMG OID640083320 
Productflavoprotein 
Protein accessionYP_001013886 
Protein GI124024770 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTTA CTAAATCGAC AGCTTTAGAA AATCAGAAAA GTGATACTCC TCCTAAGCTT 
TCACTGCAAT ATGAACAAAT TGCTACGGAT ACTCACACTT TAAGATCATT GGATTGGGAT
CGAAGCAGGT TTGATATTGA ATTTGGCCTT CGAAATGGGA CAACTTATAA TAGCTTTTTA
ATAAGAGGCA AAAAAACTGC ACTTATAGAT ACAAGTCATT TAAAGTTTAA AGATATTTGG
TTTGAAAAAC TTAGACAAGA AATCAATCCA ACTGACATTG ATTATTTAAT AGTCAGTCAC
ACAGAGCCAG ATCACTCAGG ACTGATAAAA TATTTAATTG AATTAAACCC AAATATTGAA
ATCGTCGCAT CTAAAGTAGC TATAAAATTC TTAGAAGACC AAATTCATCA ACCTTTTAAG
TCAAGAGCAG TCAAAAGCGG TGAAGAACTC AATTTAGAAT TTAATTCAAT CAGCGGAATT
GAGCATAGAA TTGAATTTAT TAGTGCACCA AATCTACACT GGCCAGATAC TATTTTTTCT
TTTGATCACG GTACACAAGT TATATATACC TGTGACGCAT TTGGTTTACA TTACTGTTCA
GAAAAATTAT ATGACGAAAA TCCAAGTCTA TTAAATGAAG ATTTTCGATT TTATTATGAC
TGCCTCATGG GTCCAAATGC CCGCAGCGTA GTTCAGGCAT TAAAAAAAAT TGATTCATTG
TCAACTATCC AGACTATTGG TGTAGGACAT GGACCAATCC TGAATTTTAA CACGCAGTTA
TGGCTGAATC ATTACAGAGA ATGGAGTAAG CAAAAAAGCA CGGGTGAAAA CTATGCTGTG
GTTTGCTATC TCAGTCAATA TGGTTTTTGT GACCGGTTAA GCCAAGCAAT TGCTCATGGA
ATAGGCAAGG CAAATGCCCA AGTCCAATTA GTTGATCTTA TTGCTTCAGA TACTCAAGAG
TTAAGTGCTC TAATTAGTGA AGCGAGTGCA GTCGTTGTGC CTACATGGCC TATCAAATCT
GATTCAGAAT TACAAAGCAA TATTGGAACA CTTCTAGCAT CCTTAAAACA AAAGCAATGG
GTAGCCACTT ATGACTCATA TGGAGGAAAT GAAGAGCCTA TTGACTTTAT AACTAACCAA
TTAAGAAAGC TTGGACAAAA AGAAGCATTT AAACCACTTC GAGTTCGTGA TGAACCAAAC
AAAAGTGTTT ACCAACAATT TGAAGAAGCT GGTACAGATT TAGGGCAAAT TCTTACTAGA
AAGAAAAATT TAGCTGCTAC TAAGAGCCTT GATGGAGACT TAAATAAAGC GCTAGGTTGC
CTAAGCGGTG GACTTTACAT TGTTACAGCA AAAGACAATG AAGGTGCTGA TAGTAGAGAC
GGTGCGATGG TTGCAAGTTG GGTTAGTCAA GCAAGTTTTG ACCCCCCTGG AATAACCGTA
GCGGTAGCTA AAGACAGAGC AATTGAATCA CTATTACAAG TCAATGACCG TTTTGTTCTG
AATATCCTTC AAGAAAATAA CTATTTGCAC CTCTTCAGAC ACTTTTTAAA ACGTTTTCCA
CCAGGTGCTA ATCGATTTAA AGGAGTTGAA TTAATGAATG ATCTTGCAGC TGGGGGACCA
GTTCTATCTG ATGCATTAGC GTTCCTGTCA TGTAAAGTTA TTCAAAGAAT GGAGACAACA
GATCATTGGA TCATTTATTC ATCAGTTGAA AAAGGTAATT TATCTAATAC TCAAAGTAAA
ACAGCAGTTC ATCACAGAAG AGTTGGTAGT AATTATTAA
 
Protein sequence
MIFTKSTALE NQKSDTPPKL SLQYEQIATD THTLRSLDWD RSRFDIEFGL RNGTTYNSFL 
IRGKKTALID TSHLKFKDIW FEKLRQEINP TDIDYLIVSH TEPDHSGLIK YLIELNPNIE
IVASKVAIKF LEDQIHQPFK SRAVKSGEEL NLEFNSISGI EHRIEFISAP NLHWPDTIFS
FDHGTQVIYT CDAFGLHYCS EKLYDENPSL LNEDFRFYYD CLMGPNARSV VQALKKIDSL
STIQTIGVGH GPILNFNTQL WLNHYREWSK QKSTGENYAV VCYLSQYGFC DRLSQAIAHG
IGKANAQVQL VDLIASDTQE LSALISEASA VVVPTWPIKS DSELQSNIGT LLASLKQKQW
VATYDSYGGN EEPIDFITNQ LRKLGQKEAF KPLRVRDEPN KSVYQQFEEA GTDLGQILTR
KKNLAATKSL DGDLNKALGC LSGGLYIVTA KDNEGADSRD GAMVASWVSQ ASFDPPGITV
AVAKDRAIES LLQVNDRFVL NILQENNYLH LFRHFLKRFP PGANRFKGVE LMNDLAAGGP
VLSDALAFLS CKVIQRMETT DHWIIYSSVE KGNLSNTQSK TAVHHRRVGS NY