Gene NATL1_00561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00561 
Symbol 
ID4779997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp57836 
End bp59338 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content37% 
IMG OID640083319 
Productflavoprotein 
Protein accessionYP_001013885 
Protein GI124024769 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTGA AATATAAAAA TTTAACGGTT ATTTGTTCTA ATCCAGGAGC AAAATTATTC 
AAAGAGATTT GGAATTTACG AAAACCCTCG CAAAATACAA ATCCTAAAGA AGCATTGGAG
ACAGTTGAAG TTCTTCCAAA TATACAAATC ATCAAACAAT TAGAAACTCA TACACTTAAC
AGTAATTTTG AAGTTACGTA CATTCCCGCG CCAACAGCTC GCTGGCCTGG TGGACTAATT
GTTTTTGAAA AGCAAACTGG TTTATTGATG AGTGATAAAT TATTCGGTGC TCATGTTTAT
GAAGAAAAAT GGGCTGAATT AACCAGTAGT AGCACGGAAG AAGAGAGAAG ACATTACTTC
GATTGTCTAA TGGCGCCAAT GTCTACCCAA GTCAATAGTA TTATCGAAAA ATTTGAAGAC
TTTGAGATTG ATACGATAGT ACCCGGACAT GGACCTGCAA TCAGCGGTAG TTGGAGGAGT
TTATTAAACA ACTACCAAAG CTGGGGAGAA AGCCAAAAAT ACAGCAACTT AAGAGTTGCT
CTATTATTTG CAAGTGCATA TGGAAATACT GCTGCTATTG CTGATGCCAT TGCTAGAGGA
ATTAGTAAAA CAGGGGTCAA CGTTAAGATT ATTAATTGTG AATTCACCGC ATCAGATAGC
TTAGTCACTG AAATTCGTAA AGCAGACGGA TATTTAATTG GATCGCCAAC ATTAGGAGGG
CATGCACCCA CCCCGATTGT TTCAGCACTT GGCTCGCTTT TGGCTGAGGG AGATAGAGGA
AAGCCGGCTG GAGTATTTGG AAGTTATGGA TGGAGTGGGG AAGCTCTTGA TTTGCTTGAA
AAAAAATTAA AAGATGGAGG TTTTAAATTT GGATTCGAAC CTATCAAAAT TAAATTTAGT
CCTGACCCTT TAATGATTAA AAAACTTGAA GAAACAGGTA TCCAATTTGG TAAGCAATTA
ATTAATGCAA AATTACGTCA ACAAAGAAAG GCTAATGTAG GTTTAAATAC AAGTAAAAGT
GATCCAACAA TTAATGCACT CGGAAGGGTC GTCGGATCAC TATGTATATT GACTGCTCAG
AAAGGAGATG AAGATAATCT GATTAGCGGA GCTATGGTTG CAAGTTGGGT TAGTCAAGCA
AGCTTTTCTC CTCCTGGTAT TACTATTGCA GTCGCTAAAG AAAGAGCTGT AGAAAACTTA
CTTCATACAG GAGATAACTT TGCTCTAAAC ATTTTAGAGC AAAATAATCA CCAAAGCCTC
CTTAAACAAT TTCTCCAATC ATTCAAACCT GGAGATAATA GATTTACCAA TCTTGAGATT
AAATTAAGTC CAAGCAATCA GCCATTATTA AACGAAGCTT TAGCCTGGCT GGAGGGTACA
GTTAGTCAAC GAATGGAGTG TGGGGATCAT TGGCTGATAT ATGCTGAGAT TAAATATGGA
AAAGTCATTA AAAAAGATGG AGTAACAGCA GTTCATCATC GAAAAACCGG AGCGAACTAC
TAG
 
Protein sequence
MNVKYKNLTV ICSNPGAKLF KEIWNLRKPS QNTNPKEALE TVEVLPNIQI IKQLETHTLN 
SNFEVTYIPA PTARWPGGLI VFEKQTGLLM SDKLFGAHVY EEKWAELTSS STEEERRHYF
DCLMAPMSTQ VNSIIEKFED FEIDTIVPGH GPAISGSWRS LLNNYQSWGE SQKYSNLRVA
LLFASAYGNT AAIADAIARG ISKTGVNVKI INCEFTASDS LVTEIRKADG YLIGSPTLGG
HAPTPIVSAL GSLLAEGDRG KPAGVFGSYG WSGEALDLLE KKLKDGGFKF GFEPIKIKFS
PDPLMIKKLE ETGIQFGKQL INAKLRQQRK ANVGLNTSKS DPTINALGRV VGSLCILTAQ
KGDEDNLISG AMVASWVSQA SFSPPGITIA VAKERAVENL LHTGDNFALN ILEQNNHQSL
LKQFLQSFKP GDNRFTNLEI KLSPSNQPLL NEALAWLEGT VSQRMECGDH WLIYAEIKYG
KVIKKDGVTA VHHRKTGANY