Gene A9601_00441 details


Gene Information

Locus tag: A9601_00441
Symbol:
ID: 4716726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 44316
End bp: 46091
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 35%
IMG OID: 640077741
Product: flavoprotein
Protein accession: YP_001008439
Protein GI: 123967581
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0426] Uncharacterized flavoproteins; [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family
TIGRFAM ID:
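
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 44316
end_bp = 46091
gene_length_bp = 1776
protein_length_aa = 591


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks evaluate to True for the values listed in this record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 46091 - 44316 + 1 gives 1776 bp, and 1776 / 3 - 1 gives 591 residues, which agrees with the reported lengths.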


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGCCT CTGCCCAGAC AAGTAATTCT AAATTGGCAC CAAATAATAG CAAGTTGACG 
GTTCAATCTC AAAATTTTGC TGATGATTCT TGTGCCATAA GATCTTTGGA TTGGGATCGT
AGTAGATTTG ATATTGAATT TGGTTTAAGA AATGGAACTA CTTACAATAG TTTTATTATT
AAAGGCGAGA AATTAGCAAT TATTGATACT AGTCACGCAA AGTTCGAAGA ATTATGGTTT
GAAGAATTAC TGAAAAAGGT AAATCCGCAA GAAGTTGATT ATTTAATTAC TAGCCATACA
GAACCTGATC ATTCTGGTTT AATAGGTAAT CTTTTAGAAT TAAATCAAAA TATCACAGTA
GTTGGATCAA AATTAGCACT TAAATTTATT GAAGACCAAA TACATATTCC CTTTAAACGT
CTAGAGGTCA AGAGTGGAGA GTTTTTAAAT CTCGGAACTA ATCCTAATAG TGGTTTACAA
CATAATATTG AATTTATAAG TGCACCAAAT TTACATTGGC CAGATACCAT ATTTTCATAT
GATCACAGCA CAAATGTTCT CTATACATGC GATGCATTTG GTCTCCATTA TTGTTCTGAC
GAATTTTTTG ACACTGATCA AAAAGAAATA TACGATGATT TCCGTTTTTA TTACGATTGC
CTTATGGGTC CAAACGCTAG AAGCGTTATG CAGGCAATTA AAAGAATAGA TAAGCTACCT
AAAGTAAAAA CAATAGCCGT TGGTCATGGG CCTTTGCTCC ATAATCAGGT CAATTTTTGG
AAAGGAAAAT ATCTAGAATG GAGTAGTAAT AAAAGCAAAG GTAATGATTT TGTGTCAGTC
TGCTACATAA GCGACTATGG TTATTGTGAT CGACTCAGTC AAGCAATATC TCATGGAATA
AGCAAAGCAG ATGCACAGGT TCAATTAATT GATTTAAGAT CTTCTGACCC GCAAGAATTA
ACAAGTTTAA TTTCCGAATC AAAAGCAGTA GTCATTCCCA CATGGCCAGT AGACTCAGAT
AATGAATTAA AAGAATCTCT CGGTACTTTA TTTGCAGCAC TAAAACCAAA ACAATTTACT
GCAGTTTATG ATGCATTTGG TGGAAATGAT GAACCAATAG ATTCCTTAGC AAATAAATTA
AGAGAACTTG GTCAAAAAGA AGCTTTCTCT CCATTAAGAG TTAAAAATAT CCCAGATCCC
ATTGTTTATC AACAATTCGA AGAAGCTGGA ACTGACTTGG GTCAATTGAT CAATAAAAAG
AAGAATATTG CCTCTATGAA GAGCCTTGAT TCAAATTTAG ATAAAGCATT AGGGAGATTA
AGTGGAGGAT TATATGTAGT TACAGCGAGC CAAGGCGAAG GTTCTACATT CAGACAAAGT
GCAATGGTCG CAAGTTGGGT TAGTCAAGCA AGCTTTTCTC CACCAGGTAT TACAGTTGCA
GTAGCAAAAG ATAGAGCTAT TGAATCATAT ATGCAGGTTG GGAAAGGTTT TGTTGTGAAT
ATTTTAAGGG AAGATAACTA TCAAAAAATG TTCAGACATT TTTTAAAAAG ATTTGCCCCT
GGAGCTGATA GATTCGCAGA TGTAGATGTA ATTAGCAACA TCGCTGAAGG AGGACCAGTT
CTTTCAGATT CACTCGCCTT TTTAGATTGT AAAGTTAGTT CCAGGCTAGA GACTCCAGAC
CATTGGATAA TTTACGGAAT TGTTGAAAAT GGTAATGTTT CTGACTTATC ATGCAAGACA
GCAGTTCATC ACAGAAAAGT TGCTAATCAC TATTAG
 
Protein sequence
MIASAQTSNS KLAPNNSKLT VQSQNFADDS CAIRSLDWDR SRFDIEFGLR NGTTYNSFII 
KGEKLAIIDT SHAKFEELWF EELLKKVNPQ EVDYLITSHT EPDHSGLIGN LLELNQNITV
VGSKLALKFI EDQIHIPFKR LEVKSGEFLN LGTNPNSGLQ HNIEFISAPN LHWPDTIFSY
DHSTNVLYTC DAFGLHYCSD EFFDTDQKEI YDDFRFYYDC LMGPNARSVM QAIKRIDKLP
KVKTIAVGHG PLLHNQVNFW KGKYLEWSSN KSKGNDFVSV CYISDYGYCD RLSQAISHGI
SKADAQVQLI DLRSSDPQEL TSLISESKAV VIPTWPVDSD NELKESLGTL FAALKPKQFT
AVYDAFGGND EPIDSLANKL RELGQKEAFS PLRVKNIPDP IVYQQFEEAG TDLGQLINKK
KNIASMKSLD SNLDKALGRL SGGLYVVTAS QGEGSTFRQS AMVASWVSQA SFSPPGITVA
VAKDRAIESY MQVGKGFVVN ILREDNYQKM FRHFLKRFAP GADRFADVDV ISNIAEGGPV
LSDSLAFLDC KVSSRLETPD HWIIYGIVEN GNVSDLSCKT AVHHRKVANH Y
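
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given in the coding orientation, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares for sense codons (the two tables differ in which codons may act as starts). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 36 bases of the gene sequence listed above.
first_bases = "ATGATAGCCT CTGCCCAGAC AAGTAATTCT"
last_bases = "GCAGTTCATC ACAGAAAAGT TGCTAATCAC TATTAG"

n_terminus = translate(first_bases)   # matches the start of the protein sequence
c_terminus = translate(last_bases)    # matches the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be compared against the nucleotide record.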