Gene A9601_06501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_06501 
Symbol 
ID4717352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp572575 
End bp574116 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content35% 
IMG OID640078363 
ProductNAD(P)H-quinone oxidoreductase subunit 4 
Protein accessionYP_001009043 
Protein GI123968185 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTAG AATCTTTTCC TTGGCTATCA TCCATTATTT TACTGCCTTT AGTTGGGGCA 
TTAATAATGC CTTTTTTGAG TTCAAAAGAA GGAGAAGATA ATACACTCCC TAGAAATATC
TCATTAAGTT TTTTATTTAT AGATTTTTTA CTAATAATAG GCGTCCTTTT TCAAAAATTT
GATACTTCAG ATAGCTCATT GCAAATGGTA GAAAGAGCCT CCTGGTTACC TTCAATAGGC
TTAGAGTGGT CTCTTGGCGT AGATGGATTA TCTGCTCCTT TAGTAGCTTT GAGTGGGTTA
ATTACATTTT TATCAGCTGC TGCAAGTTGG AAAATTAAGA AAAAATCTAA TCTATATTTT
GCTCTTTTAT TAGTTCAAGC ATCAGCACAG GCACTAGTTT TCCTTTCTCA AGATTTCCTA
TTGTTTTTCT TAGCATGGGA ACTTGAATTA GTTCCGGTAT ATCTTCTTAT TGCGATTTGG
GGAGGTAAAA AGAAATTATA TGCGGCCACT AAATTCATTC TTTATACAGC TTTAGCTTCT
TTATTAATAC TCATAAGCGG GTTAGCACTA GCCTTGAGTG GTGATACCTT TACTCTAAAT
ATTACCGATT TAACCAATAA ACATGTAACA GGGAGCCTAG CTTTATTATC TTATTTAGGA
TTTTTAATTG GTTTTGGAGT AAAACTTCCT ATCTTTCCAC TACATACTTG GTTACCCGAT
GCACATGGAG AGGCCAATGC ACCAGTTTCA ATGTTACTTG CTGGAATACT CTTAAAAATG
GGAGGATATG CCCTCTTAAG ATTTAACGTT CAAATACTAC CTGAAGTACA TCTTCAAATT
GCACCTGCGT TAATTATTCT TGGAATCATT AATATAATTT ACGGAGCCTT AAATGCATTC
GCACAAGATA ATGTTAAAAG GAGAATCGCA TGTAGCTCAG TGAGTCATAT GGGTTTTGTT
CTATTAGGTA TTGGAGCAGT AGATGCTCTA GGAATTAGCG GAGCAATGCT ACAAATGATC
AGTCACGGAC TTATTGCTGC AGCTATGTTC TTTGTTACGG GCTCATTCTA TGAAAGAACA
AATACTCTTT CCATACCAAA TATGGGCGGT TTAGCAAAGG TCTTGCCAAT AACTTTTGCT
TTTTTCTTAG CAAGCTCACT GGCCTCTCTA GCACTACCAG GGATGAGCGG TTTTATAAGT
GAAATAACTG TATTTTTAGG TATCACTAGT CAAGAGGGAT TTAGCTCTAT CTTTAGATCA
ATCACGATTC TAATTGCAGC CATAGGATTA GTTCTAACAC CAATATATCT ACTATCAATG
TGTAGAAGAG TATTTTTTGG CCCTAGAATT CCAGCACTAG CTACAGTTAA AGAGATGAAT
GGTAGAGAAT TGACTATTGG TTTCAGTTTA TTGTTGCCTA CATTGGTAAT AGGTTTTTGG
CCCAAAATCG CTATTAATTT ATACGAATCT TCAACCAATG CTCTCAGTCA GCAGCTTACC
TTAGCTAAAT TGATTGGAAT AATACCCACT TTAGTTAATT AA
 
Protein sequence
MNLESFPWLS SIILLPLVGA LIMPFLSSKE GEDNTLPRNI SLSFLFIDFL LIIGVLFQKF 
DTSDSSLQMV ERASWLPSIG LEWSLGVDGL SAPLVALSGL ITFLSAAASW KIKKKSNLYF
ALLLVQASAQ ALVFLSQDFL LFFLAWELEL VPVYLLIAIW GGKKKLYAAT KFILYTALAS
LLILISGLAL ALSGDTFTLN ITDLTNKHVT GSLALLSYLG FLIGFGVKLP IFPLHTWLPD
AHGEANAPVS MLLAGILLKM GGYALLRFNV QILPEVHLQI APALIILGII NIIYGALNAF
AQDNVKRRIA CSSVSHMGFV LLGIGAVDAL GISGAMLQMI SHGLIAAAMF FVTGSFYERT
NTLSIPNMGG LAKVLPITFA FFLASSLASL ALPGMSGFIS EITVFLGITS QEGFSSIFRS
ITILIAAIGL VLTPIYLLSM CRRVFFGPRI PALATVKEMN GRELTIGFSL LLPTLVIGFW
PKIAINLYES STNALSQQLT LAKLIGIIPT LVN