Gene A9601_05801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05801 
Symbol 
ID4717279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp505073 
End bp506242 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content31% 
IMG OID640078292 
Productphage integrase family protein 
Protein accessionYP_001008973 
Protein GI123968115 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0844338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTAA TTCAGGAAAT TAATAATGTC AATGATAAAT TTGCTACTCA AGGAAGCAAG 
CTTAAAATTG AGAAGAGAGG AGAGAAATTA AATATTCGTG GTTCACTACC CTCCAAAGAA
GATAACAATA ACTTTAAAAT TCAAAGAATA TCTCTTGGTT TAAACGCTGA TATTTCTGGA
TTAGAGGAGG CCAAAAAAAA ATTACAATTA ATCAATTTGC AATTGGAATT GAATCAATTT
GATTGGATTA ACTGGATTGG CAACCCTTAT AAAAAGCAAA TAAAAGATGG TTCTGAATTC
CCAAATAGAT TAAATCAATT TGAAGAATTT TTTTTTAAAG AAAACAAAAG TGATTTTCGA
ACCAGCACTA GAAAAACTAC TTGGAAAAGT TCTTACAAAC CATATATGAA AAGAATCCTA
AATATTTACA ATGATTATGA AAATGAATCT TTAGAAAGAA TATTTCAAAA AACACTTGAA
AGTTACAAGG AAGGTAGCAG AAGTAGGAAA CAATGCGCTA CTTCTCTTAG TGTTTTGGCT
AAGTTTTTGG AAATTAAACT ACCAGAGGAT TGGAAATTAA ATTCTAGAGG ATATGGTCTG
AACAAAGCAG GATTTAGGGA TCTGCCTAAA GACGAGTTAA TTGTGAAACT TTGGGAGACA
ATCCCAAACA AATCTTGGAA ATTTGTTTTT GGTTTGATGG CTACATACGG ATTAAGGAAT
CATGAAGTAT TTTTTTGTGA TTTAAGTTCT CTAACTAATT TTGGGGACAA AATTATTAGA
GTTTTACCTA CTACTAAAAC TGGGGAGCAT CAAGTTTGGC CATTTCATCC TGAATGGGTT
GAAAAGTTCG AATTATCAAA ACTTGGTGAA AATCCAGAAC TACTACCAAA TATTAATACA
GACCTTAAAA TTACAACTTT ACAAAATATT GGAAAAAAAA TTACAGATCA GTTTAAGCGT
TACTCTTTAC AAATAAAACC TTATGATCTA AGGCATGCAT GGGCAGTAAG AACAATTTTT
TATGATTTGC CTGATACTGT TGCTGCCAGA ATGATGGGAC ATTCGGTTAG TTTACATACT
CAAACTTATC ATCACTGGAT TACTAAAAGA GATCAACAAC AGGCAGTAAA TAATGCACTT
TTAAAAGTTA AAAGAGCTAA AAATATTTAA
 
Protein sequence
MNLIQEINNV NDKFATQGSK LKIEKRGEKL NIRGSLPSKE DNNNFKIQRI SLGLNADISG 
LEEAKKKLQL INLQLELNQF DWINWIGNPY KKQIKDGSEF PNRLNQFEEF FFKENKSDFR
TSTRKTTWKS SYKPYMKRIL NIYNDYENES LERIFQKTLE SYKEGSRSRK QCATSLSVLA
KFLEIKLPED WKLNSRGYGL NKAGFRDLPK DELIVKLWET IPNKSWKFVF GLMATYGLRN
HEVFFCDLSS LTNFGDKIIR VLPTTKTGEH QVWPFHPEWV EKFELSKLGE NPELLPNINT
DLKITTLQNI GKKITDQFKR YSLQIKPYDL RHAWAVRTIF YDLPDTVAAR MMGHSVSLHT
QTYHHWITKR DQQQAVNNAL LKVKRAKNI