Gene A9601_03511 details

Gene Information

Locus tag: A9601_03511
Symbol:
ID: 4717040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 320956
End bp: 322395
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 36%
IMG OID: 640078055
Product: putative neutral invertase-like protein
Protein accession: YP_001008746
Protein GI: 123967888
COG category:
COG ID:
TIGRFAM ID:
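
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
print(span_length(320956, 322395))    # 1440, matches "Gene Length"
print(protein_length_from_cds(1440))  # 479, matches "Protein Length"
```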


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.268956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAAA GATTTAGTCA AAAAAATTTA AGAGTAAGAC CAAGTTCTGA TGAGGAAAAA 
ATTGTAACAA ATGCAAAAAA ACACTTCGAG AAGACTTTGG TTGAGATATC AGGCGAGTTA
GTGGGAAGCG TCGCTGCACT AGAACATCCA ACAAAAAATA AAAAATTAAA TTATGGAGAA
ATATTTTTAA GAGACAATGT TCCTGTAATG ATTTATCTCA TTACCCAAAA ACGTTACGAA
ATTGTCAAAA AGTTCCTAAG TGTATGCCTT GAGTTACAAA GCTCTAACTA CCAAACACGT
GGCGTATTTC CTACTAGTTT CGTTGAAGAA AATGGACAGC TCATTGGAGA CTATGGTCAG
AGATCAATAG GGAGGATTAC TTCAGCTGAT GCAAGTTTAT GGTGGCCCAT TTTATGTTGG
TATTATGTCA ATAAAAGCGG TGATTATGCC TTTGGAAAAA GTCAAAGCGT TCAAAGAGGT
ATTCAACTTC TACTAGATCT AGTTCTACAT CCAACATTTG AGGGTACTCC AGTACTTTTT
GTGCCAGATT GCGCATTTAT GATTGATAGA CCTATGGATG TATGGGGAGC ACCACTAGAA
GTTGAAGTTT TACTTCATGG ATGTTTAAAA AGTTGCATTA ACTTAATGGA ATTAAGTAGA
GCAGATCATG TTAGTAGACT TTTAGATCAA AGACTTATTC TTACAAATCA ATGGGTTAAG
GATTTAGGAA GTTTTCTTTT AAAGCATTAT TGGGTTACAA GCCAAACAAT GCAAATTTTA
AGAAGAAGGC CAACTGAGCA GTATGGTGAT GATCAGCACT TCAATGAATT TAATGTTCAA
CCTCAAGTGG TTCCCTCATG GCTACAAGAT TGGTTAGAGA ATAGAGGCGG TTACTTAATA
GGAAATATTA GGACAGGAAG GCCTGACTTT CGATTTTACA GTTTAGGCAA TTCTTTAGCA
TGTATGTTCG GAGTTCTTCC TCCTGAAGAA CAAAGAGCTT TATTTAGATT AGTTTTACAT
AACAGACAGC ATTTGATGGC TCAAATGCCT ATGAGAATTT GTCATCCTCA TATGGATGTA
GAGGAATGGC AAAATAAAAC TGGATCCGAT CCAAAGAATT GGCCTTGGAG TTACCATAAC
GGTGGTCATT GGCCAAGCTT ACTTTGGTTT TTTGGTACAG CTGTCCTATT ACATCAAAAA
CATTATGGTT CAGACGATGT GATCCTCATG GAAGAAATGA AATCTTTAAT AGAGGAATCA
TATTGGTGTC AACTTAATCA ATTGCCTAAG CAAGAATGGG CAGAATATTT TGATGGTCCT
ACAGGAACTT GGGTTGGACA ACAATCAAGA ACATATCAGA CTTGGACAAT TGTTGGATTT
TTATTAATGA ATCACTTTCT AAGGAATGAG TATAACGATT TAGATATGTT TAAGATTTGA
 
Protein sequence
MAERFSQKNL RVRPSSDEEK IVTNAKKHFE KTLVEISGEL VGSVAALEHP TKNKKLNYGE 
IFLRDNVPVM IYLITQKRYE IVKKFLSVCL ELQSSNYQTR GVFPTSFVEE NGQLIGDYGQ
RSIGRITSAD ASLWWPILCW YYVNKSGDYA FGKSQSVQRG IQLLLDLVLH PTFEGTPVLF
VPDCAFMIDR PMDVWGAPLE VEVLLHGCLK SCINLMELSR ADHVSRLLDQ RLILTNQWVK
DLGSFLLKHY WVTSQTMQIL RRRPTEQYGD DQHFNEFNVQ PQVVPSWLQD WLENRGGYLI
GNIRTGRPDF RFYSLGNSLA CMFGVLPPEE QRALFRLVLH NRQHLMAQMP MRICHPHMDV
EEWQNKTGSD PKNWPWSYHN GGHWPSLLWF FGTAVLLHQK HYGSDDVILM EEMKSLIEES
YWCQLNQLPK QEWAEYFDGP TGTWVGQQSR TYQTWTIVGF LLMNHFLRNE YNDLDMFKI
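
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard one, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons it permits.

```python
from itertools import product

# Standard codon table in the usual compact TCAG order.
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCAGAAA GATTTAGTCA AAAAAATTTA"))  # MAERFSQKNL
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the "GC content" field and the protein sequence listed in the record.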