Gene A9601_14051 details


Gene Information

Locus tag: A9601_14051
Symbol:
ID: 4718126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1176105
End bp: 1177313
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 32%
IMG OID: 640079126
Product: hypothetical protein
Protein accession: YP_001009796
Protein GI: 123968938
COG category:
COG ID:
TIGRFAM ID:
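
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1176105, 1177313)
print(length)                     # 1209
print(protein_length_aa(length))  # 402
```

Under those assumptions, the computed values match the listed Gene Length (1209 bp) and Protein Length (402 aa).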


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTTA AATGTAGACA TTGTGGAAAA ACTCTAAATC ATGAAGTTAT TGATTTGGGC 
AACCAGCCTC CAAGTAATGC ATATTTGGAT AAAAATCAAA TATTGAGACC TGAAATTACG
TATCCCTTAA AAGTCTATGC ATGTGAACAT TGTTGGTTAA TTCAACTTCC TGAACATGCC
ACATCAGAGG AATTGTTCAC GCCAGATTAT GCTTATTTTT CAAGTACATC AACTAGTTGG
TGTGCTCATG CTGAGAAATT TGTAAATGAA GCTGTCTCAG ATCATAATCT TTCGCAAAAA
AGTTTTGTTG TTGAATTAGC AAGCAACGAT GGATATTTAT TGCAATATTT GAAATCTAGG
GGAATACCTT GTTTAGGTGT TGAACCAACT AGAGCTGCTG CTGAAATCGC AAGATCTAAG
GGGATTAATA CAATAGAAAG CTTCTTTGGC TTAGAAATGG CTAAGGATAT GGATAAGGCA
GATCTTGTTA TAGCTAATAA TGTTTTAGCT CATGTACCTG ATATTAACGA TTTCATGGGG
GGTATACATG AAGTTTTAAA GCCAAAAGGA AAAGCATCAA TTGAATTTCC CCACTTATTA
AGAATGATAA AGGGGAAACA ATTCGATACT ATTTATCATG AACATTTTAG TTATCTAAGT
CTTAGAACAG TTCAAAGAAT AGCTTCATCT GTTGGACTGG AAATATTCAA AGTTTCCGAA
TTAAGTACTC ATGGAGGTAG TTTAAGGGTA TGGCTTTCCA AAAAAAATAA TTTTGAGATT
GATTCTTCTG TTGAGAGAAT ACTAAATTTA GAGGTTCAAG AGGAATTAGA GTCTCTAAAA
ATTTTTGAAG AATTTCGCGC GAGCGCCTTA AAAGCTAAAT ATCAATTCTT AGATTTTCTT
ATAAAAGCTA AAAATAAAAA TAAAAAAATA ATGGCATATG GAGCCGCTGC AAAAGGTAAT
ACTTTTTTAA ATTTTGCTGG AATAAAATCT GATTTAATAT CTTTGGTAGC AGATAAATCT
ATTAGTAAAC AAAATAAGTT TATGCCTGGT AGTTTGATAC CAATTGTTTC ACCTAAAACT
CTATTGAATG AGAAACCTGA TTCAATAATA GTTTTACCTT GGAATATTAT TTCAGAAATC
AGATCTCAAT TAAAAAACCA TCAGCTTGTT ACTGCTATTC CGAATTTAAA GGTTTGGAAT
AATTTATAA
 
Protein sequence
MNLKCRHCGK TLNHEVIDLG NQPPSNAYLD KNQILRPEIT YPLKVYACEH CWLIQLPEHA 
TSEELFTPDY AYFSSTSTSW CAHAEKFVNE AVSDHNLSQK SFVVELASND GYLLQYLKSR
GIPCLGVEPT RAAAEIARSK GINTIESFFG LEMAKDMDKA DLVIANNVLA HVPDINDFMG
GIHEVLKPKG KASIEFPHLL RMIKGKQFDT IYHEHFSYLS LRTVQRIASS VGLEIFKVSE
LSTHGGSLRV WLSKKNNFEI DSSVERILNL EVQEELESLK IFEEFRASAL KAKYQFLDFL
IKAKNKNKKI MAYGAAAKGN TFLNFAGIKS DLISLVADKS ISKQNKFMPG SLIPIVSPKT
LLNEKPDSII VLPWNIISEI RSQLKNHQLV TAIPNLKVWN NL
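
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI base ordering (T, C, A, G); for sense and stop codons, translation table 11 uses the same assignments as the standard code, so one table is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from the first base.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAATCTTA AATGTAGACA TTGTGGAAAA"))  # MNLKCRHCGK
print(translate("AATTTATAA"))                         # NL*
```

The two fragments reproduce the first ten residues and the final two residues of the protein sequence, with the trailing `TAA` read as the stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content listed in the record.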