Gene A9601_07361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_07361 
Symbol 
ID4717441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp655068 
End bp656258 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content31% 
IMG OID640078450 
Producthypothetical protein 
Protein accessionYP_001009129 
Protein GI123968271 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGCT TACCCGCGAA TAATCCAGAT TGGTTAGTAA AAAAAATAAT AAAAATGGGT 
GGGACTATAA GTTTTTATGA CTTTATGAAT TTTGCATTAA ATGATCCTAT TAATGGTTAT
TACGGCAGCG GAAAAGCTGA GTTAGGCGTT CGAGGAGATT TTGTCACATC ACCATCTTTA
TCTGATGACT TTGCTTTTTT AGTTGGTAAA CAAATAGAAG ATTGGTTGAT TCAGTTCAAA
AGTAGTTTTT TATCTAATGA GACATTATCT GTAACTGAAT TTGGAGCTGG AGATGGAAGC
TTTATGAGTG GATTAATTAA ATACTTTTTA GAAAACAGCA AGAATTTTTT AGAAGGTATT
TCTTTTGTAA TTATTGAACC TAATGAAGGG ATGGTAGAAA AACAAAAAAA TAAATTGGAG
GAATTTTTGA ACTTAGGTAT TGATATTTTA TGGAAAGGTT TGGATGAAGT AGAGGAAAAT
AATATAAATG GAATAGTTCT AGCAAATGAA GTTTTGGATG CTTTGCCAGT AGAAAGAATA
ACCTTCTCAA AGGGAAAACT AATTCGACAA GCAGTTTCTA TAGACAAAAA ATCTCATAAA
TTATTTTTTG ATAAAATGCC AATTACACGT GAATTGGAAA AAAGTTTTGA ACTTGCTAAA
AGTGAGTTGG GAATAACTAT TCCGCCTGAA GATGCTCTTG AAGGATGGAC GACAGAATGG
CATGTAGATA ACTCAAAATG GTTAGAAGCT ATTTATGGGA AAATCAATAA TGGTATTTTA
TTGATAATTG ATTACGCTAA AGAAGCTAAA AAATACTATA ACTCTAAGAA TTCTGATGGG
ACTATAGTTT CATATGAAAA TCAAAAAATG AGGAATAATG TCCTAGATTC TCCTGGAAAT
TGCGATTTAA CATCTCATGT ATGCATAGAA ACTTTAATTA ATGATGCTGA GACTCTTGGA
TTTGATACTG TTGGAATAAC AAAACAAGGA GAGGCTTTAT TGGCGCTTGG ATTGGCTGAG
AGACTTTATG GGATTCAGAA AGAATTTAAG GAGAATTTAT CAAATGCTCT TTTAAGAAGA
GAGGCATTAC TTAGACTAGT AGATCCTGTT TGTTTAGGTG ATTTTAAGTG GTTTGTTTTT
AAAAAGTTTA ATGAGAAGAA AATAAATATA AATTCAACCT GTTTGCGTTA A
 
Protein sequence
MNSLPANNPD WLVKKIIKMG GTISFYDFMN FALNDPINGY YGSGKAELGV RGDFVTSPSL 
SDDFAFLVGK QIEDWLIQFK SSFLSNETLS VTEFGAGDGS FMSGLIKYFL ENSKNFLEGI
SFVIIEPNEG MVEKQKNKLE EFLNLGIDIL WKGLDEVEEN NINGIVLANE VLDALPVERI
TFSKGKLIRQ AVSIDKKSHK LFFDKMPITR ELEKSFELAK SELGITIPPE DALEGWTTEW
HVDNSKWLEA IYGKINNGIL LIIDYAKEAK KYYNSKNSDG TIVSYENQKM RNNVLDSPGN
CDLTSHVCIE TLINDAETLG FDTVGITKQG EALLALGLAE RLYGIQKEFK ENLSNALLRR
EALLRLVDPV CLGDFKWFVF KKFNEKKINI NSTCLR