Gene A9601_02761 details


Gene Information

Locus tag: A9601_02761
Symbol:
ID: 4716961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 252780
End bp: 254681
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 28%
IMG OID: 640077976
Product: hypothetical protein
Protein accession: YP_001008671
Protein GI: 123967813
COG category:
COG ID:
TIGRFAM ID:
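
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(252780, 254681)
print(gene_len)                           # 1902
print(expected_protein_length(gene_len))  # 633
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.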


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0749184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAATTA ATAAAAATAT CAATAATCAA AATAGCAAAT TTATTTGGAT AAATACTTTT 
GAAGAGACTT TATCAAGTTC ATCCGAAATT TCTGTTGAAA ATACGTCCTA TGTAAATTTA
TCAAAAGAAA ATCTTGATTT ATTTTTAATG GCTAAAGCCG AAAAGCAAGA AGAGTTAATA
ATACAATCTG ATAAACAGTC TGAAATAAAT GATGTTATTT ATGCAGAGGG AAATGTATAC
GTCTCTTATA GAGGCAAACT GCTTAAGGCA GATACTTTGA TTTACGATAA GTTAAATAAA
AAATTTAGTG CTAAAGGAAA TATATCTTTG GAATTAGGGG ATCAATTCTT CAATGTTGCC
CAACTAGAAT ACAGCTTTAT AACTAAAAAG GGTTATTTAT TAGATGTCAA GGGATCTATT
AATACTAATA CTTTGATGGA TGACTTATCC TCAAATTTCA GTGTTTCAGA TATTAAGAAG
GTAGACCGTT TACTAAAAAT AAAAAAAAAT GAAGTTTTGT ATACCCCCAA TAAGGTTGAG
AATTGGTTGT TTAACACAGA TAAGATGACT ATAGATGGGG GGACATGGAA AAGTAAGAAG
GCTTTTTTTA GTAATGATTT ACTAGATTCA AATCAATTAA AATTAGCAGT TAATTCATTA
GAGGTTTATC CTCGAGATGA AAAATTACGA TTTAAGTCTT CATTAAATTA TTTAATACTT
GACGAGAAAG TATCAATTCC TTTTTGGTTA GGTGATAGAG CTTTTGACTT TAATAATTCT
AAGATTAATA ACAGATGGAA TTTAGGATAT GAAAATTTAG ATAAAGATGG TTATTTTATT
GGCAGAAAAT TAAACTCTAT AGCAATAAAC AATAATTTTA TCCTCGATCT AGAGCCGCAA
TTCTTAATTC AACGCTCATT AAAAGGATAT ACAAAAAGTT TTGTTAAAAA GGATGAATCA
ATAACTTCAG AGAAGGTCAG GAGAAATACA AGTTTTGAAG ACTATTTAGC TCTAAGATCT
CAGATAAAAG GCACGATAAA AAATTGGTAT ATAGAAATAG ACAAGAATTT AAATTCTCTT
GATTTTGATA AATTTTCAGA TGCATTTAGA TTTAAAACTG AATTGAGTAA AGAAATTGAT
CTATTAGATT CAAAATGGGA AAAAAGTTTT TATGGAGTTT ATAGAGAGAG GTTCTGGAAT
GGTTCATTGG GTGAAGCAGA AATCTACTCA GGTTATGGTT CAAAATTGCA AAAAGAAAAT
AATTGGATTA CTGATGGCAT TAAAAAGTCA GAATTTTTAT CTTTCGATCT AGCCAACATA
ACAGCTGAGG CTTTAAATAG TAAAAATCTG GTAACTAACC TAAAAGGTAA TTTGTTTTAT
TCTCTTGATC AGAATTTTCC AATTAGTATT GTAAATCCAA AAAAGAAATC TATTGATATT
TCATATAAGT ACATTCCTGA ACCAATCACA AAAGGATTAA GTCTTAATAC AAGATTAGAA
GCATCATATT CTTTCTATGA AACTGGAGAT CATCAAGAAT ATTTAGGGCT AGGTATAGGC
CCAGAATTAA TATTTGGTAA TTTTAAAAAC AAAACTTTTG ACTATACTCG TATAAGTCTT
TTACCTTTCT ATAAATTTAA TAGTGGCGAA AGCGTTTTTA AGTTTGATCA GAATTATGAG
AACTTTACCT TAAATATTAC TTATGATCAG CAATTATATG GACCAATTAT TCTTAAAAGT
TTTGGAATTT TAAACTTAAC AAACGATTCA AATGATTATG GTGAATTTAT TGACTCTAAA
ATTTCTTTAA ATTGGAAAAA AAGATCCTAT GAAGTTGGTA TTTTTTATCA ACCTCATAAT
CAAGCTGGAG GTATTTCTTT TAGTCTCTCT GGATTTAAAT AG
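
The GC content field can be recomputed from the sequence as listed. The following is a minimal sketch, not the method used by the source database: it strips the spaces and line breaks used for display and reports the percentage of G and C bases. The helper name `gc_percent` is hypothetical, and only the first display line of the gene sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First display line of the gene sequence (60 bases), not the full gene.
first_line = "GTGAAAATTA ATAAAAATAT CAATAATCAA AATAGCAAAT TTATTTGGAT AAATACTTTT"
print(round(gc_percent(first_line), 1))  # 15.0
```

Passing the full 1902 bp sequence to the same function gives a value that can be compared with the reported GC content of 28%.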
 
Protein sequence
MKINKNINNQ NSKFIWINTF EETLSSSSEI SVENTSYVNL SKENLDLFLM AKAEKQEELI 
IQSDKQSEIN DVIYAEGNVY VSYRGKLLKA DTLIYDKLNK KFSAKGNISL ELGDQFFNVA
QLEYSFITKK GYLLDVKGSI NTNTLMDDLS SNFSVSDIKK VDRLLKIKKN EVLYTPNKVE
NWLFNTDKMT IDGGTWKSKK AFFSNDLLDS NQLKLAVNSL EVYPRDEKLR FKSSLNYLIL
DEKVSIPFWL GDRAFDFNNS KINNRWNLGY ENLDKDGYFI GRKLNSIAIN NNFILDLEPQ
FLIQRSLKGY TKSFVKKDES ITSEKVRRNT SFEDYLALRS QIKGTIKNWY IEIDKNLNSL
DFDKFSDAFR FKTELSKEID LLDSKWEKSF YGVYRERFWN GSLGEAEIYS GYGSKLQKEN
NWITDGIKKS EFLSFDLANI TAEALNSKNL VTNLKGNLFY SLDQNFPISI VNPKKKSIDI
SYKYIPEPIT KGLSLNTRLE ASYSFYETGD HQEYLGLGIG PELIFGNFKN KTFDYTRISL
LPFYKFNSGE SVFKFDQNYE NFTLNITYDQ QLYGPIILKS FGILNLTNDS NDYGEFIDSK
ISLNWKKRSY EVGIFYQPHN QAGGISFSLS GFK
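
The protein sequence can be reproduced from the gene sequence with translation table 11, the table named in the Gene Information fields. The sketch below is a plain-Python illustration rather than the database's own pipeline. It builds the codon table from the standard amino acid ordering, reads an alternative start codon such as GTG as methionine in the first position, and stops at the first stop codon. The start codon set is the one listed for NCBI table 11 as far as I recall it and should be treated as an assumption; the function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring display whitespace and stopping at the first stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate_cds("GTGAAAATTA ATAAAAATAT CAATAATCAA"))  # MKINKNINNQ
print(translate_cds("GGATTTAAAT AG"))                      # GFK
```

The two fragments match the first ten and last three residues of the protein sequence, including the GTG start codon being read as M.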