Gene A9601_18891 details

Gene Information

Locus tag: A9601_18891
Symbol:
ID: 4718627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1623866
End bp: 1625179
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 34%
IMG OID: 640079623
Product: ATP-dependent DNA ligase
Protein accession: YP_001010279
Protein GI: 123969421
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID:
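
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1623866
end_bp = 1625179

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1314
print(gene_length % 3)  # 0, so the CDS is a whole number of codons
print(protein_length)   # 437
```

Under these assumptions the computed values agree with the listed Gene Length (1314 bp) and Protein Length (437 aa).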


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.986725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTAAAC AAGAAATAAT ACATCAATTA GAATTACACC CCAGTAGATT AGATAAAGAA 
AAAATCATTT TAGAAGCAAT GGAAGAAGGT CTAGATGATT TTTTTGAAGG TATACGTATG
GCACTTGATC CATTGGTAAC TTTTGGTGTA AAAATTGTCC CTGAGAAAGA GACTGAAAAA
AGTAAAAATT TTTTATGGGA AGATTTTAGA AAATTAGCCA ATAAGCTTAT TCAAAGAGAA
CTTACTGGTC ATGCTGCTCG TGATGCAATT CTTAAGGCTA TGGAATCTGC AACAAAAGAA
GAGTGGAATG GATTTTATAG ACGAGTTTTA ATTAAAGATC TTAGATGTGG TGTATCTGAA
AAAACAATCA ACAAGATAGC AAAGAAATTT CCCAAATATG CTATTCCTAT TTTTTCTTGT
CCTTTAGCTC ATGACAGTGC AAATCATGAA AAAAAAATGA TAGGAAAAAA GCAAATTGAA
ATCAAATTAG ATGGTGTACG CGTCTTAACT ATAATTAGAC AAAATAAAGT AGAAATGTTT
TCTCGTAATG GGAAACAATT CCACAATTTT GGTCATATTA TCTCGGAACT AGAAAACGCC
TTAAAAGAAG ACCCAGCACC TTATGACTTA GTACTCGATG GTGAAGTGAT GAGCTCTAAC
TTTCAAGATT TAATGAAACA GGTACATAGA AAAGATGGCA AACAAACCAA AGACGCAGTT
CTCCACTTAT TTGACTTATG TCCCCTGGAA AACTTTCAAA AAGGGAGATG GAATACTAGT
CAAACAAAAA GAAGTTTATT AGTAAAAGAA TGGGTAGCAA AACATTCTAT GCTTCTAAAA
CATATACAAA CACTTGAATG GGAAAATGTA GATCTCGACA CTATTGAAGG ACAAAAAAGA
TTTGTAGAGC TGAATAAATC TGCTGTAGAA GGTGGGTATG AAGGAGTAAT GATTAAAGAT
CCTGATGCTA TGTATGAATG TAAAAGAACA CACAGTTGGT TAAAAGCAAA ACCTTTTATT
GAAGTTACTT TAAAAGTTAT ATCGGTTGAG GAAGGTACAG GTCGCAACAA AGGAAGACTG
GGAGCAATCC TGGTAGAAGG AGAAGATGAT GGGTATGAAT ACAGTCTTAG TTGCGGAAGC
GGATTTAGTG ATATCCAACG TGAAGAATAT TGGTCAAAAC GTAATCATCT CGTTGGTCAA
CTTGTAGAAA TCAGAGCTGA TGCAAAAACC AAGTCAAAGG ATGCAGTTAC CTTTAGTCTT
AGATTTCCTA GATTTAAATG CTTTAGAGGA TTTAAAGAAG GAGAAAAAGT TTAA
 
Protein sequence
MFKQEIIHQL ELHPSRLDKE KIILEAMEEG LDDFFEGIRM ALDPLVTFGV KIVPEKETEK 
SKNFLWEDFR KLANKLIQRE LTGHAARDAI LKAMESATKE EWNGFYRRVL IKDLRCGVSE
KTINKIAKKF PKYAIPIFSC PLAHDSANHE KKMIGKKQIE IKLDGVRVLT IIRQNKVEMF
SRNGKQFHNF GHIISELENA LKEDPAPYDL VLDGEVMSSN FQDLMKQVHR KDGKQTKDAV
LHLFDLCPLE NFQKGRWNTS QTKRSLLVKE WVAKHSMLLK HIQTLEWENV DLDTIEGQKR
FVELNKSAVE GGYEGVMIKD PDAMYECKRT HSWLKAKPFI EVTLKVISVE EGTGRNKGRL
GAILVEGEDD GYEYSLSCGS GFSDIQREEY WSKRNHLVGQ LVEIRADAKT KSKDAVTFSL
RFPRFKCFRG FKEGEKV
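
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and permits alternative start codons such as TTG, which is read as methionine (M) when it initiates the CDS. The helper names `translate` and `gc_fraction` are hypothetical, written for this illustration, and the start-codon set is deliberately limited to three well-known initiators rather than the full list for table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a partial set of initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove the spacing used in the 10-base display blocks."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading an initiator first codon as M, up to the stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
first_bases = "TTGTTTAAAC AAGAAATAAT ACATCAATTA"
print(translate(first_bases))  # MFKQEIIHQL
```

The output matches the first ten residues of the listed protein sequence. Passing the full 1314 bp gene sequence to `gc_fraction` would give the value to compare against the reported GC content.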