Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_00781 |
Symbol | dap2 |
ID | 4719955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 82068 |
End bp | 83996 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640079740 |
Product | esterase/lipase/thioesterase family protein |
Protein accession | YP_001010394 |
Protein GI | 123965313 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.149278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAATC ACAAAACTTT GCCCATTAAT AATGTTTTTA AGAAAAAGCC AGTTTTTAAT CAAATATCAT TTTGTAAAAA TATTATTTTT TGGATAGATA TAGTTAAAAA AGGAAAAACA TATAGCAACT CTATTTTTGC AAGACCATTT AATCAAGAAG AGGCAGGTAT TCAAAAATTA ACTGGAGATG AGTTTAATAT TAAAAGTCAT TTTCATGGTT ATGGAGGTAA ATCTTATCAA TGTATAGAGG TTGATGAAAA ATTATATTTA GTTTGGATAG ATGAGATTTC TGAATCACTT TGGATTCAAG TTTTTACAGT TAATAAATCA GAAACTATAA ATAATAATCA ATATCTTTTA TGTAAAAATA AACCAAGGCA ATTAGTGGAA TCATTAAAGG GTAATTTTGA TTCTTCTTTT GCGATAACTA ATCAAAACAT TTTACTTGGA TTATATGAGT TAAATAATAA GGACTATTTA TTCTCTGTTG ATATACGAAA AAGTAAACAA GAAATACGAA TATTGAGAGA GTTTGATGAT TTTGCTGGAA ATTTATCTTT AAGTAAAAAT GGAAAAAATT TGTCTTGGTT GGAATGGAAA ACACCATTTA TGCCTTGGGA AAAAAATGAA TTATTTTTCG CAGTAATTAA TCAAAATGGA GTATTGGATA AGATCAAAAA GTTTAAAAAT GATAGTATTA ACTTAAATAA AAATGTATCT TTTTTCCAAC CTTATTGGAT ACATGAGGAC ATCATCGTTT GTTCTGAAGA CAGCTCAGGA TGGTGGAATT TATTATTTAT AGATGTAAGT GATATTAACA ATATAGTTAT CAAAAAAAGA ATTTTAAAAG AGTTTTTTGA ATATGGAATT CCTCAATGGA TTTCTGGCTT GTCATTATTT TCTGGATCTT TTAATAATTT ATTTTGTTTA GCTAAAAATA AATGTTCTTG GGTATTGGAA CAATATATCG ATTTTTCATT AGTGAGAACA ATAGATTTAC CTTATGATTT ATTGAGAGAT TTACATGCTG TAGATGATAA CTTAATTTTA ATAGGTTCTA GTAATACTTG TAATGAAAGA TTATTGGAAC TGGAATGTAA TAATAAAAGG CTTATTAAGC TCTCTAAAAA GTCGTTTTTT TTATCTCAAA ATAATTGTTC AAAACCAGAA TCTTTTTGGT TCAAAGGTTT TAATAATCAA TCCACACATG CATTTATCTA TAAGCCACTT TATGAAAGAT TTGTAAACTC ACCATTAATC GTTAAAGCAC ATAGTGGGCC CACTTCTTGT TTTGATGGAT CTTTAAATTC GGAAGTTCAA TATTGGACTT CCAAAGGATT TATGGTTGCA GAACTTAATT ATGGTGGTTC CTCTGGTTTT GGTAGAGAAT ATAGAGAAAG ATTAAATTAT AAATGGGGAA TCCTTGATTC TTTTGACTGT AAAGCATTGG TACTCGATTT AATTAGATTA CATCTTGTTG ATAGTTCTAA AGTCGCAATC TTAGGTAATA GCGCTGGAGG TTTAACTGCA ATTAATGCTT TATGTGAAGG TGATCTTTTT AAAGTGGCAA TTTGCAAATA CCCTGTTATT GATTTAAATG ATATGCATCA TAAAACTCAT AGATTTGAGA AAGGGTATTT AAATTCTCTC ATAGGTGAAT ATTCAACTTG TCTTGAAAAG TATCAAATTC GATCACCAAT TAATAAAATA AATCAATTGA AAAAGCCAGT TTTATTATTT CATGGGAAGA AAGATTCAGT GATTTCATAT AAAAAAACAT CACAAATAAA AGATTTACTT ATCGGAAATA ATAAAAATTC AGAAGTTATA TTTTTTGATA ATGAAGGGCA TGGTTTTAAA AATTTAGATA ATAAGCAACA GGTTCTTATA AAAACTCAGA AATTTTTAGA GAGAACTTTA AATATTTAA
|
Protein sequence | MINHKTLPIN NVFKKKPVFN QISFCKNIIF WIDIVKKGKT YSNSIFARPF NQEEAGIQKL TGDEFNIKSH FHGYGGKSYQ CIEVDEKLYL VWIDEISESL WIQVFTVNKS ETINNNQYLL CKNKPRQLVE SLKGNFDSSF AITNQNILLG LYELNNKDYL FSVDIRKSKQ EIRILREFDD FAGNLSLSKN GKNLSWLEWK TPFMPWEKNE LFFAVINQNG VLDKIKKFKN DSINLNKNVS FFQPYWIHED IIVCSEDSSG WWNLLFIDVS DINNIVIKKR ILKEFFEYGI PQWISGLSLF SGSFNNLFCL AKNKCSWVLE QYIDFSLVRT IDLPYDLLRD LHAVDDNLIL IGSSNTCNER LLELECNNKR LIKLSKKSFF LSQNNCSKPE SFWFKGFNNQ STHAFIYKPL YERFVNSPLI VKAHSGPTSC FDGSLNSEVQ YWTSKGFMVA ELNYGGSSGF GREYRERLNY KWGILDSFDC KALVLDLIRL HLVDSSKVAI LGNSAGGLTA INALCEGDLF KVAICKYPVI DLNDMHHKTH RFEKGYLNSL IGEYSTCLEK YQIRSPINKI NQLKKPVLLF HGKKDSVISY KKTSQIKDLL IGNNKNSEVI FFDNEGHGFK NLDNKQQVLI KTQKFLERTL NI
|
| |