Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07811 |
Symbol | |
ID | 4781014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 712984 |
End bp | 714219 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640084056 |
Product | insulinase family protein |
Protein accession | YP_001014604 |
Protein GI | 124025488 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.375644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0630683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATAA TTCTGGACAA ACTAGATAGC AAAAATATAA TGTCAGCCAA GCTATGGATA GAGGATGGCA GTAGGAATGA TCCAAAAGAT AAAAAAGGAA TTCATCAACT CTTAAGCTCA ACAATGCTTA GAGGTTGTGG GCCATACAAT AATAAGGAAA TCGCTGAAAT TGTAGAAAAT TGTGGTGCAA ATTTAAACTG TGATACATAC GAAGATGGTC TTTTAATAAG TCTTAAATGT GTCGAAACTG ATGCGTATAA ACTTCTTCCC TTAATTGGTT GGATGATTAC AAAACCTATA CTTCAAATAG ATCAGTTTGA ATTAGAAAAA GATCTAACAA TAAAAGCCAT TAAAAGACAA AAAGAGAGTA CATATCAACT AGCTTTTGAT GGCTGGAGAA AGATGGTATA TGGAGATGGA CCATATGGAC ATGATCCACT GGGATCAATC GATGATATAA ATAAAATCAA TAAAGAACAT ATATTGCCAA TCGCAAGCTC ATTAATTCAC AGAAAAAAGA ACTTAGTAAT TTCAGGAAAG TTTCCAATTA ACCTAAAAAA TTATATAGAG AACACAATTG AATTCAAGGG AATTAGTAAT CATAATAAAG CATTTAAAAA TATTAATAAA ATAGAAACTC CAAGCGAACA GAGAAGTAGT ATTTGTACTC GTTCATTGAA TACAAAGCAA GTCATTCTGC TTCTTGGTAA AGCAACAATT AGATATGATA ATAAATCTGA TATTTTGCTT AGATTATTAT CTTGTTACTT AGGTTATGGA ATGTCAAGTT TATTATTTAA GGTTCTCAGG GAAAAGTATG GAGTAGTTTA TGAAGCAGGC ATTTATCATC CTATTAGAGA GCAGCAAACG CCCTTTATTA TGCACGCTTC AACAAGTGAA GAAAAAGGGA TCATTACTCT TCAATTACTT AAAGAGTGTT GGGAGAAAGT TATCAATAGT GAAATCTCTC CTGATGAATT AGAACTTGTA AAAATAAAAT ATCGAGGTCA AATGGCTCAT TCTTTGCAGA GTATTAGTCA AAGAGCTGAA CATAAAGCTC ATCTTTTAGG AATTGGGCTA ACCAAGGATC ACGATGAAGA AATTCTGCAA AGACTCGAAA GTATAACTAG TAAAGAAATC AAGGATGCTG CAAATAGATA TTTAAAGAAC CCATTACTAA GCGTATGCAG TAACAAAGAA GTTATTCGAA AAATCTTTAA AGACTGGAAA GCTTAA
|
Protein sequence | MNIILDKLDS KNIMSAKLWI EDGSRNDPKD KKGIHQLLSS TMLRGCGPYN NKEIAEIVEN CGANLNCDTY EDGLLISLKC VETDAYKLLP LIGWMITKPI LQIDQFELEK DLTIKAIKRQ KESTYQLAFD GWRKMVYGDG PYGHDPLGSI DDINKINKEH ILPIASSLIH RKKNLVISGK FPINLKNYIE NTIEFKGISN HNKAFKNINK IETPSEQRSS ICTRSLNTKQ VILLLGKATI RYDNKSDILL RLLSCYLGYG MSSLLFKVLR EKYGVVYEAG IYHPIREQQT PFIMHASTSE EKGIITLQLL KECWEKVINS EISPDELELV KIKYRGQMAH SLQSISQRAE HKAHLLGIGL TKDHDEEILQ RLESITSKEI KDAANRYLKN PLLSVCSNKE VIRKIFKDWK A
|
| |