Gene NATL1_07811 details


Gene Information

Locus tag: NATL1_07811
Symbol:
ID: 4781014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 712984
End bp: 714219
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 32%
IMG OID: 640084056
Product: insulinase family protein
Protein accession: YP_001014604
Protein GI: 124025488
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
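
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene span that includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(712984, 714219)
print(length, protein_length(length))
```

Under those assumptions the result agrees with the listed values of 1236 bp and 411 aa.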


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.375644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0630683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATAA TTCTGGACAA ACTAGATAGC AAAAATATAA TGTCAGCCAA GCTATGGATA 
GAGGATGGCA GTAGGAATGA TCCAAAAGAT AAAAAAGGAA TTCATCAACT CTTAAGCTCA
ACAATGCTTA GAGGTTGTGG GCCATACAAT AATAAGGAAA TCGCTGAAAT TGTAGAAAAT
TGTGGTGCAA ATTTAAACTG TGATACATAC GAAGATGGTC TTTTAATAAG TCTTAAATGT
GTCGAAACTG ATGCGTATAA ACTTCTTCCC TTAATTGGTT GGATGATTAC AAAACCTATA
CTTCAAATAG ATCAGTTTGA ATTAGAAAAA GATCTAACAA TAAAAGCCAT TAAAAGACAA
AAAGAGAGTA CATATCAACT AGCTTTTGAT GGCTGGAGAA AGATGGTATA TGGAGATGGA
CCATATGGAC ATGATCCACT GGGATCAATC GATGATATAA ATAAAATCAA TAAAGAACAT
ATATTGCCAA TCGCAAGCTC ATTAATTCAC AGAAAAAAGA ACTTAGTAAT TTCAGGAAAG
TTTCCAATTA ACCTAAAAAA TTATATAGAG AACACAATTG AATTCAAGGG AATTAGTAAT
CATAATAAAG CATTTAAAAA TATTAATAAA ATAGAAACTC CAAGCGAACA GAGAAGTAGT
ATTTGTACTC GTTCATTGAA TACAAAGCAA GTCATTCTGC TTCTTGGTAA AGCAACAATT
AGATATGATA ATAAATCTGA TATTTTGCTT AGATTATTAT CTTGTTACTT AGGTTATGGA
ATGTCAAGTT TATTATTTAA GGTTCTCAGG GAAAAGTATG GAGTAGTTTA TGAAGCAGGC
ATTTATCATC CTATTAGAGA GCAGCAAACG CCCTTTATTA TGCACGCTTC AACAAGTGAA
GAAAAAGGGA TCATTACTCT TCAATTACTT AAAGAGTGTT GGGAGAAAGT TATCAATAGT
GAAATCTCTC CTGATGAATT AGAACTTGTA AAAATAAAAT ATCGAGGTCA AATGGCTCAT
TCTTTGCAGA GTATTAGTCA AAGAGCTGAA CATAAAGCTC ATCTTTTAGG AATTGGGCTA
ACCAAGGATC ACGATGAAGA AATTCTGCAA AGACTCGAAA GTATAACTAG TAAAGAAATC
AAGGATGCTG CAAATAGATA TTTAAAGAAC CCATTACTAA GCGTATGCAG TAACAAAGAA
GTTATTCGAA AAATCTTTAA AGACTGGAAA GCTTAA
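
The GC content listed in the gene information can be recomputed from a sequence such as the one above. The record does not say how the value was derived, so this minimal sketch assumes the plain fraction of G and C bases over the sequence length. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces used for display grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display line of the gene sequence (60 bases); paste the full
# sequence in its place to check the whole gene.
first_line = "ATGAACATAA TTCTGGACAA ACTAGATAGC AAAAATATAA TGTCAGCCAA GCTATGGATA"
print(f"{gc_content(first_line):.1%}")
```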
 
Protein sequence
MNIILDKLDS KNIMSAKLWI EDGSRNDPKD KKGIHQLLSS TMLRGCGPYN NKEIAEIVEN 
CGANLNCDTY EDGLLISLKC VETDAYKLLP LIGWMITKPI LQIDQFELEK DLTIKAIKRQ
KESTYQLAFD GWRKMVYGDG PYGHDPLGSI DDINKINKEH ILPIASSLIH RKKNLVISGK
FPINLKNYIE NTIEFKGISN HNKAFKNINK IETPSEQRSS ICTRSLNTKQ VILLLGKATI
RYDNKSDILL RLLSCYLGYG MSSLLFKVLR EKYGVVYEAG IYHPIREQQT PFIMHASTSE
EKGIITLQLL KECWEKVINS EISPDELELV KIKYRGQMAH SLQSISQRAE HKAHLLGIGL
TKDHDEEILQ RLESITSKEI KDAANRYLKN PLLSVCSNKE VIRKIFKDWK A
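
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in the start codons it permits, and this sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the gene sequence.
print(translate("ATGAACATAA TTCTGGACAA ACTAGATAGC"))
```

The ten codons shown translate to MNIILDKLDS, the first block of the protein sequence. Passing the full gene sequence instead allows the whole protein to be compared.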