Gene NATL1_05501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05501 
Symbol 
ID4780324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp497581 
End bp499110 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content37% 
IMG OID640083827 
Productcarboxypeptidase Taq (M32) metallopeptidase 
Protein accessionYP_001014377 
Protein GI124025261 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.30292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAAAGT CTGCTTGGCA GCTTTTGGGT GATTACCTAA AAGATACGCA GTTGTTGGGA 
TCTATACAAA GCACTCTCTA CTGGGATCAA AATACATCTA TGCCTATTGC TGGTTCCAAT
TGGAGAGGAG AGCAATTAAG TCTTTTAGCT AAGCAACTTC ATGCAAGACA AAGTTCTGAA
CAGTTCGAAA TTTTAATAAA AGAAGCTAAA TCTGAACTTC AAAAATCAAA AGAAAAAGAT
GATTTTGAAT CACAACTCAT CACAGATAGA TTTAGAAATA TTGATTTGCT TGAGCAAGAT
TTCAATAGAC AGAAAAGTTT GGATCCTCAA TTGGTCGTTG AGCTCGCAAC AGCAAAGTCT
GAAGGGTATA TGTGTTGGCA GGAAGCTAGG AAAAATAATG ATTTCAAAAG CTTTTCTCCA
GCTCTTAAGA AATTAATTGC ATTACGAACA GAACAATCCA ATCAGCTCTG TGAAGAAAGA
AGTTGCTGGG AGACACTTGC CCAGCCTTTT GAACCGAATT TAACGATTGA TCGTGTAAGC
GAACTATTTG AACCTTTACA AAAGAGATTG CCAGAATTGA TTCAGAAGGC TGAGACAATT
ACCAATAAAA AGAGTGAAAA ATGGGATTTA GCAATTAGTG ATCAAGAAAA ACTCTGTCAA
ATACTTTTAA ATGATTGGTC TAGGGATCCT GCTAATACAG CGATAGCTAA GTCCCCTCAT
CCATTCTCTA TAACTTTAGG TCCGGATGAT TATCGAATTA CGACTCGAAT AGTTAAAGGT
CAGCCCCTTT CTTGCTTATT AGCTACTGCC CATGAGTGGG GTCATTCTCT TTATGAACAA
GGTTTGCCTT CTAAAAGTCA CCAATGGTTT GCATGGCCGT TAGGTCAAGC AACCTCTATG
GCTGTTCATG AGAGTCAATC TCTATTTTGG GAAAATAGGA TTGCTAGGAG CTTTTCATTT
GCAAAGTCTT TTTGGCATCA TTTTGAGAAT GCAGGTGCTC CAATTCACTC TGGAGATGAT
TTATGGATCA ATCTAAATCC ATTTACTCCG GGATTGAACC GAGTAGAGGC TGATGAACTC
AGTTATGGCT TGCACATAAT GATTAGGACT GAATTGGAAA TTGATCTTCT CGAGAGAGGC
CTTTCTGTGG AAGATCTGCC TAATGAATGG AATAAAAGGT ATTTGAACCT TTTAGGTGTG
TCGCCTAAAA ATGATACTGA AGGATGTTTG CAAGATGTGC ACTGGAGTGA GGGGATGTTT
GGTTATTTCC CTTCTTATTT GCTTGGTCAT CTTATTAGCG CTCAGTTGAC AAAAACTCTT
GAAGAAGATT TAGGGAAAAT TGAAAATCTT ATTGAATCTA CGGAAATCAG TAAAATATTG
GGTTGGCTTC GCAAAAATGT TCATCATTAT GGGAGAAGTT TAGATTCTGA GGAACTTGTA
AGGAAGGTCT CTGGAGCAAA ATTATCACCA ACTTATTTTC TTGAATACTT AGATAATAAA
CTTGAAAAGC TGTCTACAAT CTCTAAGTAA
 
Protein sequence
MSKSAWQLLG DYLKDTQLLG SIQSTLYWDQ NTSMPIAGSN WRGEQLSLLA KQLHARQSSE 
QFEILIKEAK SELQKSKEKD DFESQLITDR FRNIDLLEQD FNRQKSLDPQ LVVELATAKS
EGYMCWQEAR KNNDFKSFSP ALKKLIALRT EQSNQLCEER SCWETLAQPF EPNLTIDRVS
ELFEPLQKRL PELIQKAETI TNKKSEKWDL AISDQEKLCQ ILLNDWSRDP ANTAIAKSPH
PFSITLGPDD YRITTRIVKG QPLSCLLATA HEWGHSLYEQ GLPSKSHQWF AWPLGQATSM
AVHESQSLFW ENRIARSFSF AKSFWHHFEN AGAPIHSGDD LWINLNPFTP GLNRVEADEL
SYGLHIMIRT ELEIDLLERG LSVEDLPNEW NKRYLNLLGV SPKNDTEGCL QDVHWSEGMF
GYFPSYLLGH LISAQLTKTL EEDLGKIENL IESTEISKIL GWLRKNVHHY GRSLDSEELV
RKVSGAKLSP TYFLEYLDNK LEKLSTISK