Gene NATL1_21201 details


Gene Information

Locus tag: NATL1_21201
Symbol:
ID: 4780956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1776515
End bp: 1778464
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 41%
IMG OID: 640085417
Product: metallo-beta-lactamase superfamily hydrolase
Protein accession: YP_001015940
Protein GI: 124026825
COG category: [R] General function prediction only
COG ID: [COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily
TIGRFAM ID: [TIGR00649] conserved hypothetical protein
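
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1776515
end_bp = 1778464
gene_length_bp = 1950
protein_length_aa = 649


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Both checks agree with the record: 1778464 - 1776515 + 1 = 1950 bp, and 1950 / 3 - 1 = 649 aa.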


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCAA GTTCAATGAA TTCTCAAGGT TCAAAAAATA ATCAAACCAA ATCTCCTTGT 
TTGAGGATTA TTCCCTTAGG CGGCCTCGGA GAAATCGGTA AAAATACTTG CGTCTTTGAA
TACGGAAACG ACATCATGAT TCTTGATGCT GGTCTTGCTT TCCCAACTGA TGGGATGCAT
GGAGTAAATG TTGTCATGCC AGACACTACT TATCTACGAG AAAATCAAAA TCGAATAAGA
GGTTTGGTGG TTACTCATGG GCATGAGGAT CATATTGGTG GAATTTCTCA TCATTTAAAG
AACTTTAATA TTCCAATTGT TTACGGCCCT CCCCTAGCTA TGTCTATGCT GAGGGGGAAA
ATGGAGGAAG CTGGAGTTGC TGATCGGACA ACAATACAAG TATGTGGACC AAGAGAGGTT
ATTAAAGTTG GGCAACATTT TTCAGTTGAA TTTGTAAGAA ATACGCATTC AATTTCTGAT
AGTTATTCCT TTGCAGTTAC GACTCCAGTG GGAGTAGTAT TTTTCACTGG AGATTTTAAG
TTCGACCATA CCCCACCTGA CGGACAGCCT GCAGATTTAG CCAGGATGGC TCACTATGGA
GACAAAGGTG TTCTTTGCCT TCTATCTGAT TCAACCAACT CTGAAGTTAC AGGATTCACG
CCCTCTGAAT ATTCTGTTTT TCCAAATTTA GATAGATACA TTGCTACTGC TGAAGGCAGA
GTTATGGTTA CTACTTTTGC TAGTTCAACT CATCGTGTGG CGATGCTTTT GGAGCTAGCA
ATGAAAAACG GTAGAAAGGT TGGCTTGCTA GGTCGTTCAA TGCTGAATGT GGTTGGTAAG
GCAAGGGAAC TTGGTTATAT GCGTTTTCCT GATGATTTAT TTTTTCCAAT TAAACAGATC
AGAGATTTGC CTGATAGAGA GACTTTTTTA TTGATGACAG GTAGTCAAGG AGAATCAATG
GCTGCTTTAA GCCGTATTGC TAGAGGCGAG CATCAACATG TTCAATTGAA AACTAGTGAT
ACTGTTATTT TTTCTGCTAG CCCTATTCCT GGGAACACTA TTTCTGTGAT GCATACGATT
GATAAATTAA TCAAACTGGG TGCAAAAGTA ATTTATGGCA AAGATAAGGG GATTCATGTT
TCAGGCCATG GATGCCAAGA AGATCAAAAA TGGATGCTTG GATTAACAAG ACCTAAATAC
TTCATTCCTG TTCATGGTGA GTACAGAATG CAAGTTTTAC ATGGGAAAAC TGCTGTCTCA
ATGGGGGTGC ATCCAGAGAA TGTATTGGTG ATGGAAAATG GGGACGTTGC TGAATTAAGA
CCAGATTCTC TATTGCAAGG CTCACCAGTT AAATCTGGAG TGGAATTACT CGATTCTTCT
AGAACGGGGA TTGTGGATAC TCGAGTATTA AAAGAACGTC AGCAACTTGC TGATGATGGA
GTTATCACCG TTCTTACTCC AATTAGTACT GATGGGGTGA TGGTTGCACC ACCTAGGGTA
AATCTTCGTG GTGTTATCAC TAATGTTGAT GCCAAAACCA TGGTTAATTG GACTGAAAGA
GAGATTAATT GGGTCTTAGA AAATCGATGG AAACAGCTCG TTCTTAAGAC TGGGGGAAAA
TCAGTTGAGG TCGATTGGAT AGGATTGCAA AGAGAGGTTG AATCAGGATT ATCTCGTCGG
TTGAGAAGAG AGGTTCAAGC GGAGCCTTTA GTTCTTTGCC TTGTTCAACC TGCTCCAGGA
GGAACTCGCG CATATAAACC TCAACTTGAT CAACAGCAAG ATTCACGACA AGTAGTGAAG
AAAACTACTG ATAAAGCCCC TAAGACGACA AAGGCATCCG TAGCAAATCA AGAGACAAGT
TCACCAGCAG AACAAAAAAC AAATAAGGAA CCTAATGCTG AGGAAATGCC CACAGGAAGA
ACAAGACGTC GCAGATCAGC AATTAGCTGA
 
Protein sequence
MTSSSMNSQG SKNNQTKSPC LRIIPLGGLG EIGKNTCVFE YGNDIMILDA GLAFPTDGMH 
GVNVVMPDTT YLRENQNRIR GLVVTHGHED HIGGISHHLK NFNIPIVYGP PLAMSMLRGK
MEEAGVADRT TIQVCGPREV IKVGQHFSVE FVRNTHSISD SYSFAVTTPV GVVFFTGDFK
FDHTPPDGQP ADLARMAHYG DKGVLCLLSD STNSEVTGFT PSEYSVFPNL DRYIATAEGR
VMVTTFASST HRVAMLLELA MKNGRKVGLL GRSMLNVVGK ARELGYMRFP DDLFFPIKQI
RDLPDRETFL LMTGSQGESM AALSRIARGE HQHVQLKTSD TVIFSASPIP GNTISVMHTI
DKLIKLGAKV IYGKDKGIHV SGHGCQEDQK WMLGLTRPKY FIPVHGEYRM QVLHGKTAVS
MGVHPENVLV MENGDVAELR PDSLLQGSPV KSGVELLDSS RTGIVDTRVL KERQQLADDG
VITVLTPIST DGVMVAPPRV NLRGVITNVD AKTMVNWTER EINWVLENRW KQLVLKTGGK
SVEVDWIGLQ REVESGLSRR LRREVQAEPL VLCLVQPAPG GTRAYKPQLD QQQDSRQVVK
KTTDKAPKTT KASVANQETS SPAEQKTNKE PNAEEMPTGR TRRRRSAIS
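
The sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before any computation. The following is a rough sketch of how the gene sequence could be stripped of formatting, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built by hand; it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used to print 10-residue blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACATCAA GTTCAATGAA TTCTCAAGGT TCAAAAAATA ATCAAACCAA ATCTCCTTGT"
print(translate(first_line))  # prints: MTSSSMNSQGSKNNQTKSPC
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field in the record.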