Gene NATL1_15691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15691 
Symbol 
ID4781044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1274772 
End bp1275617 
Gene Length846 bp 
Protein Length281 aa 
Translation table11 
GC content36% 
IMG OID640084851 
Producttransglutaminase-like superfamily protein 
Protein accessionYP_001015391 
Protein GI124026275 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.798972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.306347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AAATAATTCA TACATTAGTT TATACTTATG AAATACCTGT TTTACTTGAG 
GAGCATTTAA TTTGTCTTAA GCCAAGATCT AATAGCTTTC AGGAATTATC TAATTTTGAG
CTAAAAATCT TTCCGTCTCC AGATACTATT TTCCCTTTGC TTTCAGATAA TGGAGATGAT
ATTTTTAAAG TGGTTTTTAC TGGGTCTACA AATTCTTTAA CGATAAAGTC AGAAAGTGAT
ATAGAAACAA AAATCCACCC AGATTTATAT CAATTGAAAA ATGATTTTGA TTTAAGTTTG
CCTTTGAAGA TTGAATCTAA TAATTCATTA ATTGGATTTA CTCAAGGATG GTTCCCTAAC
GGTCAGCATG ATCCTTCCGC TATTAAAATT GCCCAAGAGG CTTTGGCAGG AAAAAATAAT
AATGTATTGG ATTTTTTATA CCAGTTGATT GAATTAATTA AAGATAGAGT TAAATATACC
CCTAGGCATA TTGGGCCAGC ATGGACTTCT GCAAGGACTC TGAGCGAGAG AGTTGGCTCT
TGTCGAGATT TGGCAATATT GTTAATGGAA GCTTGTAGAT GTGTTGGGAT ACCTAGTCGT
TTTGTTAGCG GATATCAATT CATGGATCAA GCTCCTGACA AGTATGAACT ACACGCATGG
ACAGAGGCCT TCATACCTGG TTTTGGATGG AGAGGTTTTG ACCCAAGTGG TTGCGGGCAG
ATTAATCATA ATTATGTTGC TTTGGCCTCC TCTTCAAAGT CTGAACTTGT ATCACCAGTA
AGAGGGAGCT TCGTAGGACC TTCTAACTTA AGAACTGAAC TAGAGTGGAA TATCGAGATT
ACTTAA
 
Protein sequence
MKKKIIHTLV YTYEIPVLLE EHLICLKPRS NSFQELSNFE LKIFPSPDTI FPLLSDNGDD 
IFKVVFTGST NSLTIKSESD IETKIHPDLY QLKNDFDLSL PLKIESNNSL IGFTQGWFPN
GQHDPSAIKI AQEALAGKNN NVLDFLYQLI ELIKDRVKYT PRHIGPAWTS ARTLSERVGS
CRDLAILLME ACRCVGIPSR FVSGYQFMDQ APDKYELHAW TEAFIPGFGW RGFDPSGCGQ
INHNYVALAS SSKSELVSPV RGSFVGPSNL RTELEWNIEI T