Gene NATL1_07101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07101 
Symbol 
ID4780085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp653007 
End bp654020 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content36% 
IMG OID640083984 
Producthypothetical protein 
Protein accessionYP_001014533 
Protein GI124025417 
COG category[S] Function unknown 
COG ID[COG2138] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.129916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.330282 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTTCCC CTGCATCAAT TAACTATAAA TATCCATCTA ATATTGGAAT TTTGTTATGC 
GGACATGGTA GTAGAGATCC CCAAGCAGTA AAAGAGTTTA TAAATGTAGT AAATAAAATA
AAATCTAGAA TACCTGATAT CCCAGTTGAA TTCGGTTTTC TAGAATTTAA TCGACCAATA
ATTAGTGAGG CCCTAGATCA GCTTAGGGAT TTGGGAGTTG AGAGAGTGAT TGCTTTACCA
GCTATGTTAT TTGCTGCCGG GCATACTAAA AATGATATCC CTGCGGTTTT GAATAAATAC
TCCGCTGATA ATGGACTTCT AATTCAATAT GGCAGGGAGC TTGGTTTGAA TTCTTTGATG
ATTGGAGCAG CAGGAGCAAG AATCAAAGAA ACAATTGATA GTAATCCAAT ATTTCCTCTT
CATGAAACAT TACTTGTCGT CGCAGGTAGG GGATCGTCCG ATCCAGATGC TAATTCAAAT
GTATGTAAGA TTACAAGGAT GCTTGTTGAG GGATTTGGTT TTGGATGGGG AGAAACTGTT
TTTTCAGGAG TAACATTTCC CCTTGTTGAT CCTGGCTTGA GACATGCTCT CAAATTAGGT
TTTAAAAGAG TAATTCTCTT ACCTTATTTT CTTTTTTCTG GAGTTTTAGT CAGTCGCGTT
AGAGAACATT CTACGAGAGT TGCAAATGAT AATCCTGATG TGAAGTTTCT AAACGCAAGT
TACTTATCAG ACCAAGATTT AGTCATTGAT ACTTTTATGG AAAGAATTCA AGAAGTTTTT
GATGGTGAGA ATTTTATGAA CTGTGCTTTA TGTAAATATC GTTCTAATTT ATTAGGTTTT
GAAAGCGAGG TTGGATATGA GCAGATCAGT CATCATGATC ATGTTGAAGG TTGTCTAGAC
ATTCGCCGAG AAAACAAAGA GCATAATCAC GAGCATGAAC ATTTTCCTTA TCCACATGCA
AAGCATCCTT TAGGACCTGT CACGCTTCCC TCTTTAAACA AAAGCCAAAT CTAA
 
Protein sequence
MISPASINYK YPSNIGILLC GHGSRDPQAV KEFINVVNKI KSRIPDIPVE FGFLEFNRPI 
ISEALDQLRD LGVERVIALP AMLFAAGHTK NDIPAVLNKY SADNGLLIQY GRELGLNSLM
IGAAGARIKE TIDSNPIFPL HETLLVVAGR GSSDPDANSN VCKITRMLVE GFGFGWGETV
FSGVTFPLVD PGLRHALKLG FKRVILLPYF LFSGVLVSRV REHSTRVAND NPDVKFLNAS
YLSDQDLVID TFMERIQEVF DGENFMNCAL CKYRSNLLGF ESEVGYEQIS HHDHVEGCLD
IRRENKEHNH EHEHFPYPHA KHPLGPVTLP SLNKSQI