Gene NATL1_05811 details

Gene Information

Locus tag: NATL1_05811
Symbol:
ID: 4780009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 526414
End bp: 527574
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 34%
IMG OID: 640083858
Product: phage integrase family protein
Protein accession: YP_001014408
Protein GI: 124025292
COG category:
COG ID:
TIGRFAM ID:
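
The start, end, gene length and protein length fields in this record can be cross-checked against one another. The sketch below assumes 1-based inclusive coordinates and a CDS whose length includes a single stop codon; both are assumptions about the record rather than something it states, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(526414, 527574)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.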


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGA ATCAAAAGTT ACTAGATATC AACCAAGACC TTGAATCAAA GGGAATCAAT 
CTAAGAATTG AGAAAAGGGG TAAAGTTTTA AACATTCGTG GTTCTTTGCC AGATAAGAAA
TCTCATGATC TTTCTAAAGT TCAAAGAATA AGTCTGAAAC TTCCACACGA CATTAACGGT
CTAGAAGAAG CTAGAAAAGC TATAGAATTG ATAGATTTTC AACTTAAAAA AAATCAATTT
TGTTGGTCTA ATTGGATTAA AGAGAAAGCT CTCTCATCAA CAAAGACTAA TAAAACTGTA
ATAAGCAATG AAATAGAAAG CTTTAAAAGA CAATTTTTTT CTGATACATC CAGGAGCAAA
TCATCAGCCG GAATGATCAG CACTTGGCAG TCTGCTTACA AACCATATTT GAACAGATTA
ATTGGAGTAA GTCATAAATC TACTCTCAAA TTAAGCGAGG AGCTCCTAGT GAAAATCCTT
TTAAGTTACA AAGAAAATTC AAGGAGCAGA CAACAATGTG GAATTGCTTT AAGTGCTTTA
GCTAGACACC TTAAAGTAGA GCTACCCAAG AACTGGAAAC AACTTCAAAG TGGTTATGGA
ATACACGAAT CAAATTTCAG AGAGTTACCT AGTGATAAAG AAATTATTAA TAGCTTTCAA
TTAATACCAA ATCCAAAATG GAGATTTGTT TTTGCGTTAA TGGCAATTTA TGGGCTTAGA
AATCATGAAG TCTTTTTTAG TGACTTATCT TGTTTAAAAA AAGGTGGGGA TAAAATACTC
CGGGTTTTCC CAAATACGAA AACAGGAGAA CATCAAGTTT GGCCATTTCA TCCTGAATGG
GTTGGTTTAT TTGAGCTAGG GAACATAACT GATACTTCAG ATTTACTCCC AGATATTAAA
ACAGATCTAA AAGAGACAAC TCTTCAACAT ATAGGAAGAA GAGTATCTGA GCAATTTAGA
AGATATGAAA TATCTTTTAC CCCCTATGAT TTAAGGCATG CATGGGCAGT TAGGACCATC
TTAATAGGCC TACCAAATAC TGTAGCTGCA AAAATGATGG GACACTCAGT GTCAATACAT
ACAAAAACCT ATCATCATTG GATAACAAGA AGAGATCAGC AAATTGCAGT CGATAGTGCT
CTATCTAGAG TCAAATATTA A
 
Protein sequence
MDENQKLLDI NQDLESKGIN LRIEKRGKVL NIRGSLPDKK SHDLSKVQRI SLKLPHDING 
LEEARKAIEL IDFQLKKNQF CWSNWIKEKA LSSTKTNKTV ISNEIESFKR QFFSDTSRSK
SSAGMISTWQ SAYKPYLNRL IGVSHKSTLK LSEELLVKIL LSYKENSRSR QQCGIALSAL
ARHLKVELPK NWKQLQSGYG IHESNFRELP SDKEIINSFQ LIPNPKWRFV FALMAIYGLR
NHEVFFSDLS CLKKGGDKIL RVFPNTKTGE HQVWPFHPEW VGLFELGNIT DTSDLLPDIK
TDLKETTLQH IGRRVSEQFR RYEISFTPYD LRHAWAVRTI LIGLPNTVAA KMMGHSVSIH
TKTYHHWITR RDQQIAVDSA LSRVKY
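
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. A minimal sketch of that cleanup and of a GC-content calculation follows; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACGAGA ATCAAAAGTT ACTAGATATC AACCAAGACC TTGAATCAAA GGGAATCAAT"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` gives a string whose length can be compared with the Gene Length field, and `gc_percent` on that string can be compared with the GC content field.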