Gene NATL1_18821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18821 
Symbol 
ID4780092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1539004 
End bp1540083 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content38% 
IMG OID640085171 
Productribosomal RNA large subunit methyltransferase N 
Protein accessionYP_001015702 
Protein GI124026587 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAAAAC TTCCTAAACT ATCAAGTAAT TCATCCTTGC TTGGCTTAAG TTCGGAAGAT 
CTTGAAGAAT TTGCTCGTCA GGAAGGTGAA AAGTCTTTTC GTGGTAGGCA AATTCATGAA
TGGATTTATC AAAGAGGGGC AAAAAGCTTA GATTCAATAA GTGTTCTTCC AAAAAAATGG
CGAGATTCAT TAGTACGTAA GGGAATTCAG ATTGGAAGGC TGGACGAAAT TAATAGAGTT
GTAGCCGAAG ATGAGACATT GAAATTATTA ATGGGGACTT TTGATGGTGA GATTGTAGAA
ACAGTAGGAA TACCAACAGA TAAAAGACTT ACTGTTTGTG TCTCAAGTCA AATTGGCTGC
CCAATGGGTT GCAAATTTTG TGCGACTGGG AAGGGGGGGC TTAATAGATC TCTTGATGTG
AATGAGATAG TTGATCAAGT TATTAGTGTT AGAGAAACAA TGAATAGGAG GCCTACTCAT
GTCGTGTTTA TGGGTATGGG CGAGCCACTC CTAAATATTC AGAATGTTCT TGACTCTATA
GAATGTCTCA CAAGTGATAT TGGTATTGGT CAAAGGAAGA TAACGGTTAG TACTGTTGGG
ATACCGAATA CTCTTTCAGA TTTAGCAAAA TTAGCTCAAG ACCGTTTAGG AAGAGTTCAA
TTCACACTTG CAGTCAGTCT TCATGCACCT AATCAGACGT TGCGTGAATT GATAATCCCC
TCGGCGAGTT CATATCCAAT TAATTCATTA CTGAAAGACT GTAAAAAATA TATAGATCTC
ACTGGTAGAC GAGTAAGCTT TGAGTATATA CTTCTTGGCG GTTTGAATGA CAAAGATATT
CATGCAGAGC AGTTAGCTAA TCTGATGAGA GGCTTTCAGA GCCATGTTAA TTTGATAGCT
TATAATCCAA TCGCTGAAGA GAACTTTAAG CGACCAAGCC AATCTAGAGT TAATGCCTTT
AGAGAGCTAT TAGAAAATAG GGGAGTTGCT GTAAGTGTTC GTGCAAGTAG AGGTAGAGAT
AAAGATGCGG CATGTGGACA ATTAAGAAGG CAAACAATCG ATAAAATAAA AATCAACTAA
 
Protein sequence
MTKLPKLSSN SSLLGLSSED LEEFARQEGE KSFRGRQIHE WIYQRGAKSL DSISVLPKKW 
RDSLVRKGIQ IGRLDEINRV VAEDETLKLL MGTFDGEIVE TVGIPTDKRL TVCVSSQIGC
PMGCKFCATG KGGLNRSLDV NEIVDQVISV RETMNRRPTH VVFMGMGEPL LNIQNVLDSI
ECLTSDIGIG QRKITVSTVG IPNTLSDLAK LAQDRLGRVQ FTLAVSLHAP NQTLRELIIP
SASSYPINSL LKDCKKYIDL TGRRVSFEYI LLGGLNDKDI HAEQLANLMR GFQSHVNLIA
YNPIAEENFK RPSQSRVNAF RELLENRGVA VSVRASRGRD KDAACGQLRR QTIDKIKIN