Gene NATL1_07201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07201 
SymbolprfC 
ID4781061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp663894 
End bp665561 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content39% 
IMG OID640083995 
Productpeptide chain release factor 3 
Protein accessionYP_001014543 
Protein GI124025427 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG4108] Peptide chain release factor RF-3 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00503] peptide chain release factor 3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.577625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0107792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA CTCCCCAAGA CAAGACTAAT TTAGATCCAA ATTTAGTAGA AGAATTAACT 
GTATCTTTAA GTAAAGAGGT TAGTAAAAGA AGAAATTTTG CGATTATTTC CCACCCTGAT
GCAGGGAAAA CTACTTTGAC AGAAAAACTT TTACTTTATG GCGGAGCTAT TCAACAGGCA
GGAGCAGTAA AAGCAAGAGG AGAACAGAGA AAAGTTACTT CTGATTGGAT GGAGCTTGAA
AAACAACGAG GCATATCAAT TACATCAACT GTTTTACAAT TTGCTTATAA AGATAAGACG
ATAAATTTAT TAGATACCCC TGGCCACCAA GACTTCTCGG AAGATACGTA TAGGACACTT
GCCGCTGCTG ATAATGCTGT AATGCTTGAG GATGCAGCAA AAGGTCTTGA ACCTCAAACA
AGGAAGCTTT TTGAGGTTTG CAGAATGCGT CAGATACCAA TTTTTACATT CATCAACAAG
ATGGATAGGC CAGGTCAAGA ACCGCTGGAA TTATTAGATG AGATTGAGTC TGAATTAGGG
TTATTGACCT TTGCAGTTAA TTGGCCAATA GGGAGTGGTG AGCTTTTTAG AGGTGTTGTT
GAGCGTGCTA CCAAAGAAGT TGTTCTATTT TCTAGAGCGG AAAGAGGGAA GCAATCTAAT
GAAATAAGGT TAAAAATCAA TGATCCTGAA CTTAAAAACT TAGTCGAAGA AGAGCTATTG
ACAAAAGCAC TTGAAGAGAT TGAGATCCTA GATGAGGCTG GTTGTGACTT AAATCAAGAA
TTAATCTTGT CTGGCGAATT AACTCCTGTG TTTTTTGGAT CTGCAATGAC CAACTTCGGG
GTTAGGCCAT TTTTAGATAA TTTTCTTGAT CTTTCTCAGG GACCTGTCGC TCGAAATAGC
TTTGATGGAC CAATCGTTCC TACAAGAGAA TCATTTAGTG GATTTGTATT TAAATTACAA
GCAAATATGG ATCCAAAACA TAGGGATAGA GTTGCTTTCG TGCGTGTATG TAGCGGGAGG
TTTGAAAAAG ATATGACAGT TCAACACGCC AGGACAGGTA AACAAATCAG ACTCTCAAGA
CCCCAAAAGA TTTTCGGTCA AGATAGGGCT GTTGTTGATG ATGCTTATCC TGGGGATGTT
ATTGGTTTGA ATAATCCGGG GATGTTTTCT ATTGGAGATA CTTTATTTAT TGGTCCGAGA
GTCGAATTCG AGGGCATCCC ATGTTTTAGC CCCGAGATAT TTAGTTGGTT AAGAAATCCA
AATCCTTCAG CTTTTAAAAA CTTTAGGAAA GGGGTCAATG AATTAAGAGA GGAGGGAGCA
GTCCAAATTC TTTACGATAA AGATCAAAGC AAAAGAGATC CAATTTTGGC CGCAGTTGGT
CAGCTTCAGC TTGAGGTGGT ACAGCATCGA CTTGCTAGTG AATATGGGGT GGAAACTCGG
CTTGAACCAA TGGGTTATCA AGTCGCCAGA TGGGTAAAAG GGGGATGGCC TGCTTTGGAC
GAGGTTGGAA GAATTTTTAA TTGCAAAACC GTTCAAGATG CTTGGCTTAG ACCAGTACTG
CTTTTTAAAA ATGAATGGAA TCTTAATCAG TTAAAGGAAG ATCATCCTGA AATGGAATTA
AACTCAGTTG CTCCGGTTGT TAGTGGTGTT GATCCTGTTT CTCTCTAA
 
Protein sequence
MKSTPQDKTN LDPNLVEELT VSLSKEVSKR RNFAIISHPD AGKTTLTEKL LLYGGAIQQA 
GAVKARGEQR KVTSDWMELE KQRGISITST VLQFAYKDKT INLLDTPGHQ DFSEDTYRTL
AAADNAVMLE DAAKGLEPQT RKLFEVCRMR QIPIFTFINK MDRPGQEPLE LLDEIESELG
LLTFAVNWPI GSGELFRGVV ERATKEVVLF SRAERGKQSN EIRLKINDPE LKNLVEEELL
TKALEEIEIL DEAGCDLNQE LILSGELTPV FFGSAMTNFG VRPFLDNFLD LSQGPVARNS
FDGPIVPTRE SFSGFVFKLQ ANMDPKHRDR VAFVRVCSGR FEKDMTVQHA RTGKQIRLSR
PQKIFGQDRA VVDDAYPGDV IGLNNPGMFS IGDTLFIGPR VEFEGIPCFS PEIFSWLRNP
NPSAFKNFRK GVNELREEGA VQILYDKDQS KRDPILAAVG QLQLEVVQHR LASEYGVETR
LEPMGYQVAR WVKGGWPALD EVGRIFNCKT VQDAWLRPVL LFKNEWNLNQ LKEDHPEMEL
NSVAPVVSGV DPVSL