Gene NATL1_21841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21841 
SymbolpurU 
ID4780271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1844418 
End bp1845272 
Gene Length855 bp 
Protein Length284 aa 
Translation table11 
GC content35% 
IMG OID640085482 
Productformyltetrahydrofolate deformylase 
Protein accessionYP_001016004 
Protein GI124026889 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0788] Formyltetrahydrofolate hydrolase 
TIGRFAM ID[TIGR00655] formyltetrahydrofolate deformylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.310068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.572618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCATCA AAACAGTTAT TTTGCAGTTC ATTTGTCCTG ATAAGCCAGG ACTTGTAAGT 
GATTTGGCAA GTTGGATAGC GAGTAAAAAT GGCAATATTA GACATGCTGA TCACCATACA
GATGCAGATG CAAAATTGTT TCTAAGTCGC ATTGAGTGGG ATTTAGATGG ATTTCTGCTT
GATAAAAACG AAATCACCTC TGAGGTAAAT TTACTGGAGC AAAGATTGAA TGGGAAAGCG
GCCTTGAGCT TCTCTGATGA TTTTCCTAAT GTTGCAATCT TTGTTAGTAA GCAGAGTCAT
TGTTTAGTTG ATCTTTTATG GAGAGTTAAG GCAGGCGAAT TATGTATGAA TGTACCTTTA
GTTATCTCAA ATCATTCAGA TTTAGAGGAA ATTTGCTCGA GTTTTTCCAT TCCTTTTAAG
CTTATAGAAG TAAATAAAAA CAATAAAGCA GATTCCGAAA GTAAAATTTT AGATTTACTA
CATGACTACA ACATTGATTT AGGCGTTTTG GCTAAATATA TGCAAATTTT AAGTAGTTCA
TTTTTAGAGC AGTTTCCAAA TCTCATTAAT ATTCATCATT CTTTCCTTCC TGCTTTTAAA
GGGGCTCAAC CATATCATCA AGCTTGGGAT AGAGGAGTAA AATTAATTGG TGCAACAGCG
CATTATGTCA CTAAGGATCT TGATGCTGGT CCCATAATTG AACAGACTAT ATCTAACGTG
AGTCATCGTG ATGAAGTATC AGATTTAATA AGAAAGGGTA GAGATTTGGA GCGAGTCGCT
TTAGCAAGAG CTTTAAGATT GCATTTAAAA AGACAAGTTA TTGTTTATAG AGGCAGAACG
GCTGTATTTA CATGA
 
Protein sequence
MSIKTVILQF ICPDKPGLVS DLASWIASKN GNIRHADHHT DADAKLFLSR IEWDLDGFLL 
DKNEITSEVN LLEQRLNGKA ALSFSDDFPN VAIFVSKQSH CLVDLLWRVK AGELCMNVPL
VISNHSDLEE ICSSFSIPFK LIEVNKNNKA DSESKILDLL HDYNIDLGVL AKYMQILSSS
FLEQFPNLIN IHHSFLPAFK GAQPYHQAWD RGVKLIGATA HYVTKDLDAG PIIEQTISNV
SHRDEVSDLI RKGRDLERVA LARALRLHLK RQVIVYRGRT AVFT