Gene NATL1_20151 details


Gene Information

Locus tag: NATL1_20151
Symbol: thiF
ID: 4779548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1657930
End bp: 1659075
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 33%
IMG OID: 640085307
Product: molybdopterin biosynthesis protein
Protein accession: YP_001015835
Protein GI: 124026720
COG category: [H] Coenzyme transport and metabolism
    [P] Inorganic ion transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
    [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID:
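
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1657930, 1659075)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Under these assumptions the computed values agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields of this record.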


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.28602
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCAAA GCCAGAGTAA GGCAAATTTA AGTTCAGAAG AAATTGCCAG GTATGCAAGA 
CATATAAGTC TCCCAGAGAT AGGTATCAAA GGCCAAGAAA AATTGAAGAC AAGCTCAGTT
GCTTGCATTG GGACAGGAGG GCTAGGATCT CCACTTTTAA TTTATCTTGC AGCAGCTGGA
ATTGGACGTA TCGGAATAGT TGATTTTGAT GTCGTTGAAT ACTCAAATTT ACAAAGACAA
ATCATTCATA CAACACATTC AATAGGTCTA TTAAAAACAG ATTCGGCCAA ACAAGCTATA
CGCAAAATAA ATCCTTCTTG TCGAGTTGAT TTATTCAATC AAAAGCTAAC AAGTAGTAAT
GCTTTGGAAA TACTTAAAGC TTATGATGTG ATATGTGATT GTTCAGACAA TTTCCCAACG
CGTTACCTGA TTAATGATGC TTGTCTAATA CTTAACAAAC CTAATATATA TGGTTCAATC
GCAAGATTCG AAGGACAAGT AAGTGTATTT AATTTGAAGG AAGATAGCCC TAACTATAGA
GACCTTATCC CCATACCCCC TCCACAAGAG TTAATTCCAT CTTGCTCTGA AGCTGGAGTG
ATGGGAATTC TTCCAGGAAT TATTGGTACA ATTCAAGCAG CAGAAGCTAT AAAGATAATA
ACAAACATTG GTTATCCACT TAACGGTAGG ATTCTCATTT TTAATGCATT AAAAATGCAA
TTTAAAGAAC TAACTTTGAA ATCCAATCCA GAAAATAAAA ATATCCATAA ATTAATAGAT
TATAAAAGTT TCTGTTCAGA AATTTCAGTT AAAGATGAAG TAGAATGTGA TATAGAAAGT
ATTTCAGTTA AAGAATTAAA AGTACTTCTT AGACAATCTT CAAAAGAAAT GTTATTAATA
GATGTTCGCA ACCAAGATGA ATATCATCAA TGTTCAATTA CAGGTTCATT GCTCATACCT
CTTAACTCTA TTGAAAGTGG TAAAGCCATT GATGAAATTA AAATCCTTAC CGCAAAAAAA
AATCTTTATG TATTTTGTAA AAGTGGAAAA AGATCATTGC TTGCATTAAA GCATTTAAAC
AAATTTGGAA TTAGAGGTAT TAATATTCTT GGAGGTATTG ATGCGTGGAA TAGCGAAAAA
AATTAA
 
Protein sequence
MEQSQSKANL SSEEIARYAR HISLPEIGIK GQEKLKTSSV ACIGTGGLGS PLLIYLAAAG 
IGRIGIVDFD VVEYSNLQRQ IIHTTHSIGL LKTDSAKQAI RKINPSCRVD LFNQKLTSSN
ALEILKAYDV ICDCSDNFPT RYLINDACLI LNKPNIYGSI ARFEGQVSVF NLKEDSPNYR
DLIPIPPPQE LIPSCSEAGV MGILPGIIGT IQAAEAIKII TNIGYPLNGR ILIFNALKMQ
FKELTLKSNP ENKNIHKLID YKSFCSEISV KDEVECDIES ISVKELKVLL RQSSKEMLLI
DVRNQDEYHQ CSITGSLLIP LNSIESGKAI DEIKILTAKK NLYVFCKSGK RSLLALKHLN
KFGIRGINIL GGIDAWNSEK N
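
The gene and protein sequences above can be cross-checked against each other. The sketch below is illustrative only: it builds a codon table from the standard codon assignments (which translation table 11 shares; the tables differ in permitted start codons, which this sketch does not model), translates a DNA string until the first stop codon, and computes a GC fraction. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 60 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGGAGCAAA GCCAGAGTAA GGCAAATTTA AGTTCAGAAG AAATTGCCAG GTATGCAAGA"
print(translate(first_line))    # MEQSQSKANLSSEEIARYAR
print(gc_fraction(first_line))  # 0.4
```

The translated fragment matches the first 20 residues of the protein sequence. The GC fraction printed here covers only these 60 bases, so it is not directly comparable to the 33% reported for the whole gene.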