Gene NATL1_17241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17241 
SymbolcodA 
ID4779432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1409970 
End bp1411283 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content35% 
IMG OID640085009 
Productputative cytosine deaminase 
Protein accessionYP_001015544 
Protein GI124026429 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.918771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTTAG AAAAAGAAGA ATTCAATTTA TCTACTAAAG CATCAGGTCG TATTGATGTG 
TTAGTACCGA GATGCTTGAT TGGAGAGGGT GCAAATATTC TGGGAGTAAC AGTTGACTTT
GAAGGGCTAT GTTCGCTTCA AGTTGAGTGG AGGCATGGAA AAATCTGCTC AATTAAAGGT
TTAAAAGATG CTTCAAAAGT TCCTAATGAA ATCCTTCTAC CTAGATTTTC TGAACCTCAT
GCTCATTTAG ATAAAGCATT TTCATGGTCT CGAGCTCCTA ATTATAAAGG GAGTTATCAA
GAAGCTTTAG TAGCCAATTT AAATGACTAT AAAAGTAGGT CTCAAGGCCA ATTGCTTTTT
AGTGTTGAAA AATCTCTGAA CCTAGCCCTT GTTAATGGTA TCCGTGCAAT TAGATCTCAT
ATAGATAGCT TTGGAGAAAA TGTAATGAGA GATTGGGACC TGTTAGATGA TATTAGAAAA
AAATGGCGAG ACAAAATTTT CTTACAATTT GTGGCTTTAG TCCCATTAGA ATTTTGGCAA
ACGTATGAAG GTGAGCTTTT AGCGCAAAGA GTTGCTTTGA ATGGAGATCT CCTAGGAGGA
GTCATAGCTC CTCCTTTTAA TAAAAAGAAG ACAATTCAGT CTTTATTACA CTTAGTTCAA
CTTGCAAATA GACTTAATTG TGATATTGAT CTTCATATTG ATGAGTCTCA GTCTTGTCCT
GCTGCAGGGG TGAAATTACT TCTTGAAGTA TTAGGCCGTA TTAAAAATGA GATATCAATA
ACATGTAGTC ATTTGAGCAG TATGGCTTTA CTAAGAGAAA AATCGATTTC AAATTTGGCA
AAGGAAATAG CTGAAAAGAA ATTAAATGTT GTTGCTTTAC CACTCACGAA TTCTTGGCTG
CTCGGTAGAA ATGAACGATC TACTTCAATT AAAAGACCTC TAGCTCCAAT ATTTCAACTT
CAAAAGGCTG GGGTTGTTGT ATCTGTAGGA GGAGACAATG TAAATGATGC ATGGTTTCCA
TTCACTAATT TTGATCCAAT AAATTTAATG GCTTTTTCAA TGCCAATTGC TCATTTAACT
CCTTGGGAGA GATTGGGCCT TTCTCCATTT ACTTCATCCG CAGCAAGTAT TCTTAATCTT
CAATGGGATG GCGTTTTACA AAAAGGAAGT CCTGCCGATT TTGTTTTGTT AGATTCAAAT
AGTTGGGTAA AAGCTTTGTC TGAAAGACCT AAAAGAAGAG TAGTAGTTAA TGGCGAATTT
TTAAATGAAT TGCCTAAAAA CAAAAAATCA ACATTCAACA ATTCTCACTC ATGA
 
Protein sequence
MTLEKEEFNL STKASGRIDV LVPRCLIGEG ANILGVTVDF EGLCSLQVEW RHGKICSIKG 
LKDASKVPNE ILLPRFSEPH AHLDKAFSWS RAPNYKGSYQ EALVANLNDY KSRSQGQLLF
SVEKSLNLAL VNGIRAIRSH IDSFGENVMR DWDLLDDIRK KWRDKIFLQF VALVPLEFWQ
TYEGELLAQR VALNGDLLGG VIAPPFNKKK TIQSLLHLVQ LANRLNCDID LHIDESQSCP
AAGVKLLLEV LGRIKNEISI TCSHLSSMAL LREKSISNLA KEIAEKKLNV VALPLTNSWL
LGRNERSTSI KRPLAPIFQL QKAGVVVSVG GDNVNDAWFP FTNFDPINLM AFSMPIAHLT
PWERLGLSPF TSSAASILNL QWDGVLQKGS PADFVLLDSN SWVKALSERP KRRVVVNGEF
LNELPKNKKS TFNNSHS