Gene P9301_14901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_14901 
SymbolcodA 
ID4912716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1259549 
End bp1260784 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content32% 
IMG OID640161083 
Productputative cytosine deaminase 
Protein accessionYP_001091714 
Protein GI126696828 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTAATT CCGGCACAGC TGAGGTTCTT ATTCCCAGAA GTCTTTTTGT AATAGAAGAT 
ATAGATAACC TCATTATCGA TGTAGAGGAT TTATGTTCAG TTTCAATTAG TTGGGAGGAT
GGATTTGTTT CTGAGTTAAA GCCTTTAAAA AATAAAATTA CAAAACCAAA AAATATTTTA
TTCCCAAGAT TTGTTGAACC GCATTCGCAT TTTGATAAAT CATTTACATG GGCAGCCTTT
CCTAATCTGG AATCAAACTA TGGAGGAGCA TTATCAGCAA ATCTTCAAGA ACATAAAACT
AGAAGTACAG ATAAGGTTCT TGAAAGAGTT GATAAATCAT TAAAACTTGC CATACAAAAT
GGATACCGAG CAATAAGAAG TCATATTGAT ACATACAAAG GTCAATCTAT TGATATTTGG
AGTGAACTTT TTAAATTACA AAAAAAATTT TCATCTGAGT TGACTTTACA ATACGTTGCT
TTAGCTCCAT TGGAATTTTG GAATACAACT GATGGAGAAG ATTTGGCAAA AATGTTTTCT
TCTAATGGAG GCATTTTAGG GGGTGTTGTT GTACCCCCTT TCAATAAAAA AGACACAAGC
AAATTTCTAG CAAAGATGCT TCTTCTTGCG AGTAAAAATA AATTAGAAAT TGATTTGCAT
ATAGATGAAT CAATTATTGA ACCTGGAGCT GGAATAAAAG TTTTATTAGA AACAATCGAA
AATTTAAATA TTAATAGTAT TCCGATCACT TGTAGCCATT TGAGTAGTCT TATTTCTCTA
ACTAATAGAG AGATTTTAAA GTTAGGGGAG AAAATGGCTG AGAGAAATAT TAAAGTTATT
GCCTTACCAC TAACAAATTT TTGGCTGCTC AATCGAAGTA ATAAAACTAC TTCATTAAAA
AGACCAGTTG CGCCAATAAA GCAATTACAA AATTCACATG TTGATGTATC TCTTGGTAGT
GATAATGTGC AAGACCCTTG GTACCCATTT GGTGATTTTG ACCCTTTTTA TACGTTGTCT
TGCTCGATGC CTATGCTTCA ACTAAATCCC TGGGAGAGAA TGACTCTATC TTCTATTTTT
TTAGCTCCAA GCAGATTATT AAATTTAAAA TGGGATGGTT TAATTAAAAA AGGTTGTCCT
GCTGATTTTG TGATTTTAGA TGCACAAAGA TGGGCAGATG TTTTTTCGAG AAATTTAAAG
AGAAAAGTAT TTATAAACGG CGATTTATAT TGTTAA
 
Protein sequence
MSNSGTAEVL IPRSLFVIED IDNLIIDVED LCSVSISWED GFVSELKPLK NKITKPKNIL 
FPRFVEPHSH FDKSFTWAAF PNLESNYGGA LSANLQEHKT RSTDKVLERV DKSLKLAIQN
GYRAIRSHID TYKGQSIDIW SELFKLQKKF SSELTLQYVA LAPLEFWNTT DGEDLAKMFS
SNGGILGGVV VPPFNKKDTS KFLAKMLLLA SKNKLEIDLH IDESIIEPGA GIKVLLETIE
NLNINSIPIT CSHLSSLISL TNREILKLGE KMAERNIKVI ALPLTNFWLL NRSNKTTSLK
RPVAPIKQLQ NSHVDVSLGS DNVQDPWYPF GDFDPFYTLS CSMPMLQLNP WERMTLSSIF
LAPSRLLNLK WDGLIKKGCP ADFVILDAQR WADVFSRNLK RKVFINGDLY C