Gene A9601_15031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_15031 
SymbolcodA 
ID4718225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1286448 
End bp1287683 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content32% 
IMG OID640079225 
Productputative cytosine deaminase 
Protein accessionYP_001009893 
Protein GI123969035 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTAATT CCGGCACAGC TGAGGTTCTT ATTCCCAGAA GCCTTTGTCT AATAGAAGAT 
ATAGATAACC TCATTATCGA TGTAGAGGAT TTATGTTCAG TTTCCATTAG TTGGGAGGAT
GGATTAGTTT CTGAGTTAAA GCCTTTAAAA AATAAAATTA CAAAACCAAA AAATATTTTA
TTCCCAAGAT TTGTTGAAAC GCATTCGCAT TTTGATAAAT CTTTTACATG GGCAGACTTT
CCTAATCTGG AATCAAACTA CGGAGGAGCG TTATCAGTAA ATCTTGAAGA ACATAAAACT
AGAACTACAG ATAAGGTTCT TGAGAGAGTT GATAAATCAT TAAAACTTGC CATACAAAAT
GGATACCGAG CAATTAGAAG TCATATCGAT ACATACAAAG GTCAATCTAT TGACATTTGG
ATTGAACTTT TTAAATTACA AAAAATATTT TCATCTGAGT TGACACTACA ATACGTTGCT
CTAGCTCCAG TGGAATTCTG GGATACAACT GATGGAGAAG ATTTGGCAAA AATATTTTCC
TCTAATGGAG GTATTTTAGG AGGTGTTATT GTACCCCCTT TCAATAAAAA AAATACAAGC
AAATTTCTAG CAAAGATGCT TCTTCTTGCT AGAAAATATA AATTAGAAAT TGATTTGCAT
ATAGATGAAT CAATTATTGA ACCTGGAGCG GGAATAAAAG TTTTATTAGA AACAATAGAA
AATTTAAAAA TTAATAGTAT TCCAATCACT TGTAGTCATT TGAGTAGTCT TATTTCTCTA
AGTAATAGTG AGATTTTAAA TTTAGGAGAA AAAATGGCTG AGAAAAATAT TAAGGTTGTT
GCTTTACCCC TAACAAATTT TTGGCTGCTC AATCGAAGTA ATAAAACTAC TTCATTTAAA
AGACCAGTTG CGCCAATAAA GCAATTACAA AAATCACATG TGGATGTATC TCTTGGTAGT
GATAATGTTC AAGACCCTTG GTACCCATTT GGTAATTTTG ACCCTTTTTA TATGCTGTCT
TGCTCGATGC CTATGCTTCA ACTAAATCCC TGGGAGCGAA TGACTCTATC TTCTATTTTT
TTAGCTCCAA GCAGATTATT AAATTTAAAA TGGGATGGTT TAATTAAAAA AGGTTGTCCT
GCTGATTTTG TGATTTTAGA TGCACAAAGG TGGGCAGATG TTTTTTCGAG CAATTTAAAG
AGAAAAGTAT TTATAAACGG CGATTTACAT TCCTAA
 
Protein sequence
MSNSGTAEVL IPRSLCLIED IDNLIIDVED LCSVSISWED GLVSELKPLK NKITKPKNIL 
FPRFVETHSH FDKSFTWADF PNLESNYGGA LSVNLEEHKT RTTDKVLERV DKSLKLAIQN
GYRAIRSHID TYKGQSIDIW IELFKLQKIF SSELTLQYVA LAPVEFWDTT DGEDLAKIFS
SNGGILGGVI VPPFNKKNTS KFLAKMLLLA RKYKLEIDLH IDESIIEPGA GIKVLLETIE
NLKINSIPIT CSHLSSLISL SNSEILNLGE KMAEKNIKVV ALPLTNFWLL NRSNKTTSFK
RPVAPIKQLQ KSHVDVSLGS DNVQDPWYPF GNFDPFYMLS CSMPMLQLNP WERMTLSSIF
LAPSRLLNLK WDGLIKKGCP ADFVILDAQR WADVFSSNLK RKVFINGDLH S