Gene Tery_4570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4570 
Symbol 
ID4246224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7030421 
End bp7031815 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content36% 
IMG OID638109443 
Productcytosine deaminase-like protein 
Protein accessionYP_724019 
Protein GI113477958 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.882556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGTC AAAAGTTAAT ATTAAAAGGT CAAAAGTCAA AATTCTCTAT ATATTTTATG 
AACCATCAAC AATTTTCAGT CATGCCTGAG ACTTCTAACT ATTGGCTAAA AAATGCTCAT
GTACCTGCAT GTCTCATAGA AAGAGAATTA GAGATATCAG ATCAAACAAG AGTTGGTTTG
TCTTTAGTAG ATATAGAAAT TAAAGAGGGA GTCATAACAC AAATTGTCCA CTCGTTAGCC
GACTTGCCTT CTCTGAACAA TGGGGACATT CCTAGAGTTG ACCTCAAAGG TGGTATGGTG
TGGCCATGTT TTGTAGATAT GCATACCCAT TTAGATAAAG GCCATATTTG GGAGCGATCG
CCTAACCCAG ATGGTAGTTT TAATGGCGCT TTAGATGCAG GTGCTAGGGA TGGGGAAAAA
TATTGGAATG CTGAAGACAT TTACCGTCGA ATGGAATTTG GCCTGAAATG CAGTTATGCT
CATGGCACAA AAGCTATTCG GACTCATCTT GACTGTTTTG CACAACAGGC AAATATTAGT
TTGGAAGTAT TTCAAACATT ACAAAAAAAG TGGGAAAGTA GATTAATATT ACAACCTGTT
TCTTTGGTTA CTGGAGATTA TTATCTTACT GAAGCTGGAG AAAAATTAGC CGATCAAATT
GCTGATATTG GTGGGATATT AGGAGGGGTT CCATTTATAA ATTCAGACTT AGATAACCAA
TTAGATAGAT TATTTAGTAT AGCCAAAGAA CGTCAATTAG ATTTAGACTT ACATACTGAT
GAAAATGGAG ACCCTAATTC CCGGGTTTTA CACAAAGTTG CTGAGAAAGC AATTAGTCAT
CAATTTTATG GCAAAATTAT CTGCGGACAT TGTTGTAGTT TAGCTGTACA AACCCCTGAA
AAAGTAACAG AAACTATTAA GTTAGTTAAG GAAGCTGGTA TTAGTATTGT TAGTCTGCCA
ATGTGTAATC TTTATCTCCA AAATCGCCAA GAAAATTATA CTCCTAGATG GCGGGGTGTG
ACTTTATTAC ATGAATTAAA AAAACAGGGA ATTTCAGTTG CTATTGCCAG TGATAATTGT
CGCGACCCGT TTTTTGGATT TGGTGACCAT GATGTTTTAG AAGTCTTTAA TATGTCAGTG
AGAATTGCTC ATTTAGATAT GCCTTATGGG GATTGGCCTT GTGCTGTTAC TAAGACTCCT
GGAGATTTAA TTGGTTTGCC AAATGTGGGA ATAATTAAAG TTGGATTACC CGCAGATTTA
ATTCTATTTA AAGGTCGGAA TTTTAGTGAA CTTTTATCAC GGTCTCAGCA TGATAGAGTA
GTATTAAGAA ATGGTCTAGT TATTGATACA ACTTTACCTG ATTATTCTGA ATTAGATAGC
TTGTTTTACG GATAG
 
Protein sequence
MGRQKLILKG QKSKFSIYFM NHQQFSVMPE TSNYWLKNAH VPACLIEREL EISDQTRVGL 
SLVDIEIKEG VITQIVHSLA DLPSLNNGDI PRVDLKGGMV WPCFVDMHTH LDKGHIWERS
PNPDGSFNGA LDAGARDGEK YWNAEDIYRR MEFGLKCSYA HGTKAIRTHL DCFAQQANIS
LEVFQTLQKK WESRLILQPV SLVTGDYYLT EAGEKLADQI ADIGGILGGV PFINSDLDNQ
LDRLFSIAKE RQLDLDLHTD ENGDPNSRVL HKVAEKAISH QFYGKIICGH CCSLAVQTPE
KVTETIKLVK EAGISIVSLP MCNLYLQNRQ ENYTPRWRGV TLLHELKKQG ISVAIASDNC
RDPFFGFGDH DVLEVFNMSV RIAHLDMPYG DWPCAVTKTP GDLIGLPNVG IIKVGLPADL
ILFKGRNFSE LLSRSQHDRV VLRNGLVIDT TLPDYSELDS LFYG