Gene CA2559_04135 details


Gene Information

Locus tag: CA2559_04135
Symbol:
ID: 9296316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start (bp): 940065
End (bp): 941402
Gene length: 1338 bp
Protein length: 445 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_003715591
Protein GI: 298207412
COG category:
COG ID:
TIGRFAM ID:
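
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 940065
end_bp = 941402

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene length and Protein length fields listed in the record.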


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.414308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTGGG AACAACTATT ATCTCTTAAG CGTTTTGGAG ATACAAATAA GCGTTTAAGA 
AAAGAACAAG ACGATACACG ATTGGGCTTT GAGGTAGATT ACGATCGTAT TATATTTTCC
AGCTCTTTTA GGAGTTTGCA GGATAAAACT CAAGTTATCC CATTGTCTAA AACAGATTTT
GTGCACACAA GATTAACACA CAGTTTAGAG GTAAGTGTTG TGGGGCGAAG TTTAGGACGA
GTTGTTGGTA AGAAGCTTTT AGAAAAACAC CCACATCTTT CTGAGACGTA TGGGCATCAT
TTTAATGATT TCGGGGCTAT TGTTGCTGCA GCATCTTTAG CACACGATAT TGGAAATCCG
CCGTTTGGTC ACTCTGGCGA AAAGGCTATA GGAGACTTTT TTAAATCTGG AAAAGGAAAT
AGATTTAAAG ACTCTCTTAC AAACGTTCAA TATCAAGACC TTTGTACCTT TGAAGGAAAC
GCTAACGGAT TTAAGCTTTT AACTGAAACA AAAAACGGAG TAACTGGCGG TTTAAGGTTA
TCTTACTCAA CCTTGGGTGC TTTTATGAAA TACCCAAAAG CTTCGTTACC TTATAAACCA
ACAACTCAAA TTCACCATAA AAAATATGGT TACTTCCAAA GTGAGCAGGA AGTCTTTAAT
GATGTAGTTA AGGATTTAGG ATTAATTTCT GAAACTGTAA AAGATTCAGA AACTTACAAA
AGACATCCGT TAACGTTTTT AGTTGAAGCC GCAGATGATA TCTGCTATAC AATTATAGAC
TTTGAAGATG GTATAAATTT AGGTTTAATA GATGAGGAGT TTGCTTTAGA ATATCTAATT
AATTTGGTTA AGGATAAGAT AGACACAAAA AAATATCATC AACTCGTTAC CAAATCTAAT
AGAGTAAGTT ATTTAAGAGC ATTAGCTATT GGAGTGCTTA TTGAAGAGGC AGCTTCAATT
TTTATTGCAA ATGAAGAAGC TATACTTAAA GGTGACTTTA GTTCTGCATT ATTAGATAAG
TCACAGTACA CGGCACAAAT AGATGATATT ATAAAAATTA GCATTAACAA TGTATATCAA
TCTCAAGACG TCTTAGAGAA GGAGATATTG GGCTACCAAG TAATCGGAAC ATTATTGGAA
GTTTATACAG ATGCCGTGTT TAGTAAGAAA AACAACACAA ATACAAATTT TAATTCATTG
ATTTTGAAAG GTTTTCTTAA AGAATTCGAC TTAAATCAAG ATGATTATTC TATTTTAATT
GAAATTTCTT CACTTGTAGC CTCTTATTCA GACAGTGAAG CCCTTAGAAT TTACCAGAAA
ATTAAGGGCA TGTTATAG
 
Protein sequence
MNWEQLLSLK RFGDTNKRLR KEQDDTRLGF EVDYDRIIFS SSFRSLQDKT QVIPLSKTDF 
VHTRLTHSLE VSVVGRSLGR VVGKKLLEKH PHLSETYGHH FNDFGAIVAA ASLAHDIGNP
PFGHSGEKAI GDFFKSGKGN RFKDSLTNVQ YQDLCTFEGN ANGFKLLTET KNGVTGGLRL
SYSTLGAFMK YPKASLPYKP TTQIHHKKYG YFQSEQEVFN DVVKDLGLIS ETVKDSETYK
RHPLTFLVEA ADDICYTIID FEDGINLGLI DEEFALEYLI NLVKDKIDTK KYHQLVTKSN
RVSYLRALAI GVLIEEAASI FIANEEAILK GDFSSALLDK SQYTAQIDDI IKISINNVYQ
SQDVLEKEIL GYQVIGTLLE VYTDAVFSKK NNTNTNFNSL ILKGFLKEFD LNQDDYSILI
EISSLVASYS DSEALRIYQK IKGML
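
The gene and protein sequences above are related by translation table 11, which assigns amino acids to codons exactly as the standard genetic code does (it differs only in the set of permitted start codons). The following is a minimal sketch of that relationship, not the method used to generate this record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Amino acid assignments in the conventional T, C, A, G codon ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGAATTGGG AACAACTATT ATCTCTTAAG CGTTTTGGAG ATACAAATAA GCGTTTAAGA"
last_line = "ATTAAGGGCA TGTTATAG"

print(translate(first_line))
print(translate(last_line))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the Protein sequence and GC content fields to be checked against the nucleotide data.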