Gene CA2559_02595 details


Gene Information

Locus tag: CA2559_02595
Symbol:
ID: 9296004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 612232
End bp: 613392
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003715285
Protein GI: 298207106
COG category:
COG ID:
TIGRFAM ID:
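
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(612232, 613392)
print(length_bp, protein_length(length_bp))
```

With the values listed in this record, the two results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.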


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTTT ACCATAAACT CGGTAAGATT CCTCATAAGC GTCATACCAT TTTTAAGAAA 
CCAGATGGCT CATTATACTA TGAGCAGCTT TTTGGCACAA TTGGTTTTGA TGGTATGAGC
AGTAACTTAT ACCACGAACA TAGACCTACA CAGGTTAAAA AAATAGACGG CAGTTATGAT
GTAACTCCTA AAGTAGCTAC CAAAAACAAT ATGCATTCTT TACGCCTTAA GGGTTTTCAG
GTTATACCAG AACCAGATTA CTTAGAAAGC AGAAAGGTAG TGCTTACAAA TAGCGATGTA
GATATTACAT TAGCATCGCC TCAAAATTTA ACACAAGACT ATTTTTATAA AAATGCAGAT
AGTGATGAGT TATTATTTGT ACATAAAGGT AGCGGTGTCT TAAGAACGCA TTTAGGTAAT
TTAGATTTTA AATATGGAGA TTACCTTTTA ATACCTAGAG GTGTTATTTA TAAAATAGAT
TTTGATGATG AAAACAATAG ACTATTTATA GTTGAGTCAC GTCGTCCTAT ATACACTCCT
AAACGTTACA GAAATTGGTT TGGACAATTG TTAGAGCATT CTCCATTTTG TGAGCGTGAC
CTAAGACAAC CTCAAGACTT AGAAACTCAT GATGAGGTTG GAGATTTTGT AATTAAAGTA
AAAAAGAATA ACGAAATCTT CAATATGGTT TATGCCACGC ATCCTTTTGA TGTTGTTGGG
TATGATGGCT ATAATTATCC ATATGCATTT TCAATACATG ATTTTGAACC TATAACTGGT
CGCATACACC AACCGCCACC AGTACACCAA ACATTTGAGA CAGATGCCTT TGTAGTATGT
AGTTTTTGTC CGAGAAAATA CGATTACCAT CCAGAAAGCA TTCCTGCACC TTACAACCAT
AGCAATATAG ATAGTGATGA AGTGCTGTAT TATGTAGATG GTGATTTTAT GAGCAGAAAT
GATATTGAGC CAGGACACAT ATCACTGCAT CCTGCCGGCA TACCTCACGG CCCACATCCA
GGTGCTGTAG AACGTAGCAT AGGGCAGACA GAAACTGAAG AGCTTGCTGT TATGGTAGAT
ACTTTTAAAC CATTAATGGT AACTGAAGAA GGTGCTAAAA TAGCAGATAA ATCTTACCAC
CAATCTTGGT TAGAACACTA A
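
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The sketch below is one way to do that and to compute a GC percentage like the one in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used as a small sample.
first_row = clean_sequence(
    "ATGCCTTTTT ACCATAAACT CGGTAAGATT CCTCATAAGC GTCATACCAT TTTTAAGAAA"
)
print(len(first_row), round(gc_content(first_row), 1))
```

Passing the complete gene sequence instead of the first row gives the value for the whole gene.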
 
Protein sequence
MPFYHKLGKI PHKRHTIFKK PDGSLYYEQL FGTIGFDGMS SNLYHEHRPT QVKKIDGSYD 
VTPKVATKNN MHSLRLKGFQ VIPEPDYLES RKVVLTNSDV DITLASPQNL TQDYFYKNAD
SDELLFVHKG SGVLRTHLGN LDFKYGDYLL IPRGVIYKID FDDENNRLFI VESRRPIYTP
KRYRNWFGQL LEHSPFCERD LRQPQDLETH DEVGDFVIKV KKNNEIFNMV YATHPFDVVG
YDGYNYPYAF SIHDFEPITG RIHQPPPVHQ TFETDAFVVC SFCPRKYDYH PESIPAPYNH
SNIDSDEVLY YVDGDFMSRN DIEPGHISLH PAGIPHGPHP GAVERSIGQT ETEELAVMVD
TFKPLMVTEE GAKIADKSYH QSWLEH
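
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch that builds the codon table from its compact NCBI-style string. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which this sketch does not handle; that is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, matched with the amino acid string
# shared by the standard code and translation table 11 ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCTTTTT ACCATAAA"))
print(translate("TTAGAACACTAA"))
```

The two fragments correspond to the start (MPFYHK) and the end (LEH) of the protein sequence listed above.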