Gene Noca_4189 details

Gene Information

Locus tag: Noca_4189
Symbol:
ID: 4596703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4428222
End bp: 4429424
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 70%
IMG OID: 639778795
Product: putative fumarylacetoacetate hydrolase
Protein accession: YP_925373
Protein GI: 119718408
COG category: [R] General function prediction only
COG ID: [COG3970] Fumarylacetoacetate (FAA) hydrolase family protein
TIGRFAM ID:
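
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are inclusive (an assumption about this database's convention, not something the record states) and that the gene length counts the terminal stop codon, which the gene sequence ending in TGA suggests. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4428222
end_bp = 4429424

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1203 bp) and Protein Length (400 aa).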


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACAC CCCAGGATCT GCTGCCTGAG GACGCCGACC AGGCGCTTCT CGTGGGCCGC 
GTGCACGACC CGGAGGTGGG CGGACCGTGT GTCGTCGTCG TCACCGGCGG CCGCGCCATC
GACATCTCTG ACGTCGCCCC GACTGTGTCG GACCTCTTCG AGCGCGACGA CCTCCTCGAC
GCCCTGGCCG CGGCCACCGC CGCCGCGGCC GAGCGGCGCG GCTGGTCCAT GACCGACCTC
GCTGCGGCCA CCACCGACCA GGACGCTGCG CGCCCACGGC TGCTGGCCCC AGTCGATCTC
CAGGTCGTCA AGGCCGCCGG AGTCACCTTC GTGGAGAGCA TGCTGGAGCG GGTGATCGAG
GAGCGCGCTA AGGGCGATGC CGCCCGCGCG GCCGAGATCC GCGCACAGCT CTCGGAGATC
ATCGGGGGGG CCATCTCGTC GGTGCGCCCC GGCAGCGAGA GCGCCGCGCG CGTCAAGGAG
CTGCTGATCG AGGAGGACCT GTGGTCCCAG TACCTCGAGG TCGGCATCGG CCCGGACCCG
GAGATCTTCA CCAAGGCGCC GGTGCTGTCG GCTGTCGGAT CCGGCGTGGA GATCGGCGTG
CACAGCCGCT CGACGTGGAA CAACCCCGAG CCCGAGGTCG TGCTCGCGGT CCGGTCCGAC
GGTCGGGCGA TCGGTGCCAC GCTGGGCAAC GACGTCAACC TCCGTGACTT CGAGGGGCGC
AGCGCCCTGT TGCTCACCGA GGCCAAGGAC AACAACGCCT CGTGCGCACT GGGCCCGTTC
ATCCGTCTCT TCAACGACTC ATTCACGATG GACGACATCC GTCAGATCGA GGTCCAGCTG
ACCGTCACCG GCGTCGACGA CTTCGTGCTC GACGGCCGCA GCTCGATGCG GAACATCAGC
CGTGACCCGG AGGACCTAAT CCATCACGCC TACGGTGACC ACCACCAGTA CCCGGACGGC
TTCATGCTCT TCACCGGCAC CCTGTTCGCT CCCACCCAGG ACCGGTTCGC CGAGGACGCC
GGGTTTACCC ACGTGCTCGG CGACGTCGTC CGCATCGCCA CCCCGCGACT CGGCGCCCTG
GTCAACCGGG TCACCCACTC CGAGCAGGCC CCCCGCTGGG AGTTCGGCAT CCGTGCGCTG
ATGTCCAACC TCAGCGAGCG TGGCCTGCTC GCCCCGTCCG CGCCCGTCCC GGCGCAACTC
TGA
 
Protein sequence
MITPQDLLPE DADQALLVGR VHDPEVGGPC VVVVTGGRAI DISDVAPTVS DLFERDDLLD 
ALAAATAAAA ERRGWSMTDL AAATTDQDAA RPRLLAPVDL QVVKAAGVTF VESMLERVIE
ERAKGDAARA AEIRAQLSEI IGGAISSVRP GSESAARVKE LLIEEDLWSQ YLEVGIGPDP
EIFTKAPVLS AVGSGVEIGV HSRSTWNNPE PEVVLAVRSD GRAIGATLGN DVNLRDFEGR
SALLLTEAKD NNASCALGPF IRLFNDSFTM DDIRQIEVQL TVTGVDDFVL DGRSSMRNIS
RDPEDLIHHA YGDHHQYPDG FMLFTGTLFA PTQDRFAEDA GFTHVLGDVV RIATPRLGAL
VNRVTHSEQA PRWEFGIRAL MSNLSERGLL APSAPVPAQL
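
As a rough cross-check between the two sequence blocks, the following sketch translates DNA into protein with a hand-built codon table. It is a minimal illustration, not the tool used to produce this record. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The `translate` function and the constant names are hypothetical helpers introduced here.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the standard amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGATCACAC CCCAGGATCT GCTGCCTGAG"
print(translate(first_blocks))  # MITPQDLLPE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 400 residues.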