Gene Ent638_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1888 
Symbol 
ID5113523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2047511 
End bp2048491 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content53% 
IMG OID640492077 
ProductNitrilase/cyanide hydratase and apolipoprotein N-acyltransferase 
Protein accessionYP_001176616 
Protein GI146311542 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAC CAGAATTCCT AAAGGCGGCA ACCGTACAGT TTCAGCACCA GGCCAATAAC 
AAAAAATACA ACCTACTGAT AATCGAGAAA TTTATTGAAC AGGCGGCGCT TGAGCAGGTG
AACATTCTCG TTTTCCCTGA AATGTGTATC ACCGGCTATT GGCATGTCCC CAAGCTCACC
GCAGCGGAGG TGTCTGCTCT GGCAGAGCCG ATTGCTGAGA GCCCTTCCCT GACGTTAATT
CGCTCTCTGG CGATAAAACA TCAAATGCTG ATTGGCGTGG GTCTTATTGA AAGAGCCGAC
GATGGCCGCC TCTACAATGC GTATGTTGCC TGTATGCCTG ATGGCACAAT GCATACGCAT
CGAAAACTCC ATGCTTTTGA ACATCCGGCT ATCAGCAGCG GGGACCGCTT TACCGTTTTC
GATACGCCCT GGGGAGTGAA GGTCGGTATC CTGATTTGCT GGGACAATAA TCTGGTCGAA
AATGTGCGCG CGACGGCGCT GCTGGGTGCA GATATTCTTC TCGCACCGCA TCAAACAGGC
GGCACCGATT CCCGTAGTCC TCATGCGATG AAGCCTATTC CGCTGGCGCT CTGGGAAGAG
CGAGAAACGC GCAAGGAAGA AATTACGGCC GCATTCAAAG GGGCAAGCGG TCGCGAATGG
CTGATGAGAT GGCTGCCAGC CCGTGCGCAT GACAACGGAT TATTTCTTCT CTTCAGCAAC
GGTGTCGGTG CAGATGATGA CGAAGTGCGC ACAGGCAATC CGATGATCCT CGATCCTTAC
GGACGCATCA TTAATGAAAC CTGGGCGGCT GACGACGTTA TGGTGAGCGC CGAACTGGAT
TTAAGCCTGC TTGCAATGAG CACCGGACGG CGCTGGATCC ACGGTCGCCG TCCTGATTTA
TACCAGATAC TGACGCAGCC GCAGGGCTAT GAACGTGATG CAATCAGTGC ACGATTTTCG
AATGTGCCCC CTGCTCCCTA G
 
Protein sequence
MNEPEFLKAA TVQFQHQANN KKYNLLIIEK FIEQAALEQV NILVFPEMCI TGYWHVPKLT 
AAEVSALAEP IAESPSLTLI RSLAIKHQML IGVGLIERAD DGRLYNAYVA CMPDGTMHTH
RKLHAFEHPA ISSGDRFTVF DTPWGVKVGI LICWDNNLVE NVRATALLGA DILLAPHQTG
GTDSRSPHAM KPIPLALWEE RETRKEEITA AFKGASGREW LMRWLPARAH DNGLFLLFSN
GVGADDDEVR TGNPMILDPY GRIINETWAA DDVMVSAELD LSLLAMSTGR RWIHGRRPDL
YQILTQPQGY ERDAISARFS NVPPAP