Gene Arth_1926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1926 
Symbol 
ID4445545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2168946 
End bp2170037 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content62% 
IMG OID639689736 
ProductNitrilase/cyanide hydratase and apolipoprotein N-acyltransferase 
Protein accessionYP_831408 
Protein GI116670475 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.342174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGATC CATTTGTCGT CGTTGCGGTT TCGCCGCGGA CAATCAATGT GAAGAATCCC 
GGCGACGGCG TCGCCAACGT CAAGCGCATC AACGAATTCA TCGACACGGC GGTCATGGTC
GGCGCCTGGG AAGGTTCTCC GGTCAAGCTG GTAGTCCTGC CCGAGATGGC GATCCAGGGC
ATGATGGCCA ACACGCCTGG GAACCGTAAG AAGGAGGCCC ACTTTGCCGT GACGATCCCT
GGTCCGGAGA CGGACGAGCT GGCGAAGAAG GCCGTGGAGC TCAATACCTA CATCGCTGCC
GAGCTGTACA TGGTCAAGGA CGAGGACTTC CCGGACCGCC ACTTCAATGT CGCCTTCATC
ATCGATCCGC AGGGCGAGAT CATCTACAAG CGCTACAAGG CCACCAGTGA TGCCTACGAA
GGAGGCATGC TCGGCAACAT GAACCCGCAC GACGTGTGGG ACGAGTGGAT CGAAAAGAAG
GGAAATGGCA ACGCAATGGA CGCCATCTTC CCTGTGGCTA AGACCGAGAT CGGCAACATC
GGGTACGCCA TCTGCCACGA GGGTGTCTAC CCCGAGGTGC CGCGTGGGCT CGCGATGAAC
GGCGCCGAGA TCATTATCCG GGGCACCCTG ATCGAGCCGG CCGTCCAAAA CGGCATGTGG
GAACTGCAGA ACCGGGCACA CGCCATGTTC AACTCGGCGT ACATCGTCGC TCCGAACCTG
GGGCCCGAAG TCCGCGACGA CGGGAGCATG CAGGACCTGT TCGGCGGCCA GTCCATGATC
GTCGGTCCAC GCGGGCAGAT CCTCACCAAG CAGCAGGGCT GGACCTCGGG CGACTCGTTC
GTCTGCACCA CAATCGACAT CGAAGCGCTC CGCCGGGCCA GGGTCGCCAA CGGCCTGTAC
AACCAGTTCA AGGACCTGCG CACCGAGCAG TACCGGGTCA TCTATGACAA CCCGATTTAT
CCGAAGAACC AGTACCTCGA CGCGCCGCCG AGCGAGGGAT GGCTCGCCCG GGAAGACGCA
ACGCGGGCCG CTAATATCGA GAAACTCATC GAGCGCGGCG TGCTCACACC GCCCTCGGGC
TACAGGGCAT AA
 
Protein sequence
MVDPFVVVAV SPRTINVKNP GDGVANVKRI NEFIDTAVMV GAWEGSPVKL VVLPEMAIQG 
MMANTPGNRK KEAHFAVTIP GPETDELAKK AVELNTYIAA ELYMVKDEDF PDRHFNVAFI
IDPQGEIIYK RYKATSDAYE GGMLGNMNPH DVWDEWIEKK GNGNAMDAIF PVAKTEIGNI
GYAICHEGVY PEVPRGLAMN GAEIIIRGTL IEPAVQNGMW ELQNRAHAMF NSAYIVAPNL
GPEVRDDGSM QDLFGGQSMI VGPRGQILTK QQGWTSGDSF VCTTIDIEAL RRARVANGLY
NQFKDLRTEQ YRVIYDNPIY PKNQYLDAPP SEGWLAREDA TRAANIEKLI ERGVLTPPSG
YRA