Gene Sare_4226 details

Gene Information

Locus tag: Sare_4226
Symbol:
ID: 5704397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4797767
End bp: 4798930
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 71%
IMG OID: 641273645
Product: IMP dehydrogenase family protein
Protein accession: YP_001538998
Protein GI: 159039745
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0516] IMP dehydrogenase/GMP reductase
TIGRFAM ID: [TIGR01304] IMP dehydrogenase family protein
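
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the final codon is a stop codon that does not contribute an amino acid. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4797767
end_bp = 4798930

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1164
print(protein_length)  # 387
```

Both results agree with the Gene Length and Protein Length fields of the record.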


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.311533
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGACG TGGTCGAGAT CGGGCTGGGC AAGACCGCGC AGCGCGGCTA CCACCTGGAC 
GACATCGCGA TTGTGCCGAG CCGTCGCACC CGGGACGTGG ACGACGTGTC GACAGCCTGG
CAGCTCGACG CGTACCCGTT CGACATTCCC TGCGTCGGCC ACCCCTCCGA CGCGACGATG
AGCCCCGCCT CGGCGGTCCG GCTCAGCCAG CTCGGCGGCC TCGGCGTGCT CAACGTGGAG
GGTCTGTGGA CCCGCTACGA GAACCCGACG AAGGTACTGG AGGAACTGGC CAGCCTGGGC
GTGGACGCCT CGGGCCCGTC ACCACGTACC CCCCGCGCCG CCGCGGCCCG GCCCCGCCAC
ACCCGGCGGC TCCAGGAGGT GTACGCCGAG CCGATCCGCG CGGACCTGAT CGCCGAGCGG
GTCCGAGAGC TGCGGGCCGG CGGTGGGACG GTGGCGGTAC GTGTCTCACC GCAGCACACC
CTGGCGCTCG CCCCGGTGAT CCTCGACGCC GGGGTGGACA TCCTGGTGAT CCAGGGCACC
ATCGTCTCCG CCGAGCACGT CTCCACCACC GACGAGCCGC TGAACCTCAA GGAGTTCATC
GCCGACCTCG ACCTACCGGT GGTCGTCGGC GGCTGCACCG ACTACAAGAC CGCTCTGCAC
CTGATGCGTA CCGGTGCGGC CGGGGTGATC GTCGGTATCG GCGGCGACGA CTGGTCGACC
ACCGAATCGG TGCTGGGGAT CCGGGTGCCG ATGGCCACCG CGATCGCCGA CGCCGCCGCG
GCCCGTCGGG ACTACCTGGA CGAGACCGGC GGCCGGTACG TACACCTGAT CGCCGATGGC
GATATCCGGA CCTCCGGTGA CATTGCCAAG GCGCTCGGCT GCGGCGCCGA CGCGGTGATG
CTGGGCGAGC CGCTCTCGCT GTGCCCCGAG GCGCCGGCCG GTGGCGCCTG GTGGCACTCG
GCCGCCAGCC ATCCAGCTCT GCCCCGGGGC GCCTTCGAGG TCGCCGGAGA GCCGTTCGGC
TCGATGGAAC AGCTGCTGTA CGGACCGGCC GACGAGCCGG ACGGCCAGCT CAACCTCTTC
GGCGGGCTAC GCCGGGCGAT GGCCAAGTGC GGCTACCGTG ACCTCAAGGA GTTCCAGAAG
GTCGGCCTGG TCCTGGACCG CTGA
 
Protein sequence
MRDVVEIGLG KTAQRGYHLD DIAIVPSRRT RDVDDVSTAW QLDAYPFDIP CVGHPSDATM 
SPASAVRLSQ LGGLGVLNVE GLWTRYENPT KVLEELASLG VDASGPSPRT PRAAAARPRH
TRRLQEVYAE PIRADLIAER VRELRAGGGT VAVRVSPQHT LALAPVILDA GVDILVIQGT
IVSAEHVSTT DEPLNLKEFI ADLDLPVVVG GCTDYKTALH LMRTGAAGVI VGIGGDDWST
TESVLGIRVP MATAIADAAA ARRDYLDETG GRYVHLIADG DIRTSGDIAK ALGCGADAVM
LGEPLSLCPE APAGGAWWHS AASHPALPRG AFEVAGEPFG SMEQLLYGPA DEPDGQLNLF
GGLRRAMAKC GYRDLKEFQK VGLVLDR
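
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It is not the method used to generate this record; it is a hand-rolled example that builds a codon table with the standard amino acid assignments (translation table 11 uses the same assignments and differs in its permitted start codons, which does not matter here because the gene begins with ATG), removes the spacing used in the listing, and translates until the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 60 bases of the gene are used to keep the example short.

```python
# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGCGTGACG TGGTCGAGAT CGGGCTGGGC AAGACCGCGC AGCGCGGCTA CCACCTGGAC"

print(translate(first_line))  # MRDVVEIGLGKTAQRGYHLD
print(f"{gc_content(first_line):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same functions should, under the same assumptions, give the complete protein and a GC fraction comparable to the GC content field.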